by nikolayasdf123
396 days ago
2GB for the smallest (0.5B) model. It does not make sense for each app to download this. Apple must have plans to pre-load these models at the OS level and expose an SDK so that all apps can call them locally. Exciting times! I opened an issue asking them to confirm this: https://github.com/apple/ml-fastvlm/issues/7
Especially if the API lets app developers load their own LoRA fine-tunes onto the OS-standard foundation models at runtime. Then you could (ideally) have the best of both worlds: fine-tuned, app-specific models with reasonable app sizes.
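
For a rough sense of scale, here is a back-of-the-envelope sketch in Python of how small a LoRA adapter could be compared with shipping full weights. Every number in it (24 layers, hidden size 1024, rank 16, four adapted square matrices per layer, 2 bytes per parameter) is an illustrative assumption, not a figure from FastVLM or from any Apple API, and the helper names are made up for this example.

```python
def lora_param_count(d_in: int, d_out: int, rank: int) -> int:
    """Parameters in one LoRA pair: A is (rank x d_in), B is (d_out x rank)."""
    return rank * (d_in + d_out)


def adapter_size_mb(num_layers: int, matrices_per_layer: int,
                    d_model: int, rank: int, bytes_per_param: int = 2) -> float:
    """Approximate adapter size in MB, assuming every adapted matrix
    is square (d_model x d_model) and stored at bytes_per_param each."""
    params = num_layers * matrices_per_layer * lora_param_count(d_model, d_model, rank)
    return params * bytes_per_param / 1e6


# Hypothetical model shape, chosen only to illustrate the arithmetic.
size_mb = adapter_size_mb(num_layers=24, matrices_per_layer=4, d_model=1024, rank=16)
print(f"adapter size: {size_mb:.1f} MB")
```

Under those assumptions the adapter comes out at roughly 6.3 MB, a few hundred times smaller than a 2GB download per app. Real adapters would vary with rank and with which layers get adapted.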