|
|
|
|
|
by hlfshell
724 days ago
|
|
It started by accident, with the original llama weights being leaked by two separate employees. They've since embraced opening the weights, which I'm all for. As for why? I have a theory: Meta is not in a position to capitalize upon the model itself. Yes, they can use it internally, and maybe their competitors can copy it to - but there are no real competitors to Facebook or Instagram that can benefit from it enough to make it a differentiating facet. Thus, releasing stuff for open source does two things: 1) Make them more attractive to research talent (Apple famously recently started to publish research because their traditional secrecy was causing issues with hiring top talent) and... 2) Continues to undermine the ability to make $$$ off of model alone, driving it towards being a commodity rather than the long term profit engine for other companies. |
|
For example:
Faster R-CNN - state of the art image segmentation, released in 2017.
FastText - text embedding models, 2016.
FAISS - vector DB, 2018.
https://github.com/orgs/facebookresearch/repositories has over 1,000 repos.