Hacker News new | ask | show | jobs
by PokeyCat 798 days ago
Would love to experiment with this for work, but the following clause in the license (notably absent in the Llama 2 license) would make this really hard:

> i. If you distribute or make available the Llama Materials (or any derivative works thereof), or a product or service that uses any of them, including another AI model, you shall (A) provide a copy of this Agreement with any such Llama Materials; and (B) prominently display “Built with Meta Llama 3” on a related website, user interface, blogpost, about page, or product documentation. If you use the Llama Materials to create, train, fine tune, or otherwise improve an AI model, which is distributed or made available, you shall also include “Llama 3” at the beginning of any such AI model name.

Really impressive HumanEval results for the 8B model though, would love to plug this into Continue for tab completion since the current benchmark numbers for Llama 3 8B blow every other 8B model out of the water

2 comments

This is the mildest possible clause they could have included short of making the whole thing public domain. Heck the MIT license has similar requirements ("The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.")
deepseek-coder-instruct 6.7B still looks like is better than llama 3 8B on HumanEval [0], and deepseek-coder-instruct 33B still within reach to run on 32 GB Macbook M2 Max - Lamma 3 70B on the other hand will be hard to run locally unless you really have 128GB ram or more. But we will see in the following days how it performs in real life.

[0] https://github.com/deepseek-ai/deepseek-coder?tab=readme-ov-...

With quantized models you can run 70B models on 64GB RAM comfortably.