Hacker News new | ask | show | jobs
by geniium 733 days ago
I was having a look at the model mentioned, specifcially `casperhansen/llama-3-70b-instruct-awq`.

When checking this model, I found out [1] it's based on llama-2 ?

``` Expand Llama 3 70B Instruct AWQ Parameters and Internals LLM Name Llama 3 70B Instruct AWQ Repository Open on Base Model(s) Llama 2 70B Instruct quantumaikr/llama-2-70B-instruct Model Size 70b ```

I added a question [2] on Hugging Face to learn more about this.

Anyone could explain to me what this means? Does it mean that it has been trained on the version 2 and wrongly named version 3? Or is it something that is not well intended?

[1] https://llm.extractum.io/model/casperhansen%2Fllama-3-70b-in...

[2] https://huggingface.co/casperhansen/llama-3-70b-instruct-awq...

1 comments

I don't know this site that you're citing there but it's clearly wrong.

Go look at the model config, you can clearly see it's Llama 3.