Hacker News new | ask | show | jobs
by ammo1662 752 days ago
Related

https://www.reddit.com/r/LocalLLaMA/comments/1d6f1f3/llama3v...

https://aksh-garg.medium.com/llama-3v-building-an-open-sourc...

1 comments

> Edit (June 2)

> A big thank you to people who pointed out similarities to previous research in the comments. We realized that our architecture is very similar to OpenBMB’s “MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone,” who beat us to the implementation. We have taken down our original model in respect to the authors.

> The link to the original author’s repository can be found here: https://github.com/OpenBMB/MiniCPM-V/tree/main?tab=readme-ov...

> — Aksh Garg, Sid Sharma