by jauntywundrkind
24 days ago
Allen AI (AI2) is doing ridiculously good work, with such a clear focus on enabling others. https://bsky.app/profile/ai2.bsky.social

Their work on SERA (open training, open weights) is fantastic: 40 GPU days to train a competitive model, and one that is also built for further close fine-tuning. Refining and distilling models down, especially for complex codebases, so that the model wants to do the right thing and knows the process you use, has such promise. And it's all done in the open, with so much work put into helping you train or refine a model yourself, at such low cost! https://allenai.org/blog/open-coding-agents

I'm so, so happy AI2 is helping bring up the NSF OMAI compute center, which should give them access to some modern equipment.
https://bsky.app/profile/ai2.bsky.social/post/3mlbihzxsei2a
https://bsky.app/profile/ai2.bsky.social/post/3mlbii3d37t2u

Incredible organization. And such versatility! Earth sensing/geospatial models (MolmoEarth), their own benchmarks such as IFBench for instruction following, MolmoAct for robotics / VLA, and radical new MoE models (EMO):
https://bsky.app/profile/ai2.bsky.social/post/3mm7udixycs2h
https://bsky.app/profile/ai2.bsky.social/post/3ml4pooclic23
https://bsky.app/profile/ai2.bsky.social/post/3mle56nehfz2w