by jauntywundrkind 24 days ago
Allen AI (ai2) is doing ridiculously good work, with such a clear focus on enabling others. https://bsky.app/profile/ai2.bsky.social

Their work on SERA (open training, open weights) is fantastic: 40 GPU days to train a competitive model that is also built for further close fine-tuning. Refining and distilling models down, especially for complex codebases, so that the model wants to do the right thing and knows the process you use, has such promise. And it's all done in the open, with so much work to help you train or refine a model yourself, at such low cost! https://allenai.org/blog/open-coding-agents

I'm so so so happy AI2 is helping bring up the NSF OMAI compute center, with some modern equipment they'll have access to. https://bsky.app/profile/ai2.bsky.social/post/3mlbihzxsei2a https://bsky.app/profile/ai2.bsky.social/post/3mlbii3d37t2u

Incredible company. And such versatility! Earth sensing/geospatial models (MolmoEarth), their own benchmarks such as IFBench for instruction following, MolmoAct for robotics / VLA, and radical new MoE models (EMO). https://bsky.app/profile/ai2.bsky.social/post/3mm7udixycs2h https://bsky.app/profile/ai2.bsky.social/post/3ml4pooclic23 https://bsky.app/profile/ai2.bsky.social/post/3mle56nehfz2w