Hacker News
by ctas 41 days ago
Can you also share the base model before fine-tuning on tool calls? Might be a great foundation for various fine-tuning jobs.
1 comment

The base model is a Simple Attention Network, a foundation model family we’ve been experimenting with at Cactus.