Hacker News
by ctas, 41 days ago
Can you also share the base model before fine-tuning on tool calls? Might be a great foundation for various fine-tuning jobs.
HenryNdubuaku, 41 days ago
The base model is a Simple Attention Network, a foundation model family we’ve been experimenting with at Cactus.