Hacker News new | ask | show | jobs
by Rudra_Jadhav 8 days ago
Reducing API costs is a massive priority for teams right now. Are you using a smaller model like Llama 3 for the local filtering layer?
1 comments

Yes, gemma was just an example.