by RestartKernel 148 days ago
What do the costs look like to run this? I wonder whether you could use this approach within a mixture-of-experts model trained end-to-end as an ensemble. That might take out some of the guesswork as far as the roles go.