Hacker News
by codelion 76 days ago
How does it compare to some of the newer MLX inference engines, like optiq, that support turboquantization? https://mlx-optiq.pages.dev/