Universal LLM Deployment Engine with ML Compilation

Y	Hacker News new \| ask \| show \| jobs

	Universal LLM Deployment Engine with ML Compilation (blog.mlc.ai)
	17 points by ruihangl 740 days ago

7 comments

zhye 740 days ago

Glad to see MLC is becoming more mature :) I can imagine the unified engine could help build agents on multiple devices.

Any ideas on how those edge and cloud models collaborate on compound tasks (e.g. the compound ai systems: https://bair.berkeley.edu/blog/2024/02/18/compound-ai-system...)

link

ruihangl 740 days ago

A unified efficient open-source LLM deployment engine for both cloud server and local use cases.

It comes with full OpenAI-compatible API that runs directly with Python, iOS, Android, browsers. Supporting deploying latest large language models such as Qwen2, Phi3, and more.

link

yongwww 740 days ago

The MLCEngine presents an approach to universal LLM deployment, glad to know it works for both local servers and cloud devices with competitive performance. Looking forward to exploring it further!

link

neetnestor 740 days ago

Looks cool. I'm looking forward to trying building some interesting apps using the SDKs.

link

CharlieRuan 740 days ago

From first-hand experience, the all-in-one framework really helps reduce engineering effort!

link

cyx6 740 days ago

AI ALL IN ONE! Super universal and performant!

link

crowwork 740 days ago

runs on qwen2 on iphone with 26 tok/sec and a OpenAI style swift API

link