Hacker News new | ask | show | jobs
by papersnake 1360 days ago
Have you tested this on big models involving multi-gpu communication, or any plans?
1 comments

For now it's for single GPU inference only.