Y
Hacker News
new
|
ask
|
show
|
jobs
by
summarity
1059 days ago
If you're running on Ampere, using llama.cpp is probably not ideal. While it's optimized for ARM, Ampere has native acceleration for workloads like this:
https://cloudmarketplace.oracle.com/marketplace/en_US/adf.ta...