|
|
|
|
|
by gsuuon
1039 days ago
|
|
Congrats Junru! I'm not on AMD but love seeing progress in this project. Excited for batched inference -- I didn't think it'd be useful for me but I've realized batched inference is also useful for a single user / edge device workload. Btw - I got biased sampling working in ad-llama! Catching up to guidance slowly but surely :) |
|