Y
Hacker News
new
|
ask
|
show
|
jobs
by
cloudhan
919 days ago
Might be the training code related with the model
https://github.com/mistralai/megablocks-public/tree/pstock/m...
1 comments
cloudhan
919 days ago
Mixtral-8x7B support --> Support new model
https://github.com/stanford-futuredata/megablocks/pull/45
link
https://github.com/stanford-futuredata/megablocks/pull/45