|
|
|
|
|
by Snuggly73
280 days ago
|
|
Congrats - there is a very small problem with the LLM - its reusing transformer blocks and you want to use different instances of them. Its a very cool excercise, I did the same with Zig and MLX a while back, so I can get a nice foundation, but since then as I got hooked and kept adding stuff to it, switched to Pytorch/Transformers. |
|