|
|
|
|
|
by tome
913 days ago
|
|
I think we use a system with 576 Groq chips for this demo (but I am not certain). There is no DRAM on our chip. We have 220 MB of SRAM per chip, so at 576 chips that would be 126 GB in total. Graphics processors are still the best for training, but our language processors (LPUs) are by far the best performance for inference! |
|