|
|
|
|
|
by dnhkng
94 days ago
|
|
I stick with models I can run on VRAM, but DeepSeek Speciale have the best reasoning capabilities of the models I can actually run (https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Speciale). What hardware can you access? I have Deepseek etc, but inferencing on DDR5 would take about 2-3 weeks for a simple scan. I think this works best with dense models, but it also seems ok with MoE. @everyone: Can someone hook me up with Nvidia sponsorship? |
|
but yeah on demand would be a lot of ssd churn so id just do it for testing or getting some hidden state vectors.