2) M3 Ultra can load Deepseek R1 671B Q4.
Using a very large LLM across the CPU and GPU is not new. It's been done since the beginning of local LLMs.