Yeah, I don't know of anyone who wants to run very large AI models in a Windows environment. Or, frankly, on a laptop. Why not just VPN into a dedicated server?
With BUILD happening tomorrow, I suspect Microsoft is going to have some stuff about local AI there with MS Foundry on Windows/Foundry Local. The timing of this announcement a day before BUILD is obviously intentional.
Suddenly all the Windows K2 stuff makes sense, but I doubt it'll be enough. It's too little, too late for Microsoft.
I do. I can take my laptop anywhere I want, for example to a coffee shop, and run a coding model while eating a croissant without worrying about an internet connection, as the term "local model" implies.
I could be wrong, but my understanding is that 24/7 dedicated servers are wildly economically unviable. The reason cloud tends to cost less than local today (other than the subsidization) is that you aren't running models 24/7. So something like 6 hours of cloud per weekday might beat the yearly cost of building local machines, but it's not in the same universe if you're running 24/7, as evidenced by two months of H200 rental costing more than the DGX Spark this laptop is built out of.
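To make the arithmetic concrete, here's a rough break-even sketch. The hourly rate and hardware price are made-up placeholders, not real quotes, so plug in your own numbers. It also ignores electricity, depreciation, and the fact that rented and local hardware aren't equally fast.

```python
def breakeven_weeks(hardware_price, hourly_rate, hours_per_day, days_per_week):
    """Weeks of rental after which cumulative cloud spend equals the hardware price."""
    weekly_cloud_cost = hourly_rate * hours_per_day * days_per_week
    return hardware_price / weekly_cloud_cost


# Hypothetical placeholder prices, for illustration only.
HOURLY_RATE = 3.0        # assumed cloud GPU rental, dollars per hour
HARDWARE_PRICE = 4000.0  # assumed one-time cost of a local machine, dollars

# 6 hours per weekday vs. running around the clock.
part_time = breakeven_weeks(HARDWARE_PRICE, HOURLY_RATE, hours_per_day=6, days_per_week=5)
always_on = breakeven_weeks(HARDWARE_PRICE, HOURLY_RATE, hours_per_day=24, days_per_week=7)

print(f"6h/weekday: rental matches the hardware price after {part_time:.1f} weeks")
print(f"24/7:       rental matches the hardware price after {always_on:.1f} weeks")
```

Whatever prices you use, the 24/7 case hits break-even 5.6x sooner than 6 hours per weekday (168 rented hours a week vs. 30), which is the whole point: the more hours you actually run, the faster buying wins.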