I'm curious how they will get these LLM to work with consumer hardware myself. Is FP8 is the way to get them small?