Thanks! It looks like the ASIC inference space (if we can it that) is getting more popular. There is also https://www.etched.ai/ that I recently saw.
I didn't follow asic mining during the bitcoin bubble but I have the impression it was the way to go for mining. I don't see why that wouldn't be true for inference, a long as one is ok being limited in flexibility and wed to a particular architecture.
Also as the other comment mentioned, https://positron.ai seems to be live now.