|
|
|
|
|
by dbmikus
660 days ago
|
|
This is very cool! Most of the work I've seen on reducing inference costs has been via things like LoRAX that lets multiple fine-tunes share the same underlying base model. Do you imagine Outerport being a better fit for OSS model hosts like Replicate, Anyscale, etc. or for companies that are trying to host multiple models themselves? Your use case mentioned speaks more to the latter, but it seems like the value at scale is with model hosting as a service companies. |
|
I think both are fits- we've gotten interest from both types of companies and our first customer is a "OSS model host".
Our 40% savings result is also specifically for the 5 model services case, so there could be non-trivial cost reduction even with a reasonably small number of models.