by lukev
563 days ago
Yeah, but even then they won't describe it using the same sort of language that everyone else developing these things does. How many parameters? What kind of corpus was it trained on? MoE, single model, or something else? Will the weights be available? It doesn't even use the words "LLM", "multimodal", or "transformer", which are clearly the most relevant terms here... "foundation model" isn't wrong, but it's also the most abstract way to describe it.
a) How does it perform on my set of evals?
b) What is the cost/latency of serving it to my consumers?
It shouldn't matter to me how many parameters it has, what corpus it was trained on, or whether it's an LLM, a transformer, or something else.