|
|
|
|
|
by summerlight
1292 days ago
|
|
This is so true. Some folks in Ads also tried to explore using large language models (one example: LLM is going to be the ultimate solution for contextual targeting if it's properly done), but one of the major bottleneck is always its cost and latency. Even if you can afford cpu/gpu/tpu costs, you always have to play within a finite latency budget. Large language model often adds latency by order of seconds, not even milliseconds! This is simply not acceptable. I think Pathways is one approach to tackle this issue at scale by making the network sparsely activated so the computation cost can be somehow bounded based on difficulty of each query. This effectively gives Google knobs for the axis across computational cost and the result quality by limiting the size of network to be activated. If it turns out to work well, then we might be able to see it incorporated to Search in a foreseeable future. |
|