|
|
|
|
|
by jeffbee
1312 days ago
|
|
It does not sound like a realistic capacity plan. The reason this works in the cloud is the inference can be run in parallel on a huge amount of hardware for a short time. To run those kind of models on your rinkydink computer would take forever. |
|
Searching all of a downloaded copy of Wikipedia wouldn't be that computationally expensive either if the assistant has hot words it picks up to look up.
[0] https://news.ycombinator.com/item?id=32928207