Hacker News new | ask | show | jobs
by ssivark 1155 days ago
The original “pre-training” Is what’s expensive. The “fine-tuning” (also training that it modifies network weights) for instruction following or other tasks costs the thousand dollar range.