Hacker News
by peter_retief 1710 days ago
500 million parameters seems like a lot. Aren't there duplications or redundancies that could be removed to reduce the parameter count? One could also use batches of data. Seems very expensive!