Hacker News new | ask | show | jobs
by gone35 354 days ago
Not noise but an actual byte-by-byte encoding/serialization of the GPT-2 small model weights.