Y
Hacker News
new
|
ask
|
show
|
jobs
by
sdpmas
103 days ago
yes, agreed, modded-nanogpt is already a data-efficient variant of original nanogpt. just that the kinds of algorithms it allows are somewhat constrained because it optimizes for wall clock time.