Hacker News
by cyberninja15 901 days ago
A new post-pretraining method for LLMs with an expansion of Transformer blocks.
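
The submission does not spell out how the expansion works, so what follows is only a toy sketch of one plausible reading: copies of existing blocks are interleaved into the stack, each copy starts as an identity mapping (its output weight is zeroed) so the expanded model initially computes the same function, and only the new copies are marked trainable. The scalar `Block`, the `expand_blocks` helper, and the `group_size` parameter are hypothetical names invented for this illustration, not anything taken from the linked work.

```python
from dataclasses import dataclass, replace
from typing import List


@dataclass(frozen=True)
class Block:
    """Toy stand-in for a Transformer block (assumption: scalar residual form).

    Computes y = x + w_out * relu(w_in * x).
    """
    w_in: float
    w_out: float
    trainable: bool = False

    def __call__(self, x: float) -> float:
        return x + self.w_out * max(0.0, self.w_in * x)


def expand_blocks(blocks: List[Block], group_size: int) -> List[Block]:
    """Insert one identity-initialized copy after each group of original blocks."""
    if group_size <= 0:
        raise ValueError("group_size must be positive")
    expanded: List[Block] = []
    for i, block in enumerate(blocks, start=1):
        # Original blocks are kept as-is and frozen.
        expanded.append(replace(block, trainable=False))
        if i % group_size == 0:
            # Copy the preceding block, but zero its output weight so the
            # residual branch adds nothing and the new block is an identity.
            expanded.append(replace(block, w_out=0.0, trainable=True))
    return expanded


def forward(blocks: List[Block], x: float) -> float:
    """Run the input through every block in order."""
    for block in blocks:
        x = block(x)
    return x


# Hypothetical 4-block base model, expanded with one new block per 2 originals.
base = [Block(0.5, 1.0), Block(-1.0, 0.3), Block(2.0, -0.1), Block(1.5, 0.2)]
grown = expand_blocks(base, group_size=2)
```

Here `forward(grown, x)` matches `forward(base, x)` before any further training, because each inserted block passes its input through unchanged; further training would then update only the blocks whose `trainable` flag is set.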