RecurrentGemma: Moving Past Transformers for Efficient Open Language Models [pdf] (storage.googleapis.com)
6 points by alekandreev 797 days ago
1 comment

Code here: https://github.com/google-deepmind/recurrentgemma

Checkpoints here for both the base pre-trained model and an instruction-tuned (IT) version for dialogue: https://www.kaggle.com/models/google/recurrentgemma