Hacker News new | ask | show | jobs
by countWSS 916 days ago
Its a new LLM type: instead of transformers it use state-space machines, which are orders of magnitude faster. Its currently very new and less coherent than GPT-2.
1 comments

? its better than GPT 2 for sure...