Hacker News new | ask | show | jobs
by xfalcox 871 days ago
Context is 4096? My app db DDL is 19877 tokens (using Llama2 tokenizer) long, so that means we need to do a RAG for handling the DDL prompt injection.

A model like this with a 32k long seq_len, like Mixtral, would be a killer for me.

1 comments

Great call out. Will definitely focus on that in the next iteration!