| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by xfalcox 871 days ago
	Context is 4096? My app db DDL is 19877 tokens (using Llama2 tokenizer) long, so that means we need to do a RAG for handling the DDL prompt injection. A model like this with a 32k long seq_len, like Mixtral, would be a killer for me.

1 comments

Great call out. Will definitely focus on that in the next iteration!