| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by m3kw9 954 days ago
	Now do a programming task that requires more than 32k of context and see who’s “better”. If you don’t bench mark that you cannot get an overall pic. GitHub copilot for example could benefit big from the increased context

2 comments

broast 954 days ago

Obviously it's a drawback but the silver lining of the small context window is it forces me to decouple everything and have very sensible and strict api's where I just write the docs and it writes the code.

link

biobootloader 954 days ago

we are working on creating "real world" benchmarks that require a lot of context, and will report when we have results!

link