by anotherpaulg 956 days ago

Same. I am eager to run my code editing benchmark [1] against it, to compare it with gpt-4-0314 and gpt-4-0613.

Edit: Ha, I just re-read the announcement [2] and it says 1pm in the 5th sentence:

  We’ll begin rolling out new features to OpenAI customers starting at 1pm PT today.

[1] https://aider.chat/docs/benchmarks.html

[2] https://openai.com/blog/new-models-and-developer-products-an...

5 comments

anotherpaulg 956 days ago

I've been able to generate some preliminary code editing evaluations. OpenAI is enforcing very low rate limits on the new GPT-4 model. I will update the results as quickly as my rate limit allows.

https://news.ycombinator.com/item?id=38172621

Also, aider now supports these new models, including `gpt-4-1106-preview` with the massive 128k context window.

https://github.com/paul-gauthier/aider/releases/tag/v0.17.0

reitzensteinm 956 days ago

I'm also eager for you to run your code editing benchmark against it. :)

famouswaffles 956 days ago

Hey. Would really love to know the results of your benchmark testing.

ignite2 956 days ago

"begin".

Other comments say this can take days to get to everyone.

tornato7 956 days ago

Good find. Looks like I now have access!
