Hacker News new | ask | show | jobs
by anotherpaulg 956 days ago
Same. I am eager to run my code editing benchmark [1] against it, to compare it with gpt-4-0314 and gpt-4-0613.

Edit: Ha, I just re-read the announcement [2] and it says 1pm in the 5th sentence:

  We’ll begin rolling out new features to OpenAI customers starting at 1pm PT today.

[1] https://aider.chat/docs/benchmarks.html

[2] https://openai.com/blog/new-models-and-developer-products-an...

5 comments

I've been able to generate some preliminary code editing evaluations. OpenAI is enforcing very low rate limits on the new GPT-4 model. I will update the results as quickly my rate limit allows.

https://news.ycombinator.com/item?id=38172621

Also, aider now supports these new models, including `gpt-4-1106-preview` with the massive 128k context window.

https://github.com/paul-gauthier/aider/releases/tag/v0.17.0

I'm also eager for you to run your code editing benchmark against it. :)
Hey. Would really love to know the results of your benchmark testing.
"begin".

Other comments says this can take days to get to everyone.

Good find - Looks like I now have access!