Hacker News new | ask | show | jobs
by aero142 1247 days ago
I've been asking friends in non-programming engineering fields how ChatGPT does in their area of expertise, and I believe programming is the area that ChatGPT is the most accurate. Finding solution to general engineering problems seems blatantly wrong in almost all cases, whereas in programming, it seems to be able to generate mostly correct code for simple, boiler-plate like tasks.
2 comments

why is "mostly correct" ok for programming? also, i don't believe that good programmers want to have boiler-plate in their code.
Because that's what most programmers achieve too.

You can iterate from there by taking advantage of the last 50 years of software engineering wisdom.

yes, but why? Why is GPT so much better at programming than other tasks?

can it be that programming itself can be so easily predicted in a generative way, while others require more ingenuity and real world model to be solved?

In this case I would totally offload programming to a GPT /LLM AI, while my job is simply to specify largely the business case.

I have to imagine its because so much of its training data is readily available programming docs, tutorials, and general Q&A that there is an amazing abundance of online. How many times have you just pasted an error into google and hoped someone else has asked the exact same question on stack overflow?
True. Also there's a lot of commented open-source code out there.
As Chomsky points out, the AIs understand syntax not semantics. Code is all (or almost all) syntax whereas human language has both syntax and meaning. Meaning is not something ChatGPT understands.
Thanks, that is the answer!

The meaning of a programming language is its syntax.

Is it because programming is a more limited and specific language than the ones people speak? There's less room for double-meanings, slang, meaning, or even sentence structure.
It seems the improvement in the models is the size of the training set. Hence the "Large" in large language models. I am assuming that it is better at programming because there a is a ton of training data available on Github, etc. There isn't a similar set of data on solving a variety of physics problems.