Could you provide an example of what you're asking for? It appears to work well for me. I would estimate that 80% of the answers are very good, and 10% provide at least a glimpse of the information I was seeking.
One of the first things I asked it to do was to "write a program to compute the rise and set times of a star". It got a couple of pieces right, but the program as a whole was way off the mark. I asked it a few other astronomy-related questions, and it would try to use a library like Astropy but make API calls that don't exist. When I tried things like "don't use Astropy", it would just switch to a different library.
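For reference, the core of that rise/set question is small enough to do with nothing but the standard library. This is only a rough sketch: the function names are my own, it returns local sidereal times rather than clock times (converting those needs the observer's longitude and the date, which I've left out), and it assumes the usual -0.5667 degree allowance for refraction at the horizon.

```python
import math

def semi_diurnal_arc_hours(dec_deg, lat_deg, horizon_alt_deg=-0.5667):
    """Hour angle (in hours) between transit and rise/set.

    Returns None if the star never rises, 12.0 if it never sets.
    """
    dec = math.radians(dec_deg)
    lat = math.radians(lat_deg)
    h0 = math.radians(horizon_alt_deg)
    # Standard relation: cos H = (sin h0 - sin(lat) sin(dec)) / (cos(lat) cos(dec))
    cos_h = (math.sin(h0) - math.sin(lat) * math.sin(dec)) / (
        math.cos(lat) * math.cos(dec)
    )
    if cos_h >= 1.0:
        return None   # always below the chosen horizon altitude
    if cos_h <= -1.0:
        return 12.0   # circumpolar: always above it
    return math.degrees(math.acos(cos_h)) / 15.0

def rise_set_lst(ra_hours, dec_deg, lat_deg, horizon_alt_deg=-0.5667):
    """Local sidereal times (hours) of rise and set, or None if there are none."""
    arc = semi_diurnal_arc_hours(dec_deg, lat_deg, horizon_alt_deg)
    if arc is None or arc == 12.0:
        return None
    # The star transits when local sidereal time equals its right ascension.
    return ((ra_hours - arc) % 24.0, (ra_hours + arc) % 24.0)
```

Calling `rise_set_lst(ra_hours, dec_deg, lat_deg)` gives a `(rise, set)` pair in sidereal hours. Precession, proper motion, and the observer's elevation are all ignored here.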
It seems to do OK on things like the instructions for Fizz-Buzz.