But they're not small flaws if you're relying on it to eg replace a person's job.
If i ask it for the dimensions of a product and it gives the wrong figures it should just tell me it doesn't know instead of inventing something.
That's the problem. It doesn't tell you when something is wrong so you can never trust if it's right unless you happen to know the field. That makes it far less useful.
I've had a few instances where it returned bad code or was unable to solve a challenge, however most were fixed by better prompts, or by clarifying prompts. In a way I think there is a two-way "learning" process going on here. I'm training it how to give me what I ask for, and it trains me how to ask for what I want it to give me.
The tricky bit IMO is when you’re at the threshold of being able to identify errors it makes. I tested some situations a while ago where I asked for some physics calculations functions. I’m an experienced programmer but haven’t really done anything with physics since high school 20 years ago. The code returned looked plausible and would run, but going through it line by line and looking up the real formulas it turned out to be super wrong.
Regardless, I find it silly to focus on the small flaws when we're witnessing a foundational shift in what kind of problems we can solve.