Hacker News new | ask | show | jobs
by ekleraki 1188 days ago
> Ask it to solve novel mathematical and logic problems

How exactly do you define novel tasks and problems?

My brother has had it do his programming homework (he's a teenager), written in Greek mind you and therefore novel task to the first-degree, and it succeeded. Thus it solved a logic task [1].

I have had ChatGPT parse math I had written in LaTeX and reach a correct result, do modifications and so on.

> Further, an AGI should logically attempt to "improve" its understanding of the world during a conversation, yet chat GPT never initiates a topic or asks you anything at all (even to clarify what you mean). It also repeatedly reminds you that it is only a language model.

Anecdata: I asked Bing (GPT4) to answer a few stuff and it asked for clarification twice in a row. The fact that it repeatedly clarifies its prompt is not a limitation of the underlying model as much as it is a limitation of the interface that injects a prompt. Dan and all the other jailbreaks out there bypass that.

[1] https://en.wikipedia.org/wiki/Curry%E2%80%93Howard_correspon...

2 comments

> I have had ChatGPT parse math I had written in LaTeX and reach a correct result, do modifications and so on.

Interesting, didn’t know it could do that.

Wondering if it could also follow along with the lemmas in various papers and reach reasonable conclusions.

It depends, I have used Copilot to modify LaTeX and write segments of my thesis, and it works surprisingly well, though its arguments are not very precise.
https://medium.com/illumination/i-put-chatgpt-to-the-test-he... has some examples of simple maths/logic problems it can't really solve. It'd be good to know how much better chatgpt-4 is at them.