by bijant
207 days ago
How does a transcript chronicling some poor guy's descent into AI-induced psychosis make the front page? This is literally (and yes, I know) what's been happening on reddit for months now: "Have I built a perpetuum mobile? GPT-4o seems to think so!" But at least on reddit the comments don't engage with the "substance" of those chat transcripts.
I am not saying that these kinds of transcripts are without value. They clearly demonstrate that even competent engineers can get sweet-talked into (probably out-of-character) actions like "boast about your accomplishments on hn and a CTO will take notice and offer you their job because you are so much more brilliant than them". While I have no idea if "Greg" has people around him to talk to, he clearly has no one who compliments him like this on his php codebase. If he wanted to engage productively with an LLM he could have prompted it to "roast his code", "point out weak points", or "criticize the underlying architecture", but obviously that's not what he wanted or needed. He needed to hear some compliments, the LLM understood that, and the machine complied. Obviously that's not the experience he will get out in the real world. It's more like having a talking blow-up doll compliment you on your lovemaking skills and encourage you to upload a video of the interaction to your favorite tube-site and send the link to all your business contacts to show off your inimitable love-making prowess.
|
Here is almost exactly the same sequence, but with constant instructions to remain brutally honest and objective: https://chatgpt.com/share/691b4035-0ed8-800a-bee3-ae68e2a63c...
It was just late at night, and I wanted to post this chat transcript on HN to share some perspective on what developers are getting from ChatGPT.
I happen to be an expert in the particular area I’m building in.
ChatGPT seems to remember that I am in New York and want “no bullshit” answers. In the last few days it keeps weaving that into most responses.
That fact appears in its memory, which users can access, as does the instruction that it should not, under any circumstances, use emojis in code or comments. It proceeds to use them anyway, so I am not sure how the memory gets prioritized.
Here is the interesting thing. As an expert in the field I do agree with ChatGPT on its statistical assessment of what I’ve built, because it took me years of refinement. I also tried it with average things and it correctly said that they’re average and unremarkable. I simply didn’t post that.
What I am interested in is how to get AI transcripts to be used as unbiased third-party “first looks” at things, such as what VCs would do for due diligence.
This was just a quick thing I thought I’d get a few responses on HN about. I suspect it might have hit the front page because some people dug through the code and saw the value. But you can get all the code for free at https://github.com/Qbix/Platform.
Yeah, there is obviously an element of flattery that people let go to their head. I have had ChatGPT repeatedly confirm the validity of ideas I had in fields in which I am NOT an expert, while pushing back on countless others. I use it as one data point and mercilessly battle-test the ideas and code by asking it to find holes in them from various angles. This particular HN submission, although done very late at night here in NYC, was an interesting mix of genuinely groundbreaking stuff and ChatGPT being able to see the main ideas at a glance and “going wild”. At the same time, if I run it with instructions at the start to “be extremely objective”, it still arrives at much the same assessment in the end.