Hacker News new | ask | show | jobs
by morph123 928 days ago
Still dont understand why anyone would "pay" for this. The default model will probably be better in pretty much all cases. I guess if someone randomly has some sort of esoteric knowledge? Seems very limited.
4 comments

Not sure how much you've used custom GPTs. They are currently flawed and buggy but I think they hold promise. They combine 3 things:

1. A custom initial prompt (yeah - you can paste this in every time and keep a personal database of special purpose prompts - but it rather kills the convenience and utility)

2. A JSON description of one or more APIs that can be used (not just for retrieveal of information - but potentially to take actions. This aspect is under-explored IMHO)

3. Uploaded documents which (I presume) are encoded into a vector database. This part is currently the most flaky. I was hoping this was a way to get round context-window limits. It has worked very badly so far and I'm not sure I understand why.

If these things start working effectively in combination then there's potentially a "the whole is greater than the sum of it's parts" going on here. Each of these things is fairly useful on it's own but if you combine them and have them on tap - and someone else has put the hours in to making them useful and well-tuned - then it is potentially transformative.

Yes - GPT on it's own is pretty good at a lot of things, but you spend a long time working around it's flaws and blind-spots.

Custom GPTs don't cure all the warts but they are still potentially a force-multiplier.

EDIT - I'm also currently underwhelmed how well the other "special powers" work together. Image generation can't be fed into Image Understandin. Python is hamstrung in various ways. And GPT has no real understanding of how to prompt Dall-E. In fact it's worse at prompting than I am.

These flaws are potentially fixable. I hope.

3. working seamlessly would be a major thing. 1. and 2.? I used both, together, for months, via TypingMind.com (alternative frontend UI, bring-your-own-API-key).

This is not to advertise the service - even though it does deserve it, at the very least for having a perpetual license instead of subscription bullshit - just to say that a) OpenAI is lagging way behind the obvious in everything surrounding the models themselves, and b) the combination of 1. and 2. is convenient, but not ground-breaking. At least, not unless you hook a powerful enough "actuator" via 2. This, I agree, seems underexplored.

(Or underreported? It's an obvious lever to go after, a lot of people have said they're going after it, it's reasonable to expect some spectacular results from this, and yet... nothing. Crickets. Is everyone just staying silent and trying to build a bullshit startup to monetize whatever cool thing they made?)

I guess. I assume the OpenAI vision, in the short term, is for there to be a beefy LLM on top that can call the correct sub LLMs that have more bespoke knowledge about random things, and sort of orchestrate an answer based on that.

Not sure why OpenAI feels the need to pay people for this though, I guess to incentivize use?

They all use a beefy LLM though.
ArxivXplorer works fairly well all considered... Bypasses the PDF and exposes math in a better format to the LLM.
They work OK if you don't set your expectations too high.
Is a custom GPT just a boatload of prompts injected by default?