Hacker News new | ask | show | jobs
by selridge 123 days ago
“Share the prompts”

How would that be feasible for a project of any complexity whatsoever?

2 comments

I'm reminded of something I read recently about disclosure of AI use in scientific papers [1]:

> Authors should be asked to indicate categories of AI use (e.g., literature discovery, data analysis, code generation, language editing), not narrate workflows or share prompts. This standardization reduces ambiguity, minimizes burden, and creates consistent signals for editors without inviting overinterpretation. Crucially, such declarations should be routine and neutral, not framed as exceptional or suspicious.

I think that sharing at least some of the prompts is a reasonable thing to do/require. I log every prompt to a LLM that I make. Still, I think this is a discussion worth having.

[1] https://scholarlykitchen.sspnet.org/2026/02/03/why-authors-a...

This is totally infeasible.

If I have a vibe coded project with 175k lines of python, there would be genuinely thousands and thousands of prompts to hundreds of agents, some fed into one another.

Whats the worth of digging through that? What do you learn? How would you know that I shared all of them?

> I log every prompt to a LLM that I make.

How many do you have in the log total?

I have a daily journal where I put every online post I make. I include anything I send to a LLM on my own time in there. (I have a separate work log on their computer, though I don't log my work prompts.) Likely I miss a few posts/prompts, but this should have the vast majority.

A few caveats: I'm not a heavy LLM user (this is probably what you're getting at) and the following is a low estimate. Often, I'll save the URL only for the first prompt and just put all subsequent prompts under that one URL.

Anyhow, running a simple grep command suggests that I have at least 82 prompts saved.

In my view, it would be better to organize saved prompts by project. This system was not set up with prompt disclosure in mind, so getting prompts for any particular project would be annoying. The point is more to keep track of what I'm thinking of at a point in time.

Right now, I don't think there are tools to properly "share the prompts" at the scale you mentioned in your other comment, but I think we will have those tools in the future. This is a real and tractable problem.

> Whats the worth of digging through that? What do you learn? How would you know that I shared all of them?

The same questions could be asked for the source code of any large scale project. The answers to the first two are going to depend on the project. I've learned quite a bit from looking at source code, personally, and I'm sure I could learn a lot from looking at prompts. As for the third question, there's no guarantee.

I can't reply to the other comment, but here goes:

This is one (1) conversation: https://chatgpt.com/share/69991d7e-87fc-8002-8c0e-2b38ed6673...

It has 9 "prompts" On just the issue of path re-writing, that's probably one of a dozen conversations, NOT INCLUDING prompts fed into an LLM that existed to strip spaces and newlines caused by copying things out of a TUI.

It's ok for things to be different than they used to be. It's ok for "prompts" to have been a meaningful unit of analysis 2 years ago but pointless today.

No the same question CANNOT be asked of source code because it can execute.

You might as well ask for a record of the conversations between two engineers while code was being written. That's what the chat is. I have a pre-pre-alpha project which already has potentially hundreds of "prompts"--really turns in continuing conversations. Some of them with 1 kind of embedded agent, some with another. Some with agents on the web with no project access.

Sometimes I would have conversations about plans that I drop. do I include those, if no code came out of them but my perspective changed or the agent's context changed so that later work was possible?

I don't mean to be dismissive, but maybe you don't have the necessary perspective to understand what you're asking for.

> maybe you don't have the necessary perspective to understand what you're asking for

Please don't cross into personal attack. You're making fine points, and that's enough.

https://news.ycombinator.com/newsguidelines.html

Btw, I think this is a particularly good point: "You might as well ask for a record of the conversations between two engineers while code was being written. That's what the chat is."

That's a good reframing. I can see why it might be impractical to share all of that, hard to make sense of as a reader, and too onerous to demand of submitters.

Since you have experience in this area, I'd like to hear your view on what we could reasonably require submitters to share, given that the flood of generated Github repos is creating a lot of low-quality submissions that don't gratify curiosity and thus don't fit the spirit of either Show HN or HN in general.

Some people would say "just ban them", but I'd rather find a way to adapt to this wave, since it is the largest technical development in a long time, and the price of opposing it is obsolescence.

"maybe you don't have the necessary perspective to understand what you're asking for"

this is in no way a personal attack. It's just a statement that's true. I didn't imply anything about them or their character or limitations, but they might not have the necessary perspective if that's the question they are asking.

I think it's critically important people figure out what they want to learn from what's being shared.

What do you need from submitters here? Even setting aside the burden of supplying it, what do you hope to learn?

> maybe you don't have the necessary perspective to understand what you're asking for.

I disagree. Thinking about this more, I can give an example from my time working as a patent examiner at the USPTO. We were required to include detailed search logs, which were primarily autogenerated using the USPTO's internal search tools. Basically, every query I made was listed. Often this was hundreds of queries for a particular application. You could also add manual entries. Looking at other examiners' search logs was absolutely useful to learn good queries, and I believe primary examiners checked the search logs to evaluate the quality of the search before posting office actions (primary examiners had to review the work of junior examiners like myself). With the right tools, this is useful and not burdensome, I think. Like prompts, this doesn't include the full story (the search results are obviously important too but excluded from the logs), but that doesn't stop the search logs from being useful.

> You might as well ask for a record of the conversations between two engineers while code was being written.

No, that's not typically logged, so it would be very burdensome. LLM prompts and responses, if not automatically logged, can easily be automatically logged.

> LLM prompts and responses, if not automatically logged, can easily be automatically logged.

What will you do with what you’ve logged? Where is “the prompt” when the chat is a chat? What prompt “made” the software?

If you’re assuming that it is prompt > generation > release, that’s not a correct model at all. The model is *much* closer to conversations between engineers which you’ve indicated would be burdensome to log and noisy to review.

I don't know! We'd have to figure something out along the way.

Btw I realize "prompt" is no longer the right word for this but I don't know what a better term would be. "Conversation with LLM" is too clumsy.

The submitter could decide and curate what is useful to be shared, whether that’s exact message logs, or the subagents and skills they think made the difference.
I really have no idea. There's almost no good mapping from conversation to code. Take e.g. https://github.com/Protonk/PAWL

There's...thousands of conversations in dozens of different configurations over months to make that. What...out of that do you want to know? What do you deserve to know by dint of my making a submission?

Those aren't rhetorical. I think if you can answer those two, that's a fair start. Good luck lol.