Hacker News new | ask | show | jobs
by timr 827 days ago
Oh, I'm not saying there's no value, just that I don't think the value is nearly the magnitude being hyped, and certainly not for the "speeding up the junior to senior transition" posited by the comment at the top of the thread. And sure, every teacher is wrong at some rate -- but the way we deal with that is by thinking for ourselves, asking lots of teachers, working out the differences, etc. This inherently takes time.

Pick some domain that you know nothing about, and ask a transformer model to solve a known problem in the space. It will give you a reply. Is the reply correct? Assuming that you even know how to ask the right question to get a sensible answer (which itself requires expertise), assessing the quality of the answer certainly requires expertise that you don't have. So either you figure it out for yourself (as slow as learning from any other source), or you take it on blind faith.

If I had to wager on the area where I think these models are going to lead to big changes, it's reading and summarization, not generation. "Describe how node deletion in b-trees works in 500 words" is a heck of a lot more useful than asking a transformer to write code to implement node deletion in a b-tree.

1 comments

I can't really speak for previous posts, but I've personally seen a huge uptick in how fast I can move through material using LLMs - not because I'm asking them how things work or to explain things to me, but because they remove the slog of memorizing ancillary boilerplate and help me find the terms I need to search the docs for.

These are very specific things that you can't get as efficiently elsewhere. Essentially, it's not so much about the language model outputting information that I can understand, but more about the language model being able to parse MY queries and respond with something that's kinda good-enough.

Say you're looking at a chunk of code you didn't write, and you want to know what function foo_man_chu() is doing. You could go straight to the documentation for that function, but in many cases it's interacting with a bunch of other systems that are documented elsewhere, and not referenced in that material. If you ask GPT to explain it to you, it pops out a (reliable enough) list of things to RTFM on. This takes what could be a couple hours deep-reading through stack / google and distills it down to like ten minutes.

This effect is magnified the less you know about the domain, if you do it right.

Multiply that by the amount of times you have to do this in the course of teaching yourself something and it adds up quite a bit. This may be what the parent is talking about.