by avg_dev 1107 days ago
Wow, thanks again!

I really enjoyed this response and I have learned from it. (Which I guess an LLM could not do while generating something!)

Some references I glanced at (I mostly read the top paras):

- https://en.wikipedia.org/wiki/In_situ

- https://en.wikipedia.org/wiki/Theory_of_mind

I feel like I've developed a better understanding of some of the concepts behind generative ML as it is used in LLMs.

An aside: I took an ML course at a university a few years back (an intro and survey course offered by the CompSci faculty). It was interesting, but difficult for me. I excelled at implementing things with Keras/TF in Python, and I had fun manually implementing a gradient descent algorithm, but a lot of the math, including the multi-var calc, stats, and probability, was quite hard for me to wrap my head around, and I never felt like I got a solid grounding at a meta level in what we were doing or why. I have been reading a bit about LLMs, and I think your post has filled in some of the gaps in what I was really looking to understand.