Hacker News new | ask | show | jobs
by tikkun 563 days ago
It sounds like you want more broad stuff, not necessarily learning how to train models. More like learning to use them and how they work.

https://news.ycombinator.com/item?id=36195527 and

Hacker's Guide to LLMs by Jeremy from Fast.ai - https://www.youtube.com/watch?v=jkrNMKz9pWU

State of GPT by Karpathy - https://www.youtube.com/watch?v=bZQun8Y4L2A

LLMs by 3b1b - https://www.youtube.com/watch?v=LPZh9BOjkQs

Visualizing transformers by 3b1b - https://www.youtube.com/watch?v=KJtZARuO3JY

How ChatGPT trained - https://www.youtube.com/watch?v=VPRSBzXzavo

AI in a nutshell - https://www.youtube.com/watch?v=2IK3DFHRFfw

How Carlini uses LLMs - https://nicholas.carlini.com/writing/2024/how-i-use-ai.html

For staying updated:

X/Twitter & Bluesky. Go and follow people that work at OpenAI, Anthropic, Google DeepMind, and xAI.

Podcasts: No Priors, Generally Intelligent, Dwarkesh Patel, Sequoia's "Training Data"

1 comments

For Bluesky, there's a Starter Pack consisting of only Google DeepMind employees. Seems like a good place to start on Bluesky: https://bsky.app/starter-pack/sharky6000.bsky.social/3l7kt6x...
P.S. Just noticed there's also one for xAI: https://bsky.app/starter-pack-short/BYkRryU