Hacker News new | ask | show | jobs
by davidatbu 29 days ago
I can spare a minute :). This isn't exhaustive because this is just stuff I know of, obviously.

- At Stanford, Led research on the first (to my knowledge) crop of joint image/text models. Super widely cited work.

- At Tesla, led their whole self driving effort for a while, came up with critical techniques that allowed them to make progress (e.g., the concept of "auto labelling": using a much larger NN to generate training data with which to train smaller models that could fit in the on-device compute. IIRC, Elon said they would not have been able to make progress without this insight).

I'm not sure his educative efforts for the mold of what you're looking for, but if so, the course he designed at Stanford (and availed online):for neural networks, as well as his blog posts, (most famous of which, to my knowledge, is "the unreasonable effectiveness of LSTMs"), made a huge impact on educating a generation of tinkerers and researchers.

7 comments

The auto labeling work (which has been partially described/presented at Tesla AI day events) seems more like engineering than research, a grab bag of techniques that I would guess the whole team must have contributed to. For example, they auto label low resolution/indeterminate objects (image segments) by temporal continuity... Something that is a low-res blob in the distance becomes a hi-res and easy to identify object when you drive by it, so by tracking objects backwards across frames you can learn how to more confidently label the lo-res blob. Things like this are useful, but it's the sort of stuff that engineers and developers are coming up with every day.
Not back in 2016.
You don't think that tracking objects from frame to frame is obvious ?!

I can guarantee you this was built-in from day #1

I'm guessing you're not a developer if you don't then automatically think of end cases like "what if car # 1 isn't in the preceding frame" ... (then you look at some relevant test data and see it was there, unlabelled ...)

Obvious in hindsight and obvious at the time are very different things.

You seem to have missed the main point anyway - using a larger model to generate labels for a smaller one is what the parent was highlighting, not the temporal labeling alone. The gold standard at the time was human labeling (eg Waymo). Deep learning was just having its moment, all of this stuff was cutting edge, and there is a lot of work in between a published paper and actually applying that to production vehicles.

Yes, the automated labelling (which replaced a large team they had doing manual labelling) that Tesla implemented consisted of a bunch of different things.

Generating a training set, training on it, and then inferencing on the trained model are three different things.

1) Generating the auto-labelled training set was of course done on Tesla's supercomputer, based on data from 1000s of cars.

2) Using the generated training set to train the in-car model would also be done offline.

3) The trained (and tested) model is then deployed to the car and used by the vision system to label image segments ("stop sign", "cyclist" etc).

How could this be divided up any other way?!

Karpathy seems like a great guy, but honestly there seems to be little to nothing in his background that makes him stand out as an architecture guy or being very creative. Maybe his thesis on image captioning is his most creative work, but at the end of the day this consisted of feeding the output of a CNN into an LSTM, conceptually very similar to the way language translation was being done at the time by feeding the output of an encoder LSTM for language A into a decoder LSTM for language B, except Karpathy was using an image encoder (off the shelf CNN) since he wanted to describe (caption) images. It was certainly at least somewhat innovative at the time, but what he was really famous/popular for at Stanford was for teaching the CS 231n class on using CNNs, and this is what he continues to be best known for - explaining how things work.

Karpathy is also badmephisto, a name you might have heard of if you're into cubing.

http://badmephisto.com/

You're thinking of the unreasonable effectiveness of RNN's.
Tesla still hasn't achieved their 2016 self-drive goal by their self imposed deadline of 2017, even now a decade later. So, politely, is that accolade merited?
The current vehicles sure seem to come close. I'm not entirely clear on how they've missed this goal, but the current models can do full self driving where I live, including parking.
Sure they have improved but how do we define success? Is success "It can drive a road it has never been on?" Even then I'm not sure because the model (not the physical car) has probably scanned that road before so it is recalling a prior route while being aware of hazards. Is that learning, or rote memorization?
A Tesla drove coast to coast on full autopilot.

My Tesla drives to walmart, finds a parking spot, comes to me outside walmart and drives me home. I've been driving my model 3 for years, and honestly, i've never had to "Take over" due to a saftey issue.

I could never trust a Tesla to drive safely around people. They seem like death traps. Could you share a link to the coast to coast drive please? How aided was it?
https://x.com/karpathy/status/2006436622909452501

How old are you if you don't mind me asking?

>I could never trust a Tesla to drive safely around people. They seem like death traps.

Have you ever been in a Tesla? It's literally been rated the safest car in America since it's inception.

That's just a man standing by his car. I'm asking for video proof, I'm sorry that wasn't clear off the bat. I also abstain from X due to Elon's track record, so I'm not going to keep searching there for it. Could you please tell me how to self serve on this?

I've driven a Tesla on and off for about 4 years, and I'm thankful to never do so again.

> death trap

https://apnews.com/article/tesla-crash-doors-musk-regulators...

> It's literally been rated the safest car in America since it's inception.

I'm locating our disconnect a bit better: something can be marked "safest car" for road tests but still be rife with issues, like its obnoxious UI choices, etc.

https://www.tesladeaths.com

> also abstain from X due to Elon's track record

I can just tell you hate eveything Elon, so this is a pointless conversation. That was a post from Karpathy, who we are all talking about in the thread, so i thought it was the most pertinent. I'm sure you can google it, it's proven, so no point arguing that it didn't happen.

Obviously, since you can't even use X out of your hate for Elon, There's no way you have "Driven a Tesla on and off for four years". Thats just a lie. NHTSA has given every vehicle a 5/5, and model 3 is "The top saftey pick" of all cars for their crash test results.

The safety comes from the inherent electric drives. They are much less likely to flip and much less likely to catch fire.

from your Tesla Deaths, 772 deaths over hundreds of billions of miles is absolutely incredible. Do you have any data to share on Fords mile to death ratio? Do you offer any comparisons? Or are you still just hating elon, for being elon?

Edit: Also, have you looked through the Tesla Deaths that you posted? A drunk driver is involved in alot of those, at no fault to the Tesla. One of the largest "Tesla Deaths" was someone driving on the wrong side of the freeway and they crashed into the tesla killing a whole family in the Tesla. How on earth are you using this slop as evidence that Tesla's are unsafe.... That's not ignorance, that's actually just evil...

How does Elon's arbitrary deadlines impact whether the accolade is "merited"? Incredible progress was made in a fairly short amount of time. His accolade isn't based on his employer's ability to predict delivery dates, they're based on the quality of the systems that are actively deployed today.
I think an accolade's merit is based on the definition of done for work delivered. Elon certainly told the public a certain vision of self-driving (a definition of done) and it didn't come to fruition despite PR progress; i.e. a washing machine can do a lot of work, but is it the right work?

We can arbitrate about what "self-driving success" means until the cows come home, but my point is I've seen a lot of self-driving failures from the Teslas I've witnessed in person.

Thank you!

I was more looking for signal that him + Anthropic might yield something beyond a step-change from Opus 4.7 (disappointing so far). We have not gotten to use Mythos yet, I wonder if that will become Opus 5 or something.

It wasn't LSTMs, it was RNNs.
Thanks for correcting the title I misremembered. Fwiw, the article did culminate with LSTMs: https://karpathy.github.io/2015/05/21/rnn-effectiveness/

---------------------

EDIT: It looks like you deleted the part of your post I quoted below. So feel free to ignore my question about it, I guess.

---------------------

Not sure what you mean by

> Shows how much you know

Do you mean that the fact that I misremembered a word on the title suggests that I know very little about Karpathy's contributions to the field of neural networks?

Add microgpt to that list