Hacker News new | ask | show | jobs
by kanyethegreat 1210 days ago
I've definitely been feeling deflated. Every angle I've come up with... some project has narrowly shipped before me. And they gain so much popularity so quickly that mindshare per project becomes a power law overnight. It's like the first person to release is at the top of the App Store, and there's no unseating them; the feedback loop is already too powerful. It very much feels like first to release wins. And I swear, the quality of the code I've seen in some of these projects that are getting thousands of stars is abysmal. It's making me think releasing vaporware in order to capture mindshare, then actually building something valuable is the only way to compete.
4 comments

It is not enough to be the first one to ship. I shipped a Mac app that uses whisper to transcribe audio before anyone else, I even implemented a dictation algorithm on top of it. It was my first app, I am not an ios developer. However, after a while, someone with a better track record in app development came up and made a similar stuff. He knows how to build an audience, and he knows how to market an app. As a result, smashed out the app I developed.

So I don't think being the first one is that much important, it is not also important to take care clean coding etc. Someone with better marketing skills probably would ruin every work you did.

Did you build macwhisper?
No, but I was talking about it, so you proved that :)
I looked at your webpage and was underwhelmed, uploading the files for transcription is probably not acceptable for most people.

Then I looked at the macwhisper page and, wow… It does all the stuff.

Guessing they are using whisperX for the timecodes.

I may be out of the loop, but I have seen very little apps with new AI tech get any traction. Or, I've seen them, tried for a few minutes, and then went on with my day. Most of these are just flash in a pan; a cool tech demo or some initial hype from an over-promising landing page, but not many viable businesses. Just someone downloading a model, fine-tuning it on some dataset and trying to pawn it off.

And if some project "narrowly shipped before you", how much value or moat is there really if we're talking multiple projects and they're so quick&easy to do? Sorry to be a bit harsh, I just don't get feeling deflated by others making the same MVP as you do if it's just a natural idea on top of some existing tech. Your focus on code quality is also moot, it's delivering value that counts, not the tech stack beneath.

Isn't this extremely common advice? All the way down to "just do it all manually then build the software" where applicable (obviously not in AI). No one cares if the code is crap if it does what they want and you can always iterate on code after securing a revenue/funding stream.
I would not put so much stock in the first mover effect. I can't bring it to mind immediately, but there was an excellent podcast that brought up how the second movers oftentimes do better in the end, at least as a company.

Case in point -- once upon a time, there was a big race called Dawnbench for training CIFAR10 to 94% accuracy in the shortest amount of time a little while ago by Stanford University. During that time, there was a lot of cool movement, and there were a few notable people who really moved the bar (Chen Wang is underrecognized for their contributions, while David Page is relatively well known for his, which indeed truly are excellent).

I remember reading Page's notes on it and thinking that I could never come up with the caliber of ideas that he brought to the training table for these networks, and plus, 24 seconds on a V100?!?! Crazy.

That was years ago that I saw it. I didn't touch it at all -- not anyone really did, transformers were sorta the big thing now, and still are. And the one or two times I did try to do anything with it...anything I tried made it worse, and I really struggled with his code (it's very functionally-written stylistically, very cool but didn't jive with my rapid experimentation style).

In any case, I thought maybe I could do better though if I really and truly took a cool crack at it. And even if I didn't, I sorta needed a good living resume to prove that I could make a good software project. So I reimplemented it in a more hackable (to me, at least) kind of way ala karpathy's nanoGPT (and was almost way too meticulous with writing, organizing, and documenting my code), reorganized and streamlined a few things, and moved it to a more-accessible-to-me GPU, an A100. ~18.1 seconds or so (17.2 with some other open-source code). So that was the line.

Since then, every single time it feels like I've found all that I can find, there's something else (eventually, at least) waiting behind that wall for me. 18.1 seconds turned to 12.7, which I thought was about as far as I could go. Then 12.7 turned to 12.3, which turned to ~9.91 seconds. Then ~9.91 seconds became, incredibly, ~7.7 seconds or so.

Earlier this week I released an update that brought it to roughly ~6.97-6.99 seconds or so. That is unreal, to me. At first, I was numb to how much things could improve, now I'm sorta in denial. The throughput is totally insane, roughly 88,389 training images through the GPU _every second_. This also means that our step time is roughly ~11.35 microseconds per batch, which is...blistering, to stay the least. Hard really for me to wrap my own head around it.

I'd say from the experience that I've had, I've felt similar feelings to what you've talked about here, especially if someone already with a lot of followers from a more hype point of view does something like glue huggingface code together, make a fancy GIF that's well stylized, and gets a ton of adoration from it.

But that said, the market for quality software is small, and the market for hype is large. Not that the above project doesn't have hype, but it's meant to be more valuable as a researcher's workbench than a toy. It did thankfully get a huge boost early on because Karpathy tweeted it out, but even the last release, for example, maybe got 10 likes on Twitter, and an additional 10-20 (or 30) stars on Github from the sum total interactions (including a Reddit post), even if that.

But! The good thing in some senses is that the people that I get to talk to if I'm proactive, that like this software, are often people who are known or are skilled in their field of work. And I honestly don't have too many warm fuzzies about that from lived experience as that is new to me. But I can say that I appreciate the opportunity.

Everytime I've thought about going down the hype/vaporware road just to get eyes on the project(s) I do, I have to ask myself -- "Do I want these eyes on the project? Do I want this kind of attention from this kind of person to make up most of my interactions and what I am building?"

Sure, if you have to feed a family, that sort of make sense. And we have to feed ourselves and our emotional needs too. But maybe we can be okay with being content with the smaller audience, as it is. At least, that's what I'm working towards, though I do fear that I'll stumble and give in to the allure of chasing the hype every now and again. And if I do, I'm sure that particular extreme emptiness (of a sort) will help pull me back towards just working on being content with the little things I have.

I want to close with a video that was made almost exclusively for you, and would like to ask you to watch it in its entirety if you have the time. It talks about content creation (which is what we do, in a sense), but is taught in a way that is very general and I think is the best take I've ever heard on this topic in a condensed/beginner-friendly way, that I can remember at least.

It should not only help alleviate some of your concerns or negative feelings from the shipping arms-race, it'll give you clarity on good next-step solutions that will help hopefully contextualize and give a good 'path forward' to making software that people like. I really cannot recommend this video enough, the wisdom is simple, practical, distilled, and hard-won (and has certainly helped me, I am glad I got to learn this earlier rather than later): https://youtu.be/lNzWsp5UUPA

Happy to discuss or offer any thoughts on any questions. I do recommend the video first, I often enjoy talking about that kind of particular topic.

Thank you very much for your thought-out response. I really appreciate the effort. I will certainly watch the linked video.