| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ssgodderidge 954 days ago
	Genuine question - is it more valuable to spend time ramping up on the ML basics, or to spend time building apps on top of the LLMs of the world?

13 comments

faeriechangling 954 days ago

Unless something I can’t predict changes, using LLMs for EVERYTHING smacks of bloat. They’re just not very efficient at many tasks. I don’t know why in a world where people complain relentlessly about bloat in web development, how it could possibly be right for everybody to just use something as heavyweight as LLMs.

girvo 953 days ago

Because while you and I rightfully complain about web development bloat, normal people who use the products made with those tools don't care, as long as it does what they need the product to do

thomasskis 953 days ago

How about using an LLM to help you write a MuZero-like model designed for a specific task? (Also MuZero took like 12 hours to train on old hardware, so my MacBook might be good enough here) obviously we’re not here yet, but it doesn’t seem far away. Hell, you could train a small LLM specifically just to do this.

whartung 954 days ago

I don’t know, it’s not my field, but the applicability of an LLM in a lot of current AI fields such as motion control, vision, planning, etc. That is classic AI stuff, I don’t think LLMs are appropriate yet.

Too big, too slow, too much resources etc. And it’s not even clear to me (mind, who is ignorant) that the LLM is some generic model suitable for all AI like tasks.

Making a big splash right now to be sure, but seems to me there’s still room for the core concepts folks have been working on for a long time.

orochimaaru 954 days ago

Yes. You need traditional ML to detect, predict, cluster, etc. Then you can layer on LLMs for reasoning on this (provided you have existing documented reasoning on such predictions).

Edit: At least that’s what I’m doing. I could be wrong though.

imjonse 954 days ago

It's almost like asking whether it's more valuable to learn about networking, http, html, js, databases or start building apps on top of nextjs.

raincole 953 days ago

The answer is the latter.

RegW 953 days ago

Wot? Even if you are building a reverse proxy?

mnky9800n 953 days ago

microsoft frontpage does everything i need.

mnky9800n 953 days ago

i dont really understand what an LLM is going to do for you in probably 99% of all problems that have an ML solution except maybe write a block of code using sklearn or whatever to get you started. There are several reasons for this:

1. there is no reason to pay the api costs for an LLM to ingest data for you and do something with it when basically all it will be doing is writing the python codes for you that you will eventually be using

2. the LLM doesn't represent some sort of conceptual understanding of whatever you are trying to do to solve your ML problem, so you can't rely on it to be clever and answer questions or brain storm new ideas

3. even if you have a reason to use an LLM in some data processing pipeline it will only be one stop on the information super highway you are trying to create. you probably are going to use it to do something, but you probably also are going to be doing other things (e.g., image segmentation, time series analysis, etc.).

LLMs are great. but they are really just like, one more tool to have, they aren't the only tool.

tayo42 954 days ago

I just have a hobbyist curiosity in the area, I think learning ml basics removed alot of the magic of ai hype, I at least like to think I'm less susceptible to hype and bs. I feel like I have a good idea what is going on and what the limits are of models and how to use them. There is a bigger world of ai besides llm

Depends on your goals though, using llm just seems like using any other api to me.

jononor 953 days ago

Definitely read up on ML basics if you are going to build anything. And by ML basics I mean the concepts of supervised/unsupervised /reinforcement learning - and in particular model evaluation. It is absolutely critical for any ML-based system to have a structured way to judge how well a system works: whether it works well enough (according to some target), or which of two alternatives is better, and that the nature of the errors are not critical or maybe some of them can be mitigated etc.

bmitc 954 days ago

My personal approach, as someone not in the field but who has tangential interest (perhaps intellectual and not industrial interest) in it, my approach is to start from scratch, implementing the basic ideas in my favorite languages with no supporting libraries. I personally like this approach, but I doubt it gets anyone a job. So it just depends on your goals.

GaggiX 954 days ago

LLMs are simply too overkill and resource-intensive for many tasks.

pjmlp 954 days ago

I think it is more relevant to learn how the technology that will take over most junior tasks works, before it takes over everything and the only job left is being LLM Architect, like in those offshoring projects, the difference being those offshore folks aren't there any longer.

sangnoir 953 days ago

The answer to your question depends on whether you think LLMs will remain the state-of-the art solution in the domains you are interested over the next few years/decades/timescale you care about.

poulpy123 953 days ago

It's like asking if it's better to learn to code or to write on MS words. It entirely depends on your use case

esafak 954 days ago

If you need motivation, build an app. If not, study the theory, and implement basic models like linear regression yourself.

jononor 953 days ago

Depends. What is your goal?