Unless something I can’t predict changes, using LLMs for EVERYTHING smacks of bloat. They’re just not very efficient at many tasks. I don’t know why in a world where people complain relentlessly about bloat in web development, how it could possibly be right for everybody to just use something as heavyweight as LLMs.
Because while you and I rightfully complain about web development bloat, normal people who use the products made with those tools don't care, as long as it does what they need the product to do
How about using an LLM to help you write a MuZero-like model designed for a specific task? (Also MuZero took like 12 hours to train on old hardware, so my MacBook might be good enough here) obviously we’re not here yet, but it doesn’t seem far away. Hell, you could train a small LLM specifically just to do this.
I don’t know, it’s not my field, but the applicability of an LLM in a lot of current AI fields such as motion control, vision, planning, etc. That is classic AI stuff, I don’t think LLMs are appropriate yet.
Too big, too slow, too much resources etc. And it’s not even clear to me (mind, who is ignorant) that the LLM is some generic model suitable for all AI like tasks.
Making a big splash right now to be sure, but seems to me there’s still room for the core concepts folks have been working on for a long time.
Yes. You need traditional ML to detect, predict, cluster, etc. Then you can layer on LLMs for reasoning on this (provided you have existing documented reasoning on such predictions).
Edit: At least that’s what I’m doing. I could be wrong though.
i dont really understand what an LLM is going to do for you in probably 99% of all problems that have an ML solution except maybe write a block of code using sklearn or whatever to get you started. There are several reasons for this:
1. there is no reason to pay the api costs for an LLM to ingest data for you and do something with it when basically all it will be doing is writing the python codes for you that you will eventually be using
2. the LLM doesn't represent some sort of conceptual understanding of whatever you are trying to do to solve your ML problem, so you can't rely on it to be clever and answer questions or brain storm new ideas
3. even if you have a reason to use an LLM in some data processing pipeline it will only be one stop on the information super highway you are trying to create. you probably are going to use it to do something, but you probably also are going to be doing other things (e.g., image segmentation, time series analysis, etc.).
LLMs are great. but they are really just like, one more tool to have, they aren't the only tool.
I just have a hobbyist curiosity in the area, I think learning ml basics removed alot of the magic of ai hype, I at least like to think I'm less susceptible to hype and bs. I feel like I have a good idea what is going on and what the limits are of models and how to use them. There is a bigger world of ai besides llm
Depends on your goals though, using llm just seems like using any other api to me.
Definitely read up on ML basics if you are going to build anything. And by ML basics I mean the concepts of supervised/unsupervised /reinforcement learning - and in particular model evaluation. It is absolutely critical for any ML-based system to have a structured way to judge how well a system works: whether it works well enough (according to some target), or which of two alternatives is better, and that the nature of the errors are not critical or maybe some of them can be mitigated etc.
My personal approach, as someone not in the field but who has tangential interest (perhaps intellectual and not industrial interest) in it, my approach is to start from scratch, implementing the basic ideas in my favorite languages with no supporting libraries. I personally like this approach, but I doubt it gets anyone a job. So it just depends on your goals.
I think it is more relevant to learn how the technology that will take over most junior tasks works, before it takes over everything and the only job left is being LLM Architect, like in those offshoring projects, the difference being those offshore folks aren't there any longer.
The answer to your question depends on whether you think LLMs will remain the state-of-the art solution in the domains you are interested over the next few years/decades/timescale you care about.