Hacker News new | ask | show | jobs
by avgbusinessuser 725 days ago
great series of posts, i went down a similar path recently for a slightly different use case - i did not use axolotl though, i was worried about missing out on understanding some details due to potential abstractions. it's great to see documentation on how others tackle similar problems, i documented the process i went through here - https://atredis.com/blog/2024/6/3/how-to-train-your-large-la...
1 comments

There's a ton of abstraction in axolotl, for sure, but so far I haven't found that it gets in the way. The main competitor in that space seems to be Unsloth, but that only works with a single GPU machine, so didn't fit my purposes. I'll dive into your blogpost. Thanks for posting!
I used unsloth, I was only using a single GPU for testing - looking forward to follow up posts.