Y
Hacker News
new
|
ask
|
show
|
jobs
user:
kashifr
created:
2011-03-11
karma:
750
https://github.com/kashif https://twitter.com/krasul
submissions:
Carbon: Autoregressive Genomic Foundation Model
7 points
|
1 comments
The ultimate guide to RL environments: building and scaling them in the LLM era
7 points
|
0 comments
Distilling 100B+ Models 40x Faster with TRL
13 points
|
0 comments
0 points
|
0 comments
Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries
2 points
|
0 comments
Transformers V5 is out!
10 points
|
0 comments
The Smol Training Playbook: The Secrets to Building World-Class LLMs
265 points
|
19 comments
Unlocking On-Policy Distillation for Any Model Family
6 points
|
1 comments
Transformers 4.55 New OpenAI GPT OSS
2 points
|
1 comments
Smollm3: Smol, multilingual, long-context reasoner LLM
388 points
|
79 comments
0 points
|
0 comments
Epic vs. Apple
7 points
|
0 comments
AIMO (AI Math Olympiad) progress prize winning solution
9 points
|
0 comments
0 points
|
0 comments
MaPO: A reference-free alignment technique for diffusion models
2 points
|
1 comments
0 points
|
0 comments
OpenHermesPreferences: Dataset of ~1M AI preferences from teknium/OpenHermes-2.5
7 points
|
1 comments
HuggingFace Training Cluster as a Service
101 points
|
45 comments
HuggingFace 235M series D at a $4.5B valuation
3 points
|
0 comments
Fine-tune Llama 2 with DPO
3 points
|
0 comments
0 points
|
0 comments
QLoRA 4-bit finetuning of LLMs
7 points
|
1 comments
0 points
|
0 comments
0 points
|
0 comments
StackLlama: A hands-on guide to train LlaMa with RLHF
165 points
|
38 comments
0 points
|
0 comments
HuggingFace Diffusers 0.2 with Stable Diffusion pipeline
2 points
|
1 comments
0 points
|
0 comments
Diffusers: Modular Diffusion model library from HuggingFace
47 points
|
5 comments