User: kashifr | HN Mirror

Y	Hacker News new \| ask \| show \| jobs

user: kashifr
created: 2011-03-11
karma: 750

https://github.com/kashif https://twitter.com/krasul

submissions:

Carbon: Autoregressive Genomic Foundation Model

7 points | 1 comments

The ultimate guide to RL environments: building and scaling them in the LLM era

7 points | 0 comments

Distilling 100B+ Models 40x Faster with TRL

13 points | 0 comments

0 points | 0 comments

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

2 points | 0 comments

Transformers V5 is out!

10 points | 0 comments

The Smol Training Playbook: The Secrets to Building World-Class LLMs

265 points | 19 comments

Unlocking On-Policy Distillation for Any Model Family

6 points | 1 comments

Transformers 4.55 New OpenAI GPT OSS

2 points | 1 comments

Smollm3: Smol, multilingual, long-context reasoner LLM

388 points | 79 comments

0 points | 0 comments

Epic vs. Apple

7 points | 0 comments

AIMO (AI Math Olympiad) progress prize winning solution

9 points | 0 comments

0 points | 0 comments

MaPO: A reference-free alignment technique for diffusion models

2 points | 1 comments

0 points | 0 comments

OpenHermesPreferences: Dataset of ~1M AI preferences from teknium/OpenHermes-2.5

7 points | 1 comments

HuggingFace Training Cluster as a Service

101 points | 45 comments

HuggingFace 235M series D at a $4.5B valuation

3 points | 0 comments

Fine-tune Llama 2 with DPO

3 points | 0 comments

0 points | 0 comments

QLoRA 4-bit finetuning of LLMs

7 points | 1 comments

0 points | 0 comments

StackLlama: A hands-on guide to train LlaMa with RLHF

165 points | 38 comments

0 points | 0 comments

HuggingFace Diffusers 0.2 with Stable Diffusion pipeline

2 points | 1 comments

0 points | 0 comments

Diffusers: Modular Diffusion model library from HuggingFace

47 points | 5 comments