Hacker News
user: anakin87
created: 2023-04-04
karma: -1

submissions:

0 points | 0 comments
Show HN: Hands-on course for building RL environments for LLMs
1 point | 1 comment
0 points | 0 comments
Environments Hub: Your Language Model needs better (open) environments to learn
2 points | 1 comment
0 points | 0 comments
GRPO experiment - I trained a Language Model to schedule events
1 point | 1 comment
0 points | 0 comments
I trained a Language Model to schedule events with GRPO
1 point | 1 comment
0 points | 0 comments
Llama2 + Haystack on Colab
7 points | 1 comment
0 points | 0 comments