Hacker News
user: leonardtang
created: 2019-10-01
karma: 41

submissions:

EvoForge: Evolutionary Harness Optimization
2 points | 0 comments
Chinese Calligraphy Is a Frontier Task
1 point | 0 comments
TournO: Tournament Optimization for Non-Verifiable RL
3 points | 0 comments
j1-micro and j1-nano: Tiny (0.6B, 1.7B) and Mighty Reward Models
3 points | 0 comments
Verdict: A Library for Scaling Judge-Time Compute
3 points | 0 comments
Awesome-LLM-Judges
2 points | 0 comments
LLM Judges
2 points | 0 comments
Cascade: A fast, automated, multi-turn LLM jailbreaking method
2 points | 0 comments
RBAC RAG
1 point | 0 comments
RBAC RAG with MongoDB
2 points | 0 comments
Simple and Safe RAG with RBAC
2 points | 0 comments
Inducing LLM Hallucinations
2 points | 0 comments
Sphynx: Fuzz Testing Hallucination Detection Models
2 points | 0 comments
It's a bad day to be a language model
2 points | 1 comment
Thorn in a HaizeStack test for evaluating long-context adversarial robustness
19 points | 11 comments
Thorn in a HaizeStack Long-Context Jailbreak Test
5 points | 0 comments
A Convenient Ensembled Perplexity API
1 point | 0 comments
A Trivial Llama 3 Jailbreak
70 points | 47 comments
Making a SOTA Adversarial Attack on LLMs 38x Faster
2 points | 0 comments
LLM Red-Teaming Resistance Leaderboard
2 points | 0 comments
OpenAI Content Moderation Is Really, Really Bad
2 points | 1 comment
Degraded Polygons Raise Fundamental Questions of Neural Network Perception
1 point | 0 comments