User: leonardtang | HN Mirror

user: leonardtang
created: 2019-10-01
karma: 41

submissions:

EvoForge: Evolutionary Harness Optimization

2 points | 0 comments

Chinese Calligraphy Is a Frontier Task

1 point | 0 comments

TournO: Tournament Optimization for Non-Verifiable RL

3 points | 0 comments

j1-micro and j1-nano: Tiny (0.6B, 1.7B) and Mighty Reward Models

3 points | 0 comments

Verdict: A Library for Scaling Judge-Time Compute

3 points | 0 comments

Awesome-LLM-Judges

2 points | 0 comments

2 points | 0 comments

0 points | 0 comments

Cascade: A fast, automated, multi-turn LLM jailbreaking method

2 points | 0 comments

1 point | 0 comments

RBAC RAG with MongoDB

2 points | 0 comments

Simple and Safe RAG with RBAC

2 points | 0 comments

Inducing LLM Hallucinations

2 points | 0 comments

Sphynx: Fuzz Testing Hallucination Detection Models

2 points | 0 comments

It's a bad day to be a language model

2 points | 1 comment

0 points | 0 comments

0 points | 0 comments

Thorn in a HaizeStack test for evaluating long-context adversarial robustness

19 points | 11 comments

Thorn in a HaizeStack Long-Context Jailbreak Test

5 points | 0 comments

A Convenient Ensembled Perplexity API

1 point | 0 comments

A Trivial Llama 3 Jailbreak

70 points | 47 comments

Making a SOTA Adversarial Attack on LLMs 38x Faster

2 points | 0 comments

LLM Red-Teaming Resistance Leaderboard

2 points | 0 comments

OpenAI Content Moderation Is Really, Really Bad

2 points | 1 comment

Degraded Polygons Raise Fundamental Questions of Neural Network Perception

1 point | 0 comments

0 points | 0 comments