User: 3d27 | HN Mirror

user: 3d27
created: 2023-11-09
karma: 20

submissions:

How to evaluate multi-turn LLM chatbots

3 points | 0 comments

We wrote a comprehensive guide on LLM security

1 point | 0 comments

How to generate synthetic data using SOTA data evolution methods

1 point | 0 comments

How to build your own LLM evaluation framework

2 points | 0 comments

Overview of All Major LLM Benchmarks

1 point | 0 comments

I wrote an article about everything I know about LLM metrics

2 points | 1 comment

Best practices I learnt from helping health tech enterprise test LLMs

1 point | 0 comments

Am I too needy? From a data science perspective

1 point | 1 comment

Best Practices for Unit Testing RAG Systems in Prod

4 points | 0 comments

Tried Apple's Vision Pros, would not recommend it

2 points | 0 comments

Everything I know about LLM evaluation metrics

7 points | 0 comments

Google 2024 Layoffs on a rolling-basis

1 point | 0 comments

Meta Going All in on GenAI

3 points | 2 comments

I used QAG to implement an LLM text summarization evals

3 points | 0 comments

I found a way to code like Shakespeare

1 point | 1 comment

I implemented 12+ LLM evaluation metrics so you don't have to

4 points | 1 comment

AI Makes Commercial Masterpiece [video]

2 points | 0 comments

Show HN: I implemented evals metrics for LLMs that runs locally on your machine

22 points | 3 comments

Overcoming the biggest barrier to practical quantum computers

1 point | 0 comments

Google's new model is good but the demo's not reproducible in Bard

1 point | 0 comments

What Is RAG? (With Examples)

1 point | 0 comments

Found this weird programming language

2 points | 0 comments