Hacker News
user: 3d27
created: 2023-11-09
karma: 20

submissions:

How to evaluate multi-turn LLM chatbots
3 points | 0 comments
We wrote a comprehensive guide on LLM security
1 point | 0 comments
How to generate synthetic data using SOTA data evolution methods
1 point | 0 comments
How to build your own LLM evaluation framework
2 points | 0 comments
Overview of All Major LLM Benchmarks
1 point | 0 comments
I wrote an article about everything I know about LLM metrics
2 points | 1 comment
Best practices I learnt from helping health tech enterprise test LLMs
1 point | 0 comments
Am I too needy? From a data science perspective
1 point | 1 comment
Best Practices for Unit Testing RAG Systems in Prod
4 points | 0 comments
Tried Apple's Vision Pros, would not recommend it
2 points | 0 comments
Everything I know about LLM evaluation metrics
7 points | 0 comments
Google 2024 Layoffs on a rolling-basis
1 point | 0 comments
Meta Going All in on GenAI
3 points | 2 comments
I used QAG to implement an LLM text summarization evals
3 points | 0 comments
I found a way to code like Shakespear
1 point | 1 comment
I implemented 12+ LLM evaluation metrics so you don't have to
4 points | 1 comment
AI Makes Commercial Masterpiece [video]
2 points | 0 comments
Show HN: I implemented evals metrics for LLMs that runs locally on your machine
22 points | 3 comments
Overcoming the biggest barrier to practical quantum computers
1 point | 0 comments
Google's new model is good but the demo's not reproducible in Bard
1 point | 0 comments
What Is RAG? (With Examples)
1 point | 0 comments
Found this weird programming language
2 points | 0 comments