Hacker News new | ask | show | jobs
user: shutty
created: 2016-06-04
karma: 295

submissions:

Benchmarking SlateDB vs. RocksDB
3 points | 0 comments
Show HN: MurrDB: A RocksDB-based NVMe/S3 cache for AI inference workloads
1 points | 0 comments
Lies, damned lies, and Elastic's benchmarks
4 points | 0 comments
Firefox Smart Window
12 points | 1 comments
Memento Mori Motivator
1 points | 0 comments
0 points | 0 comments
0 points | 0 comments
Accessing LLMs with a Fax [DE]
1 points | 1 comments
Fine-tuning Qwen3 at home to respond to any prompt with a dad joke
1 points | 0 comments
The Day Our Own Queries DoS'ed Us: Inside Zalando Search
1 points | 0 comments
Show HN: Fine-tuning Qwen3 at home to respond to any prompt with a dad joke
10 points | 0 comments
Show HN: I benchmarked read latency of AWS S3, S3Express, EBS and Instance store
3 points | 0 comments
0 points | 0 comments
I put a real search engine into a Lambda, so you only pay when you search
48 points | 13 comments
We found embedding indexing bottleneck in the least expected place: JSON parsing
2 points | 0 comments
0 points | 0 comments
Show HN: Nixiesearch, an open-source alternative to Elasticsearch Serverless
2 points | 0 comments
Finite State Transducers
1 points | 0 comments
0 points | 0 comments
How [NOT] to Evaluate Your RAG
5 points | 0 comments
Measuring OpenAI embedding API latency (and why you should always cache it)
3 points | 0 comments
Llama.cpp AI Performance with the GeForce RTX 5090
2 points | 0 comments
0 points | 0 comments
Show HN: A dataset of all HN submission texts (2006-2024) in Markdown
1 points | 0 comments
Hacker News Comments Dataset
2 points | 1 comments
Nixiesearch: Running Lucene over S3, and why we're building a new search engine
128 points | 76 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
How to compute LLM embeddings 3X faster with model quantization
2 points | 0 comments