User: shutty | HN Mirror

user: shutty
created: 2016-06-04
karma: 295

submissions:

Benchmarking SlateDB vs. RocksDB

3 points | 0 comments

Show HN: MurrDB: A RocksDB-based NVMe/S3 cache for AI inference workloads

1 point | 0 comments

Lies, damned lies, and Elastic's benchmarks

4 points | 0 comments

Firefox Smart Window

12 points | 1 comment

Memento Mori Motivator

1 point | 0 comments

Accessing LLMs with a Fax [DE]

1 point | 1 comment

Fine-tuning Qwen3 at home to respond to any prompt with a dad joke

1 point | 0 comments

The Day Our Own Queries DoS'ed Us: Inside Zalando Search

1 point | 0 comments

Show HN: Fine-tuning Qwen3 at home to respond to any prompt with a dad joke

10 points | 0 comments

Show HN: I benchmarked read latency of AWS S3, S3Express, EBS and Instance store

3 points | 0 comments

I put a real search engine into a Lambda, so you only pay when you search

48 points | 13 comments

We found embedding indexing bottleneck in the least expected place: JSON parsing

2 points | 0 comments

Show HN: Nixiesearch, an open-source alternative to Elasticsearch Serverless

2 points | 0 comments

Finite State Transducers

1 point | 0 comments

How [NOT] to Evaluate Your RAG

5 points | 0 comments

Measuring OpenAI embedding API latency (and why you should always cache it)

3 points | 0 comments

Llama.cpp AI Performance with the GeForce RTX 5090

2 points | 0 comments

Show HN: A dataset of all HN submission texts (2006-2024) in Markdown

1 point | 0 comments

Hacker News Comments Dataset

2 points | 1 comment

Nixiesearch: Running Lucene over S3, and why we're building a new search engine

128 points | 76 comments

How to compute LLM embeddings 3X faster with model quantization

2 points | 0 comments