Y
Hacker News
new
|
ask
|
show
|
jobs
user:
ashu_trv
created:
2016-09-05
karma:
79
Building @videodb_io I like building fundamental systems. Wanders in deep thoughts of technology, science, spirituality and human nature.
submissions:
Show HN: Do Thought Streams Matter? A Benchmark of VLM Reasoning in Gemini 2.5
3 points
|
0 comments
Show HN: I packaged decade of video infra battle scars into tools for AI agents
7 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
Live video feed for every multimodal model not just Gemini
7 points
|
2 comments
Show HN: VideoDB – 80 % fewer hallucinations on NFL game analysis
1 points
|
0 comments
Lessons Learned Building MCP for Video Infrastructure Startup
2 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
Auto-Sync Your Docs, SDKs and Examples for LLMs and AI Agents
6 points
|
3 comments
Ask HN: Model to Analyse Financial Transactions
1 points
|
1 comments
Underwhelming MCP vs Hype
4 points
|
10 comments
Benchmarking vision-language models on OCR in dynamic video environments
142 points
|
58 comments
0 points
|
0 comments
Vision-Language Models vs. Traditional OCR in Video – New Benchmark
6 points
|
1 comments
0 points
|
0 comments
0 points
|
0 comments
Show HN:Video is hard: until now
4 points
|
4 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
Show HN: Instantly create video clips from LLM prompts
4 points
|
5 comments
0 points
|
0 comments
Show HN: GPT-Powered Video Retrieval and Streaming
5 points
|
1 comments
0 points
|
0 comments
Show HN: Twitter bot generates interactive transcript of any audio/video
7 points
|
2 comments