Hacker News new | ask | show | jobs
by wokwokwok 1080 days ago
Are they being obnoxious without cause though?

The Khoj website says, and I quote:

> Khoj's offline AI models allow you to find information using natural language queries. Search using terms that are similar to what you're looking for, rather than exact or fuzzy matches. Khoj search works offline. So if you self-host your data never leaves your machine and search works without internet.

Emphasis mine.

It seems somewhat disingenuous.

I get it, parts of it run offline, parts of it use the openai api… but that’s not what it says on the box.

Why is the project making a song and dance about self hosting and being open source when it’s just another openai app.

If it’s not just another openai wrapper, cut the openai part of it out and pitch it that way, sure.

…but as it stands, I’m pretty sceptical.

Lots of people are doing the “ai magic” tech demo stuff at the moment, but when you cut them off from the openai api the magic goes away and what’s left isn't very good or interesting.

Maybe this is different? …but it doesn’t look like it; and since they’re tied up with the openai api and you can’t use it without that, how would I even tell?

3 comments

>> Khoj search works offline. So if you self-host your data never leaves your machine and search works without internet.

> Emphasis mine.

> It seems somewhat disingenuous.

I've been trying it. Khoj search does work offline. Khoj chat (they are literally seperate functions in the app) requires an openAI key and if you give it one, uses openAI.

Yes! It's a bit more than, "somewhat disingenuous," to say a system built to use the OpenAI API works with you to make sure, "your data never leaves your machine".

That's like saying I invented a new form of transportation where you're feet never leave the ground but in actuality I'm just a travel agent sending you to the airport.

"your data never leaves your machine" is only mentioned in the Search section, where it is it true. No-one reading that would assume that meant everything considering the two last sentences above in the Chat section explicitly says it's using OpenAI.

Really feels like people are nitpicking and hating on this project for no good reason. I feel sorry for the authors.

It feels like you are reading too much into this. Really don't understand all the bashing here. It's an open source software for building things using OpenAI. Do you think LangChain is similarly disingenuous? Or the Vercel AI SDK?
Neither of those things claim:

> So if you self-host your data never leaves your machine

You're quoting the paragraph under "Search", describing their search engine. I feel you're misrepresenting it.

Anyway, I definitely don't think this deserves to be described as "a simple interface for the OpenAI-API drenched in fake buzzwords boosted to the top of HN to scam investors" or "twitter get-rich-quick-guru level lousy and fake, and is clearly boosted to the top of HN".

Horrible reactions in this thread to open source software you can fork to use whatever you want. Really disappointing.

Langchain works completely offline with appropriate LLM/API backend & vectore store if needed
No-one is preventing you from creating a PR or fork this project to add whatever backend you want. Did LangChain fully cover all backends on release? Are you not allowed to release a project that only supports OpenAI?

You really need to explain what you are hating on here.

STFU all I said is that what they claim is not what's the state of their project today. Don't even get me started on their alignment BS.