Hacker News new | ask | show | jobs
by Gormo 334 days ago
How do you stop users who aren't using the custom browser from accessing these 'tinyweb' HTTP sites? How do you prevent content scrapers and search indexers from accessing them? How do you suppress direct incorporation of 'mainstream' web content into 'tinyweb' content?

If your goal is precisely to create an parallel ecosystem that's "airgapped" from the mainstream web, and you're already going to have to develop custom clients, content formats, and server-side configuration to implement it on top of HTTP, and engage in lots of development work to imperfectly isolate the two ecosystems from each other, why wouldn't you just develop a parallel protocol and start with a clean slate?

2 comments

> How do you prevent content scrapers and search indexers from accessing them?

How do you that with Gemini?

> If your goal is precisely to create an parallel ecosystem that's "airgapped" from the mainstream web

There is no way you can have an air gapped network with public access. The moment this "parallel ecosystem" showed any content that hinted at something lucrative, you will have people creating bridges between the two networks. Case in point: Google and USENET.

> How do you that with Gemini?

You keep it isolated from the ecosystem in which all of those things are taking place.

> The moment this "parallel ecosystem" showed any content that hinted at something lucrative, you will have people creating bridges between the two networks. Case in point: Google and USENET.

The whole point is to minimize the chance of that happening -- by limiting mainstream appeal, keeping it a niche, and avoiding Eternal September -- and to maximize the friction of bridging these two ecosystems. And so far, they've done a fairly good job of it, since Gemini has been expanding for six years without any indication of any of this starting to happen.

> and to maximize the friction of bridging these two ecosystems.

There is no friction. It's trivial to write a program that can scrape a Gemini network.

If there is no one pulling data from Gemini servers yet, is not because it's difficult do it, but merely because it's still too small to be relevant.

> There is no friction. It's trivial to write a program that can scrape a Gemini network.

It's not trivial at all. First, you have to want to do it, then you have to commit time and effort to doing it, then you have to maintain the solution you deploy specifically for Gemini in parallel to your web scraping architecture.

> If there is no one pulling data from Gemini servers yet, is not because it's difficult do it, but merely because it's still too small to be relevant.

Exactly. But it if was using web tech, all of the existing web scrapers could just be pointed at it with minimal effort. So using a separate, custom tech stack is what keeps the threshold of effort in front the threshold of desire.

And using a separate tech stack also creates intentional friction in terms of new user adoption, keeping it slow and maintaining the protocol's niche status. So this also helps keep that threshold of desire distant.

> First, you have to want to do it, then you have to commit time and effort to doing it

Implementing a Gemini to HTTP gateway seems like a perfect type of weekend project for someone that wants to play with a new programming language. With that, any barrier that you think you have is gone. Crawlers will get to you no matter what.

> So using a separate, custom tech stack is what keeps the threshold of effort in front the threshold of desire.

You know what else you could do? Just run your web server on a non-standard port.

Seriously, the more you try to rationalize this as anything else other than hobby, the less sense it makes. Just go with "it's a hobby and I enjoy spending my time with it", and I promise I will get out of your hair.

I don't know what to tell you at this point other than 'eppur si muove' -- you're giving me a bunch of theoretical reasons as to why you think it shouldn't work, and yet Gemini has been around for years now: it is maintaining itself as a distinct niche; it is not full of spam, ads, or slop; its content isn't readily accessible from the web or turning up in major web search engines despite the fact that there already are multiple web gateways around; and it generally does fulfill the intentions of the people using it.

You're treating this like a hypothetical discussion, but we're talking about something that already exists and functions.

And I don't think the "hobby" distinction you're making is particularly relevant, because the whole point of it is again to function as a community of amateurs -- and it's doing that quite effectively.

Why would you need to? The big web existing doesn’t hinder or harm the existence of the tinyweb.