Hacker News new | ask | show | jobs
by 4bpp 848 days ago
I imagine the folks over at NSA must be rubbing their hands over the possibilities this will open up for querying the data they have been diligently storing over the years.
4 comments

That's the point where hallucinations are pretty dangerous.
Not too hard to verify.
Verify? There are plenty of examples where things have been "verified" to prove a point. WMDs ring a bell?
USA never technically lied about Sadam's WMD, as he had used it against the kurds and during the war with Iran.

They did lie about Sadam trying to get nukes.

Now the problem is whenever the WMD was produced before or after 1991 as that was when Iraq agreed to dismantle.

What is your point? That obtaining absolute knowledge of truth is impossible, and therefore anything claiming to be true is worthless?

In general, be careful not to kill "good" on the way to attempting to obtain "perfect" in vain. And GPT4's hallucination rate is quite low at this point (may of course depend on the topic).

Not at all. I'm coming from the opposite side saying that anything can be "verified" true if you just continue to repeat it as truth so that people accept it. Say often, say it loud. Facts be damned
Yes but that's the argument by repetition (ad nauseam) fallacy
This has always happened and will continue always happening. Best not get caught up on it, just put people right when you feel you have the facts to back it up.
Palantir already provides this product to them.
Palantir sells a glorified Airflow instance
I heard it’s a big Cassandra too. But that’s only the backends and the frontend and their data engineers are important too.
Palantir provides this by re-packaging and re-selling GPT-4, Claude, and Gemini
NSA does not have large-scale storage. If they did, it would be in a building somewhere, it would have electric power and they would have bought storage devices for it from some company. The largest-known actual NSA datacenter is the size of a runty leftover from having built a real cloud datacenter, and there's only one of those while FAMG have hundreds of far larger datacenters.
Phew, glad the NSA doesn't have any large scale storage. Big relief. By the way, what do they use their $10 billion AWS contract for?
I know it's not storing every email, phone call, photo, and video ever transmitted on the internet, like certain people want you to believe.

A $10b AWS contract would not even amount to enough storage to keep one copy of the public web.

If you think 100,000 square feet is a large data center, you obviously do not work in the industry.
Ad hominem much?
I'm fairly sure the budget of archive.is is less than $10B. (Admittedly they don't store videos though.)
https://en.wikipedia.org/wiki/Utah_Data_Center

Genuine question, is Exabyte-scale small in the context of cloud? Is Amazon stacking yottabytes?

Edit: 'Exabyte scale' was from a Forbes article in 2013

If you think this is a large datacenter, you are mistaken. You could fit this inside any cloud datacenter, and there are hundreds of those. The NSA thing draws 65MW. Google alone has over 7000MW of first-party energy generation resources, that doesn't begin to account for what they draw from the grid, and they're not even the biggest datacenter owner.
A building somewhere. Like inside a military base for example. Good luck finding out what’s inside it and having anyone who worked on a restricted contract telling you about it.
Given the history of NSA, warrantless surveillance and the overt existence of the Utah data center + collaboration with telcos and big tech, and the promise of AI analysis and quantum computing...

I find it difficult to accept your underlying premise that NSA doesn't have access to a massive amount of data which AI may be able to analyze for them.

They could always try using tech to reduce their false positive rate rather than increase it.