Hacker News new | ask | show | jobs
by dhx 707 days ago
If this is of interest, Wikidata contains a huge amount of data on file formats. For example, [1] is a huge list of file formats and [2] is a list of documentation or source code (structs etc) describing file formats. Other data includes links to other file format information databses such as PRONOM, LOC and NARA, magic sequences to identify files of a particular format, and links to source code as parsers of file formats.

Where most people would say "It's a JPEG file", the data within Wikidata breaks that down into the detail of different versions of JFIF, SPIFF, EXIF, etc that are all have different internal formats, even though the file extensions and media types are common.

[1] https://www.wikidata.org/wiki/Wikidata:WikiProject_Informati...

[2] https://www.wikidata.org/wiki/Wikidata:WikiProject_Informati...

2 comments

I got my hopes up for understanding TIFF better, but alas I could not find any references I hadn't seen before. I have trouble generating anything but the most basic TIFF without getting into trouble with various consumers.
If you notice, a lot of the more esoteric 3D formats listed on wikidata link right back to the paulbourke.net site from this post, it's a small world!