Hacker News new | ask | show | jobs
by tzs 834 days ago
> It is a full 7000 pages long with lots of conflicting details > [...] > Compare it to ODF which is 1/10th of the size

A few things worth noting. First, the size comparison was done by counting pages in the PDFs of the two from their respective standards committees. The OOXML PDF used something like twice the line spacing of the ODF PDF, and may have also had a larger font size. Print the two with the same settings and OOXML still was a lot bigger.

Second, ODF deferred some important things to be done in later revisions of the spec. For example spreadsheet formulas. OOXML on the other hand had hundreds of pages covering spreadsheet formulas, including detailed mathematical definitions and explanations for functions.

> On top of that it is full of references like "this works like wordperfect version-n". While references are useful in specifications, they need to be to existing open standards to be meaningful. Wordperfect has never standardized its format, so referring to it is meaningless.

I believe those were in a draft but taken out in the final spec. Also, I think you might be overlooking what those references were meant to be used for.

There were lots of organizations with lots of documents that were created in older proprietary products like WordPerfect, Lotus, etc, and many of those organizations had reverse engineered or partly reverse engineered those formats and built toolchains around them.

Let's say such an organization would like to switch to ODF. They would have to rewrite their tools but they are willing to do that because an XML format will make future development easier. They will also have to convert their existing documents to ODF, which they can do.

But there are things in those documents that must work like they worked in WordPerfect. For example if they print an old document it may be essential that it has the same line breaks that it did when WordPerfect printed it.

They accept that StarOffice or OpenOffice won't be able to do WordPerfect line breaks but that's fine. Their toolchain can handle document printing. And for new documents that they create in StarOffice or OpenOffice using whatever line breaks those programs use is fine--it is just dealing with legacy documents that requires matching what those old programs did.

What they want, then, is when using ODF as a storage format for their legacy documents for some way to mark in the ODF that the document needs WordPerfect line breaks.

Now imagine two different organizations both are doing this. So they both come up with some way to add to their ODF files that some text needs WordPerfect line breaks. But one of them calls it "WP6LineBreak" and one calls it "LB_LIKE_WP6".

Wouldn't it be nicer if all the organizations that are adding a "Use WordPerfect 6 line break" indicator to their converted-to-ODF legacy files did it the same way? It would make it easier if they ever exchanged legacy files, and it would be less confusing to the rest of is if one of these files ever got into the wild.

I remember some people brought up adding support to ODF for legacy documents early in ODF standardization but Sun was not interested. Their attitude generally was ODF was going to support everything StarOffice needed and nothing more.

Microsoft on the other hand did want to support people using OOXML for legacy document storage, and so they made a big list of the various things from the most popular earlier word processors and spreadsheets that they thought people would be wanting to extend OOXML to store, and reserved some names and markup for them.

They were quite clear that these were not meant to be used by general purpose OOXML word processors or spreadsheets. They were just for people in the scenario described above.