Having trouble finding a word in multiple files

Peter Flynn peter at silmaril.ie
Sun Jun 14 19:28:22 UTC 2020


On 14/06/2020 16:34, Karl Auer wrote:
[...]
> Microsoft had the chance to move to a genuinely open format, namely
> ODF — well-documented, unencumbered and free. Instead it used
> various hitherto irrelevant but still vote-bearing entities to bribe,
> browbeat and muscle its way to getting OOXML declared a "standard",
> in spite of the fact that was then and still is largely undocumented
> and most definitely not open.

It is actually documented, but not all of it, and they keep changing it, 
just so you have problems processing it.

> Microsoft has not implemented the "open" standard OOXML in its DOCX 
> format, which is why the only programs that can reliably render ALL 
> DOCX files are, you guessed it, Microsoft programs.

Fortunately, no-one in their right minds uses Word as a source for 
anything except the text and the Named Styles, which are essential, 
otherwise it's just a bucket of words.

> I have a better idea - convert them to genuinely open formats. ODF is
> the prime candidate.

That would also work, although it makes a dog's breakfast of the style 
names, which is a poor piece of implementation.

> For files whose formatting is irrelevant, convert them to text.

Convert them to something meaningful and accessible like DocBook, TEI, 
JATS, or even HTML.

Peter




More information about the ubuntu-users mailing list