Skip to content

Need strict docx → transitional docx conversion step #3

@gimsieke

Description

@gimsieke

Attached .docx is saved as “strict docx.” Strict docx is one of the more brainfucked concepts on top of the other ISO/IEC 29500-1-related madness. It was decided that really standards-compliant OOXML files would have the same namespace prefixes, but different namespace URIs. That makes all XML-based processing tools moot.

So we either need a preprocessing step within docx2hub that replaces the namespaces when creating the single tree or a standalone strict→transitional step.

Simply manipulating namespaces in the single tree document may not be enough for cases where we first unzip the docx file, create a single tree and only selectively overwrite some of the unzipped files with manipulated chunks. Then some files in the archive will have xmlns:w="http://schemas.openxmlformats.org/wordprocessingml/2006/main" while others have xmlns:w="http://purl.oclc.org/ooxml/wordprocessingml/main". This will probably lead to an error when opening the manipulated docx.

https://social.technet.microsoft.com/Forums/en-US/e969fc0a-9fcd-4efe-bf6d-79ea8c34360f/what-is-the-default-file-format-for-saving-in-ms-office-2013-is-it-still-the-transitional-ooxml-or?forum=officeitpro

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions