What is Miklos hacking

Optimizing ODT ↔ XHTML conversion performance for simple documents

https://farm5.staticflickr.com/4605/26697712598_2ace3f45a3_o.png

The focus here was on really simple documents, for example a single sentence with minimal formatting. The use case is a collection of thousands of such documents, where only a minority contains complex formatting and the rest are just that simple.
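To make the workload concrete, here is a minimal sketch of how the per-document cost of such a batch could be timed. It is not the measurement setup behind the chart: `time_conversions` and `fake_convert` are hypothetical names, and `fake_convert` is only a stand-in for whatever call performs the real ODT → XHTML conversion.

```python
import time
from statistics import median
from typing import Callable, Iterable


def time_conversions(convert: Callable[[bytes], bytes],
                     documents: Iterable[bytes]) -> dict:
    """Time `convert` on each document and summarize the per-document cost."""
    durations = []
    for doc in documents:
        start = time.perf_counter()
        convert(doc)
        durations.append(time.perf_counter() - start)
    return {
        "count": len(durations),
        "total_s": sum(durations),
        "median_s": median(durations) if durations else 0.0,
    }


def fake_convert(doc: bytes) -> bytes:
    # Hypothetical stand-in for a real ODT -> XHTML conversion call.
    return b"<p>" + doc + b"</p>"


if __name__ == "__main__":
    # Thousands of trivial documents, mirroring the use case described above.
    docs = [b"One sentence."] * 1000
    print(time_conversions(fake_convert, docs))
```

With many tiny inputs the median is probably the more useful number, since a few complex documents in the batch would otherwise dominate the total.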

Performance work usually focuses on one specific complex feature, e.g. lots of bookmarks, lots of document-level user-defined metadata, and so on. As a result, there was still room for improvement when it comes to trivial documents.

I managed to reduce the cost of the conversion to a _fifth of the original_ cost in both directions; the chart above shows the impact of my work for the ODT → XHTML direction. The steps that helped:

Note that the XHTML mode for the Writer HTML export is still a work in progress, but it already produces valid output for such simple documents.