8ca191604d
This exports functions that uses xml-conduit's parser to produce an xml-light Element or [Content]. This allows existing pandoc code to use a better parser without much modification. The new parser is used in all places where xml-light's parser was previously used. Benchmarks show a significant performance improvement in parsing XML-based formats (especially ODT and FB2). Note that the xml-light types use String, so the conversion from xml-conduit types involves a lot of extra allocation. It would be desirable to avoid that in the future by gradually switching to using xml-conduit directly. This can be done module by module. The new parser also reports errors, which we report when possible. A new constructor PandocXMLError has been added to PandocError in T.P.Error [API change]. Closes #7091, which was the main stimulus. These changes revealed the need for some changes in the tests. The docbook-reader.docbook test lacked definitions for the entities it used; these have been added. And the docx golden tests have been updated, because the new parser does not preserve the order of attributes. Add entity defs to docbook-reader.docbook. Update golden tests for docx. |
||
---|---|---|
.. | ||
block_quotes.docx | ||
codeblock.docx | ||
comments.docx | ||
custom_style_no_reference.docx | ||
custom_style_preserve.docx | ||
custom_style_reference.docx | ||
definition_list.docx | ||
document-properties-short-desc.docx | ||
document-properties.docx | ||
headers.docx | ||
image.docx | ||
inline_code.docx | ||
inline_formatting.docx | ||
inline_images.docx | ||
link_in_notes.docx | ||
links.docx | ||
lists.docx | ||
lists_continuing.docx | ||
lists_multiple_initial.docx | ||
lists_restarting.docx | ||
nested_anchors_in_header.docx | ||
notes.docx | ||
raw-blocks.docx | ||
raw-bookmarks.docx | ||
table_one_row.docx | ||
table_with_list_cell.docx | ||
tables.docx | ||
track_changes_deletion.docx | ||
track_changes_insertion.docx | ||
track_changes_move.docx | ||
track_changes_scrubbed_metadata.docx | ||
unicode.docx | ||
verbatim_subsuper.docx |