pandoc

Author	SHA1	Message	Date
Albert Krewinkel	58fbf56548	Jira writer: use `{color}` when span has a color attribute Closes: tarleb/jira-wiki-markup#10	2021-05-24 09:56:02 +02:00
John MacFarlane	1af2cfb287	Handle relative lengths (e.g. `2*`) in HTML column widths. See <https://www.w3.org/TR/html4/types.html#h-6.6>. "A relative length has the form "i*", where "i" is an integer. When allotting space among elements competing for that space, user agents allot pixel and percentage lengths first, then divide up remaining available space among relative lengths. Each relative length receives a portion of the available space that is proportional to the integer preceding the "*". The value "*" is equivalent to "1*". Thus, if 60 pixels of space are available after the user agent allots pixel and percentage space, and the competing relative lengths are 1*, 2*, and 3*, the 1* will be alloted 10 pixels, the 2* will be alloted 20 pixels, and the 3* will be alloted 30 pixels." Closes #4063.	2021-05-22 22:03:54 -07:00
John MacFarlane	07d299d353	DocBook reader: ensure that first and last names are separated. Closes #6541.	2021-05-20 18:45:39 -07:00
John MacFarlane	d7b5def287	Ms writer: handle tables with multiple paragraphs. Previously they overflowed the table cell width. We now set line lengths per-cell and restore them after the table has been written. Closes #7288.	2021-05-20 17:12:38 -07:00
John MacFarlane	bb11f5fb86	LaTeX reader: More siunitx improvements. Closes #6658. There's still one slight divergence from the siunitx behavior: we get 'kg m/A/s' instead of 'kg m/(A s)'. At the moment I'm not going to worry about that.	2021-05-20 15:30:31 -07:00
John MacFarlane	4e990a8cf9	LaTeX/siunitx: fix parsing of `\cubic` etc. See #6658.	2021-05-20 10:13:20 -07:00
John MacFarlane	bc5058234f	LaTeX reader siunitx: fix + sign on ang.	2021-05-20 10:13:20 -07:00
John MacFarlane	5dc917da3e	LaTeX reader siunitx: add leading 0 to numbers starting with .	2021-05-20 10:13:20 -07:00
Denis Maier	183ce58477	ConTeXt writer: improve ordered lists (#7304) Closes #5016 - change ordered list from itemize to enumerate - adds new itemgroup for ordered lists - add fontfeature for table figures - remove width from itemize in context writer	2021-05-20 09:59:53 -07:00
John MacFarlane	a366bd6abc	LaTeX reader: Fix parsing of `+-` in siunitx numbers. See #6658.	2021-05-20 09:03:29 -07:00
John MacFarlane	8437a4a002	LaTeX reader: support `\pm` in `SI{..}`. Closes #6620.	2021-05-20 08:16:46 -07:00
Albert Krewinkel	b6239f4150	ZimWiki writer: allow links and emphasis in headers The latest version of ZimWiki supports this. Closes: #6605	2021-05-20 12:48:05 +02:00
John MacFarlane	5736b331d8	LaTeX reader: better support for `\xspace`. Previously we only supported it in inline contexts; now we support it in all contexts, including math. Partially addresses #7299.	2021-05-19 16:14:49 -07:00
Albert Krewinkel	eb3dff148e	LaTeX writer: separate successive quote chars with thin space Successive quote characters are separated with a thin space to improve readability and to prevent unwanted ligatures. Detection of these quotes sometimes failed if the second quote was nested in a span element. Closes: #6958	2021-05-18 22:55:47 +02:00
Albert Krewinkel	1843a8793a	HTML reader: keep attributes from code nested below pre tag. If a code block is defined with `<pre><code class="language-x">…</code></pre>`, where the `<pre>` element has no attributes, then the attributes from the `<code>` element are used instead. Any leading `language-` prefix in the code's class attribute is dropped to improve syntax highlighting. Closes: #7221	2021-05-17 18:08:02 +02:00
Albert Krewinkel	25f5b92777	HTML writer: ensure headings only have valid attribs in HTML4 Fixes: #5944	2021-05-17 15:42:15 +02:00
Albert Krewinkel	4417dacc44	ConTeXt writer: use span identifiers as reference anchors. Closes: #7246	2021-05-17 13:14:32 +02:00
Albert Krewinkel	d3ca48656f	ConTeXt writer tests: keep code lines below 80 chars.	2021-05-17 13:11:33 +02:00
John MacFarlane	cc088687b4	LaTeX template: move title, author, date up to top of preamble. This allows header-includes to use them, and puts them in a position where you can see them immediately. Closes #7295.	2021-05-16 14:35:13 -07:00
John MacFarlane	5a6399d9f6	Markdown writer: fewer unneeded escapes for `#`. See #6259.	2021-05-16 12:23:34 -07:00
John MacFarlane	0a4c6925b6	Docx writer: copy over more settings from reference.docx. From settings.xml in the reference-doc, we now include: `zoom`, `embedSystemFonts`, `doNotTrackMoves`, `defaultTabStop`, `drawingGridHorizontalSpacing`, `drawingGridVerticalSpacing`, `displayHorizontalDrawingGridEvery`, `displayVerticalDrawingGridEvery`, `characterSpacingControl`, `savePreviewPicture`, `mathPr`, `themeFontLang`, `decimalSymbol`, `listSeparator`, `autoHyphenation`, `compat`. Closes #7240.	2021-05-15 15:40:49 -07:00
John MacFarlane	2cf971cf56	docx writer: Remove rsids from settings.docx. Word will add these when revisions are made. But it's pointless to start out with a set of them.	2021-05-15 10:54:05 -07:00
Albert Krewinkel	0794862aac	HTML reader: parse `<header>` as a Div HTML5 `<header>` elements are treated like `<div>` elements.	2021-05-15 16:46:02 +02:00
Albert Krewinkel	013e4a3164	HTML reader: keep h1 tags as normal headers (#7274) The tags `<title>` and `<h1 class="title">` often contain the same information, so the latter was dropped from the document. However, as this can lead to loss of information, the heading is now always retained. Use `--shift-heading-level-by=-1` to turn the `<h1>` into the document title, or a filter to restore the previous behavior. Closes: #2293	2021-05-14 12:31:24 -07:00
John MacFarlane	76a4e7127b	Beamer writer: support exampleblock and alertblock. A block will be rendered as an exampleblock if the heading has class `example` and alertblock if it has class `alert`. Closes #7278.	2021-05-14 10:09:46 -07:00
Albert Krewinkel	17d96404f5	Docx writer: allow multirow table headers	2021-05-14 16:19:20 +02:00
Albert Krewinkel	875f8f3654	HTML reader: don't fail on unmatched closing "script" tag. Prevent the reader from crashing if the HTML input contains an unmatched closing `</script>` tag. Fixes: #7282	2021-05-14 12:13:40 +02:00
John MacFarlane	3f09f53459	Implement curly-brace syntax for Markdown citation keys. The change provides a way to use citation keys that contain special characters not usable with the standard citation key syntax. Example: `@{foo_bar{x}'}` for the key `foo_bar{x}'`. Closes #6026. The change requires adding a new parameter to the `citeKey` parser from Text.Pandoc.Parsing [API change]. Markdown reader: recognize @{..} syntax for citations. Markdown writer: use @{..} syntax for citations when needed. Update manual with curly-brace syntax for citations. Closes #6026.	2021-05-13 21:59:32 -07:00
John MacFarlane	0217ae2a4f	Handle 'annote' field in bibtex/biblatex writer. Closes #7266.	2021-05-12 11:05:55 -07:00
John MacFarlane	5eb7ad7d1e	Improve integration of settings from reference.docx. The settings we can carry over from a reference.docx are autoHyphenation, consecutiveHyphenLimit, hyphenationZone, doNotHyphenateCap, evenAndOddHeaders, and proofState. Previously this was implemented in a buggy way, so that the reference doc's values AND the new values were included. This change allows users to create a reference.docx that sets w:proofState for spelling or grammar to "dirty," so that spell/grammar checking will be triggered on the generated docx. Closes #1209.	2021-05-11 22:31:38 -06:00
John MacFarlane	2bd5d0cafb	LaTeX writer: better handling of line breaks in simple tables. Now we also handle the case where they're embedded in other elements, e.g. spans. Closes #7272.	2021-05-11 07:52:05 -06:00
John MacFarlane	6e45607f99	Change reader types, allowing better tracking of source positions. Previously, when multiple file arguments were provided, pandoc simply concatenated them and passed the contents to the readers, which took a Text argument. As a result, the readers had no way of knowing which file was the source of any particular bit of text. This meant that we couldn't report accurate source positions on errors or include accurate source positions as attributes in the AST. More seriously, it meant that we couldn't resolve resource paths relative to the files containing them (see e.g. #5501, #6632, #6384, #3752). Add Text.Pandoc.Sources (exported module), with a `Sources` type and a `ToSources` class. A `Sources` wraps a list of `(SourcePos, Text)` pairs. [API change] A parsec `Stream` instance is provided for `Sources`. The module also exports versions of parsec's `satisfy` and other Char parsers that track source positions accurately from a `Sources` stream (or any instance of the new `UpdateSourcePos` class). Text.Pandoc.Parsing now exports these modified Char parsers instead of the ones parsec provides. Modified parsers to use a `Sources` as stream [API change]. The readers that previously took a `Text` argument have been modified to take any instance of `ToSources`. So, they may still be used with a `Text`, but they can also be used with a `Sources` object. In Text.Pandoc.Error, modified the constructor PandocParsecError to take a `Sources` rather than a `Text` as first argument, so parse error locations can be accurately reported. T.P.Error: showPos, do not print "-" as source name.	2021-05-09 19:11:34 -06:00
Albert Krewinkel	8357b835d9	App: allow tabs expansion even if file-scope is used Tabs in plain-text inputs are now handled correctly, even if the `--file-scope` flag is used. Closes: #6709	2021-05-05 19:09:21 +02:00
Albert Krewinkel	ddbf83f62c	Docx writer: support colspans and rowspans in tables See: #6315	2021-05-01 18:52:24 +02:00
mbrackeantidot	b6a65445e1	Docx reader: add handling of vml image objects (jgm#4735) (#7257) They represent images, the same way as other images in vml format.	2021-04-29 09:11:44 -07:00
John MacFarlane	d14c5f94df	Further improvements in smart quotes. Improves heuristic for detection of an "open double quote." Closes #2103.	2021-04-29 08:48:49 -07:00
John MacFarlane	80e2e88287	Smarter smart quotes. Treat a leading " with no closing " as a left curly quote. This supports the practice, in fiction, of continuing paragraphs quoting the same speaker without an end quote. It also helps with quotes that break over lines in line blocks. Closes #7216.	2021-04-28 23:32:37 -07:00
Albert Krewinkel	85f379e474	JATS writer: use either styled-content or named-content for spans. If the element has a content-type attribute, or at least one class, then that value is used as `content-type` and the span is put inside a `<named-content>` element. Otherwise a `<styled-content>` element is used instead. Closes: #7211	2021-04-28 22:21:34 +02:00
Albert Krewinkel	0921b82d98	Docx writer: autoset table width if no column has an explicit width.	2021-04-27 13:27:20 +02:00
Jan Tojnar	e9c0f9f97b	Markdown writer: Cleaner (code)blocks with single class (#7242) When a block only has a single class and no other attributes, it is not necessary to wrap the class attribute in curly braces – the class name can be placed after the opening mark as is. This will result in a bit cleaner output when pandoc is used as a markdown pretty-printer.	2021-04-25 10:36:06 -07:00
John MacFarlane	547bc2cdf8	Add quotes properly in markdown YAML metadata fields. This fixes a bug, which caused the writer to look at the LAST rather than the FIRST character in determining whether quotes were needed. So we got spurious quotes in some cases and didn't get necessary quotes in others. Closes #7245. Updated a number of test cases accordingly.	2021-04-25 10:31:33 -07:00
John MacFarlane	7f4850c9de	Remove biblatex-nussbaum.md test. It is basically the same as biblatex-quotes.md.	2021-04-25 10:29:03 -07:00
John MacFarlane	73d394ca2a	Use MetaInlines not MetaBlocks for multimarkdown metadata fields. This gives better results in converting to e.g. pandoc markdown. Ref: <https://groups.google.com/d/msgid/pandoc-discuss/9728d1f4-040e-4392-aa04-148f648a8dfdn%40googlegroups.com>	2021-04-18 22:01:12 -07:00
John MacFarlane	a478a5c4c8	Update to released unicode-collation, latest citeproc dev version. Update citeproc test.	2021-04-17 16:15:14 -07:00
John MacFarlane	099ac9985b	Use BCP47 language codes in citeproc tests.	2021-04-17 16:15:14 -07:00
John MacFarlane	ff5a504809	Use new citeproc + unicode-collation. Add command test for unicode-collation.	2021-04-17 16:15:13 -07:00
Albert Krewinkel	5f79a66ed6	JATS writer: reduce unnecessary use of <p> elements for wrapping The `<p>` element is used for wrapping in cases where the contents would otherwise not be allowed in a certain context. Unnecessary wrapping is avoided, especially around quotes (`<disp-quote>` elements). Closes: #7227	2021-04-16 22:47:37 +02:00
Albert Krewinkel	2d60524de4	JATS writer: convert spans to <named-content> elements Spans with attributes are converted to `<named-content>` elements instead of being wrapped with `<milestone-start/>` and `<milestone-end>` elements. Milestone elements are not allowed in documents using the articleauthoring tag set, so this change ensures the creation of valid documents. Closes: #7211	2021-04-10 11:49:18 +02:00
Albert Krewinkel	051b7ffeaf	JATS writer: add footnote number as label in backmatter Footnotes in the backmatter are given the footnote's number as a label. The articleauthoring output is unaffected by this change, as footnotes are placed inline there. Closes: #7210	2021-04-10 10:57:06 +02:00
John MacFarlane	20cd33e5a4	Fix regression in grid tables for wide characters. In the translation from String to Text, a char-width-sensitive splitAt' was dropped. This commit reinstates it. Closes #7214.	2021-04-08 14:48:29 -07:00
John MacFarlane	60974538b2	Commonmark writer: Use backslash escapes for `<` and `\|`... instead of entities. Closes #7208.	2021-04-05 23:29:22 -07:00
Albert Krewinkel	038261ea52	JATS writer: escape disallowed chars in identifiers XML identifiers must start with an underscore or letter, and can contain only a limited set of punctuation characters. Any IDs not adhering to these rules are rewritten by writing the offending characters as Uxxxx, where `xxxx` is the character's hex code.	2021-04-05 21:55:54 +02:00
tecosaur	4371223d13	Org writer: Use LaTeX style math delimiters (#7196) Org works better with LaTeX-style delimiters.	2021-04-01 23:36:02 +02:00
niszet	40da6c402b	Treat tabs as spaces in ODT Reader. (#7185)	2021-03-31 16:44:34 -07:00
John MacFarlane	56ce1fc126	Fix DocBook reader mathml regression... ...caused by the switch in XML libraries. Also fixed a similar issue in JATS. Closes #7173.	2021-03-24 12:04:33 -07:00
Erik Rask	82e8c29cb0	Include Header.Attr.attributes as XML attributes on section Add key-value pairs found in the attributes list of Header.Attr as XML attributes on the corresponding section element. Any key name not allowed as an XML attribute name is dropped, as are keys with invalid values where they are defined as enums in DocBook, and xml:id (for DocBook 5)/id (for DocBook 4) so as not to interfere with computed identifiers.	2021-03-20 21:29:17 +01:00
John MacFarlane	ceadf33246	Tests: Use getExecutablePath from base... avoiding the need to depend on the executable-path package.	2021-03-19 23:35:47 -07:00
John MacFarlane	dc94601eb5	Tests: factor out setupEnvironment in Test.Helpers. This avoids code duplication between Command and Old.	2021-03-19 21:17:13 -07:00
John MacFarlane	2ca1b20a85	Fix finding of data files from test programs. Apparently Cabal sets a `pandoc_datadir` environment variable so that the data files will be sought in the source directory rather than in the final destination (where they aren't yet installed). So we no longer need to set `--data-dir` in the tests. We just need to make sure `pandoc_datadir` is set in the environment when we call the program in the test suite. This will fix the issue with loading of pandoc.lua when pandoc is built with `-fembed_data_files`, reported in #7163. Closes #7163.	2021-03-19 18:57:13 -07:00
John MacFarlane	c3f9e8c122	Docx writer: make nsid in abstractNum deterministic. Previously we assigned a random number (though in a deterministic way). But changes in the random package mean we get different results now on different architectures, even with the same random seed. We don't need random values; so now we just assign a value based on the list number id, which is guaranteed to be unique to the list marker.	2021-03-17 22:31:20 -07:00
John MacFarlane	e66bf891ec	Add test for #7155.	2021-03-17 09:10:37 -07:00
John MacFarlane	63a6059790	Update tests for new texmath.	2021-03-15 18:22:38 -07:00
John MacFarlane	35b66a7671	MediaWiki reader: Allow block-level content in notes (ref). Closes #7145.	2021-03-13 12:50:44 -08:00
John MacFarlane	eed18d231c	Use integral values for w:tblW in docx. Closes #7141.	2021-03-13 12:05:52 -08:00
Albert Krewinkel	f8b49e77f8	Use jira-wiki-markup 1.3.4 Jira reader: * Fixed parsing of autolinks (i.e., of bare URLs in the text). Previously an autolink would take up the rest of a line, as spaces were allowed characters in these items. * Emoji character sequences no longer cause parsing failures. This was due to missing backtracking when emoji parsing fails. Jira writer: * Block quotes are only rendered as `bq.` if they do not contain a linebreak.	2021-03-13 14:53:58 +01:00
Albert Krewinkel	00e8d0678e	Jira reader: mark divs created from panels with class "panel". Closes: tarleb/jira-wiki-markup#2	2021-03-13 14:29:47 +01:00
Albert Krewinkel	a8aa301428	Jira writer: improve div/panel handling Include div attributes in panels, always render divs with class `panel` as panels, and avoid nesting of panels.	2021-03-13 12:10:02 +01:00
John MacFarlane	5608dc01e5	HTML writer: Add warnings on duplicate attribute values. This prevents emitting invalid HTML. Ultimately it would be good to prevent this in the types themselves, but this is better for now. T.P.Logging: Add DuplicateAttribute constructor to LogMessage. [API change]	2021-03-10 10:19:40 -08:00
John MacFarlane	1c23e3a824	RST reader: fix logic for ending comments. Previously comments sometimes got extended too far. Closes #7134.	2021-03-09 13:03:27 -08:00
Albert Krewinkel	b9b2586ed3	Org writer: prevent unintended creation of ordered list items Adjust line wrapping if default wrapping would cause a line to be read as an ordered list item. Fixes #7132	2021-03-09 18:14:54 +01:00
Albert Krewinkel	eb184d9148	Jira writer: use noformat instead of code for unknown languages. Code blocks that are not marked as a language supported by Jira are rendered as preformatted text with `{noformat}` blocks. Fixes: tarleb/jira-wiki-markup#4	2021-03-08 12:50:35 +01:00
John MacFarlane	5aa73bd0a2	LaTeX reader: handle table cells containing `&` in `\verb`. Closes #7129.	2021-03-07 15:49:02 -08:00
Albert Krewinkel	e1454fe0d0	Jira writer: use Span identifiers as anchors Closes: tarleb/jira-wiki-markup#3.	2021-03-01 14:36:11 +01:00
John MacFarlane	12b47656d4	Remove superfluous imports.	2021-02-28 22:57:36 -08:00
John MacFarlane	7e38b8e55a	T.P.Readers.LaTeX: Don't export tokenize, untokenize. [API change] These were only exported for testing, which seems the wrong thing to do. They don't belong in the public API and are not really usable as they are, without access to the Tok type which is not exported. Removed the tokenize/untokenize roundtrip test. We put a quickcheck property in the comments which may be used when this code is touched (if it is).	2021-02-28 22:53:42 -08:00
John MacFarlane	a9cc5d2616	Update tests for changes to https URLs.	2021-02-26 18:00:45 -08:00
Salim B	fae6a204f1	Fix/update URLs and use HTTPS where possible (#7122)	2021-02-26 17:56:04 -08:00
John MacFarlane	f0a991a22b	T.P.CSV: fix parsing of unquoted values. Previously we didn't allow unescaped quotes in unquoted values, but they are allowed. Closes #7112.	2021-02-22 21:18:04 -08:00
Albert Krewinkel	00e4bb51e4	tests: print accurate location if a test fails Ensures that tasty-hunit reports the location of the failing test instead of the location of the helper `test` function.	2021-02-22 23:56:04 +01:00
John MacFarlane	80fde18fb1	Text.Pandoc.UTF8: change IO functions to return Text, not String. [API change] This affects `readFile`, `getContents`, `writeFileWith`, `writeFile`, `putStrWith`, `putStr`, `putStrLnWith`, `putStrLn`, `hPutStrWith`, `hPutStr`, `hPutStrLnWith`, `hPutStrLn`, `hGetContents`. This avoids the need to uselessly create a linked list of characters when emitting output.	2021-02-22 11:30:07 -08:00
John MacFarlane	005344fb18	Revert "LaTeX template: disable `` ?` `` and `` !` `` ligatures." This reverts commit `24d7cd539b`.	2021-02-18 17:03:11 -08:00
John MacFarlane	24d7cd539b	LaTeX template: disable `` ?` `` and `` !` `` ligatures. These are often triggered by accident in languages that use ` `` ` for end quote (e.g. German). See jgm/citeproc#54.	2021-02-18 15:48:40 -08:00
Albert Krewinkel	743f7216de	Org reader: fix bug in org-ref citation parsing. The org-ref syntax allows listing multiple citations separated by commas. This fixes a bug that accepted commas as part of the citation id, so all citation lists were parsed as one single citation. Fixes: #7101	2021-02-18 21:59:18 +01:00
John MacFarlane	967e7f5fb9	Rename Text.Pandoc.XMLParser -> Text.Pandoc.XML.Light... ..and add new definitions isomorphic to xml-light's, but with Text instead of String. This allows us to keep most of the code in existing readers that use xml-light, but avoid lots of unnecessary allocation. We also add versions of the functions from xml-light's Text.XML.Light.Output and Text.XML.Light.Proc that operate on our modified XML types, and functions that convert xml-light types to our types (since some of our dependencies, like texmath, use xml-light). Update golden tests for docx and pptx. OOXML test: Use `showContent` instead of `ppContent` in `displayDiff`. Docx: Do a manual traversal to unwrap sdt and smartTag. This is faster, and needed to pass the tests. Benchmarks: A = prior to `8ca191604d` (Feb 8) B = as of `8ca191604d` (Feb 8) C = this commit \| Reader \| A \| B \| C \| \| ------- \| ----- \| ------ \| ----- \| \| docbook \| 18 ms \| 12 ms \| 10 ms \| \| opml \| 65 ms \| 62 ms \| 35 ms \| \| jats \| 15 ms \| 11 ms \| 9 ms \| \| docx \| 72 ms \| 69 ms \| 44 ms \| \| odt \| 78 ms \| 41 ms \| 28 ms \| \| epub \| 64 ms \| 61 ms \| 56 ms \| \| fb2 \| 14 ms \| 5 ms \| 4 ms \|	2021-02-16 16:55:20 -08:00
Albert Krewinkel	b5b576184c	JATS writer: add date-type to pub-date elements	2021-02-15 13:15:14 +01:00
Albert Krewinkel	2c99e0e358	JATS writer: replace attribute "pub-type" with "publication-format". The former attribute is deprecated.	2021-02-15 13:15:14 +01:00
John MacFarlane	d84a6041e1	HTML reader: fix bad handling of empty src attribute in iframe. - If src is empty, we simply skip the iframe. - If src is invalid or cannot be fetched, we issue a warning and skip instead of failing with an error. - Closes #7099.	2021-02-13 13:08:34 -08:00
John MacFarlane	6e73273916	T.P.Error: export `renderError`. Refactor `handleError` to use `renderError`. This allows us to render error messages without exiting.	2021-02-13 13:08:34 -08:00
Albert Krewinkel	a3beed9db8	Org: support task_lists extension The tasks lists extension is now supported by the org reader and writer; the extension is turned on by default. Closes: #6336	2021-02-13 13:00:37 -08:00
John MacFarlane	3be066b7d3	Fix command test 5686	2021-02-12 19:04:14 -08:00
John MacFarlane	59875185b3	Add command test for #7092	2021-02-12 19:04:14 -08:00
Albert Krewinkel	8ffd4159d6	Jira: require jira-wiki-markup 1.3.3 * Modified the Doc parser to skip leading blank lines. This fixes parsing of documents which start with multiple blank lines. (#7095) * Prevent URLs within link aliases from being treated as autolinks. (#6944) Fixes: #7095 Fixes: #6944	2021-02-12 17:15:12 +01:00
John MacFarlane	8ca191604d	Add new unexported module T.P.XMLParser. This exports functions that use xml-conduit's parser to produce an xml-light Element or [Content]. This allows existing pandoc code to use a better parser without much modification. The new parser is used in all places where xml-light's parser was previously used. Benchmarks show a significant performance improvement in parsing XML-based formats (especially ODT and FB2). Note that the xml-light types use String, so the conversion from xml-conduit types involves a lot of extra allocation. It would be desirable to avoid that in the future by gradually switching to using xml-conduit directly. This can be done module by module. The new parser also reports errors, which we report when possible. A new constructor PandocXMLError has been added to PandocError in T.P.Error [API change]. Closes #7091, which was the main stimulus. These changes revealed the need for some changes in the tests. The docbook-reader.docbook test lacked definitions for the entities it used; these have been added. And the docx golden tests have been updated, because the new parser does not preserve the order of attributes. Add entity defs to docbook-reader.docbook. Update golden tests for docx.	2021-02-10 22:04:11 -08:00
Albert Krewinkel	d202f7eb77	Avoid unnecessary use of NoImplicitPrelude pragma (#7089)	2021-02-07 10:02:35 -08:00
John MacFarlane	8e9131db4e	Markdown reader: improved handling of mmd link attributes in references. Previously they only worked for links that had titles. Closes #7080.	2021-02-06 21:52:12 -08:00
Andrew Dunning	4de9edb8e8	LaTeX template: Update to iftex package (#7073) Load the iftex package directly rather than via the ifxetex and ifluatex compatibility wrappers, which have been merged into a single package that is part of the LaTeX core. The capitalization of the commands has been changed for compatibility with older versions of TeX Live that have the version of iftex by the Persian TeX Group. This had been removed in <`2845794c0c`> for compatibility with BasicTeX, but that is no longer an issue.	2021-02-03 08:54:11 -08:00
John MacFarlane	e6c7fcc598	Fixed some compiler warnings in tests.	2021-02-02 21:09:10 -08:00
Albert Krewinkel	6f79042502	Add tests for search_path_separator	2021-02-02 21:04:30 -08:00
Albert Krewinkel	e0bf4bfe82	Check that all documented functions are present. Rely on tests in the module package to check the correctness of each function.	2021-02-02 21:04:30 -08:00
Albert Krewinkel	61b108d527	Lua: add module "pandoc.path" The module makes it possible to work with file paths in a convenient and platform-independent manner. Closes: #6001 Closes: #6565	2021-02-02 21:04:30 -08:00

1673 commits