pandoc

Author	SHA1	Message	Date
John MacFarlane	c348c4d4fe	Use `[x]` not `[X]` for asciidoctor checklists. See #7798.	2022-01-29 17:47:35 -08:00
Albert Krewinkel	a6fa3df114	HTML writer: avoid duplicate "style" attributes on table cells Fixes: #7871	2022-01-28 18:20:14 +01:00
Even Brenden	d36a16a4df	Don't read files outside of user data directory If a file path does not exist relative to the working directory, but it does exist relative to the user data directory, and it exists outside of the user data directory, do not read it. This applies to readDataFile and readMetadataFile in PandocMonad and, by extension, any module that uses these by passing them relative paths.	2022-01-28 08:51:27 -08:00
John MacFarlane	a9f901cf6b	CommonMark reader: fix source position after YAML metadata. Closes #7863.	2022-01-23 22:13:58 -08:00
John MacFarlane	24fa3ffddb	Fix compiler warnings.	2022-01-21 17:28:02 -08:00
John MacFarlane	9e0d146837	Update command tests to distinguish stderr and test exit status.	2022-01-21 15:01:50 -08:00
Even Brenden	7df29e495f	Search for metadata files in $DATADIR/metadata (#7851 ) If files specified with `--metadata-file` are not found in the working directory, look in `$DATADIR/metadata`. Expose new `readMetadataFile` function from Text.Pandoc.Class [API change]. Expose new `PandocCouldNotFindMetadataFileError` constructor for `PandocError` from Text.Pandoc.Error [API change]. Closes #5876.	2022-01-21 12:00:45 -08:00
John MacFarlane	52b78b10c8	Avoid putting a frame around speaker notes in beamer. If speaker notes (a Div with class 'notes') occur right after a section heading, but above slide level, the resulting `\note{..}` caommand should not be wrapped in a frame, as that will cause a spurious blank slide. Closes #7857.	2022-01-20 19:09:44 -08:00
John MacFarlane	ef8135b4a7	HTML writer: don't break lines inside code elements. With the new (default) line wrapping of HTML, in conjunction with the default CSS which includes `code { whitespace: pre-wrap; }`, spurious line breaks could be introduced into inline code. Closes #7858.	2022-01-20 09:17:34 -08:00
John MacFarlane	6723891c72	Markdown writer: handle explicit column widths with pipe tables. If a table has explicit column width information and the content extends beyond the `--columns` width, we need to adjust the widths of the pipe separators to encode this width information. Closes #7847.	2022-01-19 09:36:48 -08:00
Albert Krewinkel	b794b534a5	Remove unused file test/command/jats.csl	2022-01-19 10:12:32 +01:00
Michael Hoffmann	e146b1ff3b	Docx writer: Separate tables even with RawBlocks between (#7844 ) Adjacent docx tables need to be separated by an empty paragraph. If there's a RawBlock between tables which renders to nothing, be sure to still insert the empty paragraph so that they will not collapse together. Fixes #7724	2022-01-18 14:28:28 -08:00
Nikolai Korobeinikov	b683b8d48a	Support checklists in asciidoctor writer (#7832 ) The checklist syntax (similar to `task_list` in markdown) seems to be an asciidoctor-only addition. Co-authored-by: ricnorr <ricnorr@yandex-tream.ru>	2022-01-16 11:05:19 -08:00
Mauro Bieg	9e60142cc9	CSS in HTML template: adjust #TOC and h1 on mobile (#7835 )	2022-01-16 09:22:43 -08:00
John MacFarlane	c6cf78a033	Improve on fix to #7506 . Don't boldface code in output formats that can represent it as monospace. Define aliases for VI, VB, VBI as well.	2022-01-15 12:57:26 -08:00
John MacFarlane	c40727bfbb	Man writer: use custom font V for inline code. The V font is defined conditionally, so that it renders like CB in output formats that support that, and like B in those that don't (e.g. the terminal). We could just redefine C, but this would affect code blocks, too, and putting them all in boldface looks ugly, I think. Possible drawback: fragments created by pandoc's man writer will presuppose a nonstandard V font. Closes #7506. Supersedes `253467a549`.	2022-01-15 12:39:19 -08:00
John MacFarlane	253467a549	Man writer: Use boldface for inline code. Closes #7506. This also allows us to get rid of some special casing on definition lists that ensured that options in code spans would be boldface. (If this change is ever reverted, we'll need that again.)	2022-01-15 12:07:18 -08:00
John MacFarlane	e532bceb8a	Add test for #7826 notes-after-punctuation.	2022-01-13 08:41:05 -08:00
Michael Hoffmann	5001fd3f4d	Docx writer: Handle bullets correctly in lists by not reusing numIds (#7822 ) Make sure that we only create one bullet per list item in docx. In particular, when a div is a list item, its contained paragraphs will now no longer wrongly get individual bullets. This is accomplished by making sure that for each list, we only use the associated numId once. Any repeated use would add incorrect bullets to the document. Closes #7689	2022-01-11 15:48:41 -08:00
John MacFarlane	7bf1191686	HTML writer: don't break attributes values when wrapping.	2022-01-10 10:40:49 -08:00
John MacFarlane	6f739cdb4d	Fix regression: allow blank lines in HTML attributes. The commit `7a9832166e` had the effect that blank lines would be collapsed in HTML attributes. We also roll back a change that collapsed multiple spaces into one.	2022-01-10 10:29:25 -08:00
Lucas Viana	fb91a91615	Org reader: support alphabetical (fancy) lists This adds support for alphabetical lists in org by enabling the extension Ext_fancy_lists, mimicking the behaviour of Org Mode when org-list-allow-alphabetical is enabled. Enabling Ext_fancy_lists will also make Pandoc differentiate between the delimiters of ordered lists (periods or closing parentheses). Org does this differentiation by default when exporting to some formats (e.g. plain text) but does not in others (e.g. html and latex), so I decided to copy Pandoc's markdown reader behaviour.	2022-01-09 09:39:27 -08:00
John MacFarlane	66636c89b0	Org writer: fix list items starting with a code block... or other non-paragraph content. Closes #7810.	2022-01-08 23:21:15 -08:00
John MacFarlane	61968047e4	Avoid blank lines after tight sublists in org, haddock. T.P.Writers.Shared `endsWithPlain` now returns True if the list ends with a list which ends with a Plain. See #7810.	2022-01-08 23:11:08 -08:00
John MacFarlane	252211bd27	Org writer: don't add blank line before lists. The code to do this was apparently copied over from the RST writer, but these blank lines aren't necessary or desirable in org. See #7810 comment 3.	2022-01-08 19:39:26 -08:00
John MacFarlane	a965111680	Fix parsing of footnotes in `--metadata-file`. Closes #7813.	2022-01-07 15:58:26 -08:00
Lucas Viana	45e2e0d018	Org writer: support starting number cookies This complements #7806 by supporting writing Org ordered lists that start at a specific number.	2022-01-07 10:48:28 -08:00
Jesse Hathaway	4dcb2b6fb6	MediaWiki writer: Remove redundant display text for wiki links Prior to this commit the MediaWiki writer always added the display text for a wiki link: * [[Help\|Help]] * [[Bubbles\|Everyone loves bubbles]] However the display text in the first example is redundant since MediaWiki uses the target as the default display text. The result being: * [[Help]] * [[Bubbles\|Everyone loves bubbles]]	2022-01-06 15:05:39 -08:00
Lucas Viana	4be41e3bb5	Org reader: support counter cookies in lists This adds support for counter cookies in org lists. Such cookies are used to override the item counter in ordered lists. In org it is possible to set the counter at any list item, but since Pandoc AST does not support this, we restrict the usage to setting an offset for the entire ordered list, by using the cookie in the first list item. Note that even though unordered lists do not have counters, Org Mode still parses such cookies in unordered lists and suppresses them in the output, so we do the same. Also, even though org-list-allow-alphabetical is disabled in Emacs by default, for some reason alphabetical cookies are always parsed and used in Org Mode regardlessly of whether this option is enabled or the list style is decimal, so we do the same. E.g. 2. test 3. test Is parsed as an ordered list starting at 1, as before. This also conforms to Org Mode behaviour. 1. [@2] test 2. test Is now parsed as an ordered list starting at 2, so that it conforms to Org Mode behaviour. Note that when parsing 1. [@2] test 2. [@9] test the second cookie is silenced and the entire list starts at 2. This is because the current Pandoc AST does not support expressing a change in the counter at a specific item.	2022-01-06 19:33:13 +01:00
John MacFarlane	ea74582288	AsciiDoc writer: improve detection of intraword emphasis. Closes #7803.	2022-01-05 14:48:19 -08:00
Albert Krewinkel	1f8638fb54	Lua: add `pandoc.template` module The module provides a `compile` function to use strings as templates.	2022-01-04 11:55:59 -08:00
Albert Krewinkel	6a5ac90bf1	Lua: add `pandoc.WriterOptions` constructor	2022-01-04 11:55:59 -08:00
John MacFarlane	53699f2ab3	DocBook reader: be sensitive to spacing="compact" in lists. When spacing="compact" is set, Para elements are turned into Plain, so we get a "tight" list. Closes #7799.	2022-01-03 14:19:53 -08:00
John MacFarlane	811eee7cad	ConTeXt template: make title appear in PDF title bar. This is recommended for accessibility reasons. Note: doesn't work with macOS Preview.app. See https://groups.google.com/d/msgid/pandoc-discuss/m2lezx20jq.fsf%40MacBook-Pro-2.hsd1.ca.comcast.net	2022-01-02 22:04:55 -08:00
Tuong Nguyen Manh	32297d5677	Odt: Add list-header The list-header is a type of list-item. Therefore, it will be treated exactly like one.	2022-01-02 15:05:09 -08:00
John MacFarlane	808bcb5d3b	Change reference.pptx to use 16:9 aspect ratio. This is now Powerpoint's default.	2022-01-02 14:58:55 -08:00
Albert Krewinkel	b7a44f9d19	Copyright notices: update for 2022	2022-01-02 11:59:22 -08:00
Albert Krewinkel	eae9be3a48	Org reader: allow trailing spaces after key/value pairs in directives Ensures that spaces at the end of attribute directives like `#+ATTR_HTML: :width 100%` (note the trailing spaces) are accepted.	2022-01-01 13:44:14 +01:00
Albert Krewinkel	2dd1cde715	Lua: allow binary (byte string) readers to be used with `pandoc.read`	2021-12-30 22:41:15 +01:00
John MacFarlane	7d56650e01	OpenDocument writer: fix vertical align bug with display math. Previously some displayed formulas would be floated above a preceding text line. This is fixed by setting vertical-rel to 'text' rather than 'paragraph-content'. Closes #7777.	2021-12-28 16:06:25 -08:00
Albert Krewinkel	fbd2c8e376	Lua: improve handling of empty caption, body by `from_simple_table` Create truly empty table caption and body when these are empty in the simple table. Fixes: #7776	2021-12-25 21:20:18 +01:00
John MacFarlane	c4f6e6cb57	HTML writer: make line breaks more consistent. - With `--wrap=none`, we now output line breaks between block-level elements. Previously they were omitted entirely, so the whole document was on one line, unless there were literal line breaks in pre sections. This makes the HTML writer's behavior more consistent with that of other writers. - Put newline after `<dd>`. - Put newlines after block-level elements in footnote section.	2021-12-22 09:45:02 -08:00
John MacFarlane	7a9832166e	Add text wrapping to HTML output. Previously the HTML writer was exceptional in not being sensitive to the `--wrap` option. With this change `--wrap` now works for HTML. The default (as with other formats) is automatic wrapping to 72 columns. A new internal module, T.P.Writers.Blaze, exports `layoutMarkup`. This converts a blaze Html structure into a doclayout Doc Text. In addition, we now add a line break between an `img` tag and the associated `figcaption`. Note: Output is never wrapped in `writeHtmlStringForEPUB`. This accords with previous behavior since previously the HTML writer was insensitive to `--wrap` settings. There's no real need to wrap HTML inside a zipped container. Note that the contents of script, textarea, and pre tags are always laid out with the `flush` combinator, so that unwanted spaces won't be introduced if these occur in an indented context in a template. Closes #7764.	2021-12-22 09:45:02 -08:00
Albert Krewinkel	0bdf373157	Lua: simplify code of pandoc.utils.stringify Minor behavior change: plain strings nested in tables are now included in the result string.	2021-12-21 21:50:13 +01:00
Albert Krewinkel	edb04a78db	Lua tests: add more tests for `pandoc.utils.stringify`.	2021-12-21 21:25:16 +01:00
Albert Krewinkel	1c389bf6b6	Lua: add tests for pandoc.utils.equals	2021-12-21 19:00:21 +01:00
Albert Krewinkel	d7cab51982	Lua: add new library function `pandoc.utils.type`. The function behaves like the default `type` function from Lua's standard library, but is aware of pandoc userdata types. A typical use-case would be to determine the type of a metadata value.	2021-12-21 09:24:21 -08:00
Albert Krewinkel	cd2bffee1e	Lua: use more natural representation for Reference values Omit `false` boolean values, push integers as numbers.	2021-12-20 09:41:03 +01:00
binaarinen	0610f16f7f	Add a writer for Markua 0.10 (#7729 ) Markua is a markdown variant used by Leanpub. More information about Markua can be found at https://leanpub.com/markua/read. Adds a new exported function `writeMarkua` from T.P.Writers.Markdown. [API change] Closes #1871. Co-authored by Tim Wisotzki and Samuel Lemmenmeier.	2021-12-19 12:10:41 -08:00
Albert Krewinkel	dc3dcc2ccd	Lua: fixup, should have been part of previous commit	2021-12-19 14:31:52 +01:00
John MacFarlane	4b220d592c	Citeproc: avoid adding comma before an author-in-text citation... ...in a note if it begins with a title (no author). Closes #7761.	2021-12-18 12:13:06 -08:00
John MacFarlane	394fa9d072	Org reader: parse official org-cite citations. We also support the older org-ref style as a fallback. We no longer support the "markdown-style" citations. See #7329.	2021-12-14 11:34:32 -08:00
John MacFarlane	5817e86491	Org reader: remove support for "Berkeley style" citations. See #7329.	2021-12-14 09:20:26 -08:00
John MacFarlane	9f089aa286	Org writer: add tests for org-cite citations, and improve support.	2021-12-13 12:11:58 -08:00
Kolen Cheung	a9a9a2c62a	fix(IpynbOutput)!: rank always favors output format Previously, both `fmt == f` case and Image have a rank of 1. In the end, e.g. from ipynb to html conversion, if both html and image exists, it actually prefers the image. This commit changes this, so that fmt == f is always highest rank, and rank never collides. This is achieved by keeping fmt == f case having rank 1, and every other rank increased by 1.	2021-12-11 09:42:30 -08:00
Albert Krewinkel	bfb3118ebb	Lua tests: remove roundtrip tests Property tests that roundtrip elements through the Lua stack are performed in the test-suite of the pandoc-lua-marshal package. No need to test this here as well.	2021-12-10 18:28:54 +01:00
Albert Krewinkel	a64ea18647	Powerpoint tests: shorten lines by grouping tests This makes the test output more pleasant to read in narrow terminal windows.	2021-12-10 18:25:28 +01:00
Kolen Cheung	20eb8ac7fd	ipynb writer: handle cell output with raw block of markdown (#7563 ) Write RawBlock of markdown in code-cell output. #7561 makes the ipynb reader reads code-cell output with mime "text/markdown" to a RawBlock of markdown This commit makes the ipynb writer writes this RawBlock of markdown back inside a code-cell output with the same mime, preserving this information in round-trip Add tests of ipynb reader (#7561) and ipynb writer (#7563)'s ability to handle a "text/markdown" mime type in a code-cell output	2021-12-09 20:36:56 -08:00
Albert Krewinkel	fa643ba6d7	Lua: update to latest pandoc-lua-marshal (0.1.1) - `walk` methods are added to `Block` and `Inline` values; the methods are similar to `pandoc.utils.walk_block` and `pandoc.utils.walk_inline`, but apply to filter also to the element itself, and therefore return a list of element instead of a single element. - Functions of name `Doc` are no longer accepted as alternatives for `Pandoc` filter functions. This functionality was undocumented.	2021-12-09 09:22:29 -08:00
John MacFarlane	ccb9db34f8	Add test for #7738 .	2021-12-07 23:59:59 -08:00
John MacFarlane	51142c6803	Ipynb reader & writer: properly handle cell "id". This is passed through if it exists (in Nb4); otherwise the writer will add a random one so that cells all have an "id". Closes #7728.	2021-12-06 23:40:51 -08:00
John MacFarlane	51f6f0e3a1	Improve Markdown writer escaping. This fixes escaping for '#' in particular. Closes #7726.	2021-12-03 17:52:47 -08:00
John MacFarlane	619dfa2a2a	Markdown reader: don't allow `^` at beginning of link or image label. This is reserved for footnotes. Fixes a regression introduced by `0a93acf`. Closes #7723.	2021-11-30 12:53:54 -08:00
Albert Krewinkel	fa838deefc	Lua: remove `pandoc.utils.text` (#7720 ) The new `pandoc.Inlines` function behaves identical on string input, but allows other Inlines-like arguments as well. The `pandoc.utils.text` function could be written as function pandoc.utils.text (x) assert(type(x) == 'string') return pandoc.Inlines(x) end	2021-11-29 09:12:30 -08:00
Albert Krewinkel	b9222e5cb1	Lua: add constructors `pandoc.Blocks` and `pandoc.Inlines` The functions convert their argument into a list of Block and Inline values, respectively.	2021-11-28 16:02:42 +01:00
Albert Krewinkel	3692a1d1e8	Lua: use package pandoc-lua-marshal (#7719 ) The marshaling functions for pandoc's AST are extracted into a separate package. The package comes with a number of changes: - Pandoc's List module was rewritten in C, thereby improving error messages. - Lists of `Block` and `Inline` elements are marshaled using the new list types `Blocks` and `Inlines`, respectively. These types currently behave identical to the generic List type, but give better error messages. This also opens up the possibility of adding element-specific methods to these lists in the future. - Elements of type `MetaValue` are no longer pushed as values which have `.t` and `.tag` properties. This was already true for `MetaString` and `MetaBool` values, which are still marshaled as Lua strings and booleans, respectively. Affected values: + `MetaBlocks` values are marshaled as a `Blocks` list; + `MetaInlines` values are marshaled as a `Inlines` list; + `MetaList` values are marshaled as a generic pandoc `List`s. + `MetaMap` values are marshaled as plain tables and no longer given any metatable. - The test suite for marshaled objects and their constructors has been extended and improved. - A bug in Citation objects, where setting a citation's suffix modified it's prefix, has been fixed.	2021-11-27 17:08:01 -08:00
John MacFarlane	0d25232bbf	LaTeX reader: Fix semantics of `\ref`. We were including the ams environment type in addition to the number. This is proper behavior for `\cref` but not for `\ref`. To support `\cref` we need to store the environment label separately.	2021-11-24 19:48:56 -08:00
John MacFarlane	7726b69cd3	LaTeX reader: omit visible content for `\label{...}`. Previously we included the text of the label in square brackets, but this is undesirable in many cases. See discussion in <https://github.com/jgm/pandoc/issues/813#issuecomment-978232426>.	2021-11-24 14:47:00 -08:00
John MacFarlane	6072bdcec9	HTML reader: parse attributes on links and images. Closes #6970.	2021-11-24 11:01:55 -08:00
Albert Krewinkel	a8638894ab	Lua: allow single elements as singleton MetaBlocks/MetaInlines Single elements should always be treated as singleton lists in the Lua subsystem.	2021-11-24 16:54:12 +01:00
John MacFarlane	79e6f8db13	Improve detection of pipe table line widths. Fixed calculation of maximum column widths in pipe tables. It is now based on the length of the markdown line, rather than a "stringified" version of the parsed line. This should be more predictable for users. In addition, we take into account double-wide characters such as emojis. Closes #7713.	2021-11-23 13:29:25 -08:00
Albert Krewinkel	bffd74323c	Lua: add function `pandoc.utils.text` (#7710 ) The function converts a string to `Inlines`, treating interword spaces as `Space`s or `SoftBreak`s. If you want a `Str` with literal spaces, use `pandoc.Str`. Closes: #7709	2021-11-23 09:32:53 -08:00
Albert Krewinkel	0c0945b93c	Lua: split strings into words when treating them as Inline list (#7712 ) Using a Lua string where a list of inlines is expected will cause the string to be split into words, replacing spaces and tabs into `pandoc.Space()` elements and newlines into `pandoc.SoftBreak()`. The previous behavior was to treat the string `s` as `{pandoc.Str(s)}`. The old behavior can be recovered by wrapping the string into a table `{s}`.	2021-11-23 09:30:48 -08:00
Albert Krewinkel	96a4bbe264	Capture `alt-text` in JATS figures (#7703 ) Co-authored-by: Aner Lucero <4rgento@gmail.com>	2021-11-20 09:48:01 -08:00
John MacFarlane	db9a73c842	Lua tests: reset path and cpath when testing 'require' fallback.	2021-11-19 21:55:14 -08:00
John MacFarlane	4f2eac88aa	MediaWiki writer: fix code for generating spans for header IDs. We need to generate a span when the header's ID doesn't match the one MediaWiki would generate automatically. But MediaWiki's generation scheme is different from ours (it uses uppercase letters, and `_` instead of `-`, for example). This means that in going from markdown -> mediawiki, we'll now get spans before almost every heading, unless explicit identifiers are used that correspond to the ones MediaWiki auto-generates. This is uglier output but it's necessary for internal links to work properly. See #7697.	2021-11-19 09:05:19 -08:00
John MacFarlane	df5ae1c186	HTML writer: Don't create invalid `data-` attribute... for empty attribute key. (It would be better to make these unrepresentable in the type system, but for now this is an improvement.) Closes #7546.	2021-11-19 08:50:18 -08:00
John MacFarlane	25bba0cc62	MediaWiki writer: use HTML spans for anchors when header has id. Closes #7697.	2021-11-18 21:15:51 -08:00
willj-dev	005dc7ce56	RST reader: handle class attribute for for custom roles (#7700 ) Previously the class attribute was ignored, and the name of the role used as the class. Closes #7699.	2021-11-18 17:33:57 -08:00
Albert Krewinkel	cd91f72843	Lua: set `lpeg`, `re` as globals; allow shared lib access via require The `lpeg` and `re` modules are loaded into globals of the respective name, but they are not necessarily registered as loaded packages. This ensures that - the built-in library versions are preferred when setting the globals, - a shared library is used if pandoc has been compiled without `lpeg`, and - the `require` mechanism can be used to load the shared library if available, falling back to the internal version if possible and necessary.	2021-11-17 10:03:04 +01:00
John MacFarlane	c19f063420	Markdown writer: don't create autolinks when this loses information. Previously we sometimes lost attributes when rendering links as autolinks. Closes #7692.	2021-11-15 11:06:50 -08:00
Albert Krewinkel	ea268fd8a7	LaTeX reader: add rudimentary support for `\autoref` (#7693 )	2021-11-15 09:40:50 -08:00
Albert Krewinkel	96a01451ef	JATS writer: ensure figures are wrapped with `<p>` in list items. This prevents the generation of invalid output.	2021-11-12 13:29:08 +01:00
Christian Despres	abdfefebdf	Writers.Shared: Improve toLegacyTable. Closes #7683. (PR #7684)	2021-11-11 20:55:37 -08:00
Albert Krewinkel	d4c73d5e65	Lua: fix argument order in constructor `pandoc.Cite`. This restores the old behavior; argument order had been switched accidentally in pandoc 2.15.	2021-11-09 14:45:36 +01:00
John MacFarlane	0a45f2600a	Properly handle commented lines in BibTeX/BibLaTeX. Closes #7668.	2021-11-08 10:15:53 -08:00
Rowan Rodrik van der Molen	40aa74badc	Add `<titleabbr>` support to DocBook reader	2021-11-08 07:30:20 -08:00
Albert Krewinkel	ab0fe676a8	Lua: ensure that 're' module is always available. The module is shipped with LPeg.	2021-11-08 12:22:33 +01:00
John MacFarlane	cc46667953	LaTeX reader: add 'uri' class when parsing `\url`. Closes #7672.	2021-11-07 21:08:20 -08:00
Albert Krewinkel	6b462e5933	Lua: allow to pass custom reader options to `pandoc.read` Reader options can now be passed as an optional third argument to `pandoc.read`. The object can either be a table or a ReaderOptions value like `PANDOC_READER_OPTIONS`. Creating new ReaderOptions objects is possible through the new constructor `pandoc.ReaderOptions`. Closes: #7656	2021-11-06 09:04:29 -07:00
Rowan Rodrik van der Molen	7a70a46c03	Support for <indexterm>s when reading DocBook (#7607 ) * Support for <indexterm>s when reading DocBook * Update implementation status of `<n-ary>` tags * Remove non-idiomatic parentheses * More complete `<indexterm>` support, with tests Co-authored-by: Rowan Rodrik van der Molen <rowan@ytec.nl>	2021-11-05 10:22:38 -07:00
John MacFarlane	938d557844	Docx reader: don't let first line indents trigger block quotes. This fixes a regression introduced in pandoc 2.15 by PR #7606. Closes #7655.	2021-11-02 14:04:38 -07:00
Albert Krewinkel	45bcd7d3f1	Lua: fix typo in SoftBreak constructor	2021-11-02 21:53:08 +01:00
Albert Krewinkel	dbc654e4a7	Lua tests: ensure Inline elements have all expected properties	2021-11-02 21:40:37 +01:00
Albert Krewinkel	421fd736d4	Lua: re-add `content` property to Strikeout elements Fixes a regression introduced in 2.15.	2021-11-02 21:22:59 +01:00
Albert Krewinkel	cce49c5d4b	Lua: be more forgiving when retrieving the Image `caption` property Fixes a regression introduced in 2.15.	2021-11-02 17:40:07 +01:00
Albert Krewinkel	b26f950cca	Lua: display Attr values using their native Haskell representation	2021-11-02 17:25:47 +01:00
Albert Krewinkel	c467f0fed1	Lua: allow omitting the 2nd parameter in pandoc.Code constructor Fixes a regression introduced in 2.15 which required users to always specify an Attr value when constructing a Code element.	2021-11-02 17:23:24 +01:00
Albert Krewinkel	210e4c98b0	Lua: allow to compare, show Citation values Comparisons of Citation values are performed in Haskell; values are equal if they represent the same Haskell value. Converting a Citation value to a string now yields its native Haskell string representation.	2021-11-02 16:49:50 +01:00
Albert Krewinkel	3a0ac52f7b	Lua tests: ensure Block elements have expected properties	2021-11-02 10:23:14 +01:00
Albert Krewinkel	759aa50951	Lua: restore `content` property on Header elements	2021-11-01 15:43:51 +01:00
Albert Krewinkel	96e76d4cd4	Lua: restore List behavior of MetaList Fixes a regression introduced in 2.16 which had MetaList elements loose the `pandoc.List` properties. Fixes #7650	2021-11-01 08:27:14 +01:00
Albert Krewinkel	3de8f4fdc5	Lua: re-add `content` property to Link elements This was a regression introduced in version 2.15. Fixes: #7647	2021-10-31 11:15:50 +01:00
Tristan Stenner	2444cbc668	Docx writer: add IDs to native_numbering test	2021-10-29 08:40:20 -07:00
Tristan Stenner	31d6a494de	Update test golden master for docx native numbering	2021-10-29 08:40:20 -07:00
Albert Krewinkel	f4d9b443d8	Lua: use hslua module abstraction where possible This will make it easier to generate module documentation in the future.	2021-10-29 17:08:30 +02:00
Albert Krewinkel	e1cf0ad1be	Lua: fix placement of tests for Block elements in pandoc module tests	2021-10-28 14:10:54 +02:00
Albert Krewinkel	7fcf1d6184	Lua: re-add `t` and `tag` property to Attr values Removal of these properties from Attr values was a regression.	2021-10-27 22:32:19 +02:00
John MacFarlane	d226a35c0a	Switch back from HsYAML to yaml. Reasons: - Performance: HsYAML is around 20 times slower in parsing large YAML bibliographies (#6084). - An issue was submitted to HsYAML, but it hasn't gotten any attention. HsYAML seems borderline unmaintained; it hasn't had a commit in over a year. - Unfortunately this goes back on our attempts to free ourselves from C dependencies (#4535). But I don't see a better alternative until a better pure Haskell parser is available. Closes #6084. Notes: - We've removed the FromYAML instances for all types that had them, since this is a HsYAML-specific typeclass [API change]. (The yaml package just uses From/ToJSON.) - Unlike HsYAML (in the configuration we were using), yaml parses 'Y', 'N', 'Yes', 'No', 'On', 'Off' as boolean values. Users may need to quote these when they are meant to be interpreted as strings. Similarly, 'null' is parsed as a YAML null value (and will be treated as an empty string by pandoc rather than the string 'null'). Quoting it will force it to be interpreted as a string. - Some tests had to be adjusted accordingly. - Pandoc now behaves better when the YAML metadata contains escaping errors: instead of just falling back on treating the section as a table, it raises a YAML parsing error.	2021-10-27 12:50:51 -07:00
Albert Krewinkel	b95e864ecf	Lua: marshal SimpleTable values as userdata objects	2021-10-26 21:45:16 +02:00
Albert Krewinkel	a493c7029c	Lua: marshal Block values as userdata objects Properties of Block values are marshalled lazily, which generally improves performance considerably. Script users may also notice the following differences: - Block element properties can no longer be accessed by numerical indexing of the `.c` field. The `.c` property now serves as an alias for `.content`, so some filter that used this undocumented method for property access may continue to work, while others will need to be updated and use proper property names. - The marshalled Block elements now have a `show` method, and a `__tostring` metamethod. Both return the Haskell string representation of the element. - Block values now have the Lua type `userdata` instead of `table`.	2021-10-26 14:40:10 +02:00
Albert Krewinkel	230b133db5	Lua: marshal Citation values as userdata objects	2021-10-25 09:08:58 +02:00
John MacFarlane	113a66bd08	Fix more epub files in epub reader tests. Closes #7586.	2021-10-24 15:24:59 -07:00
John MacFarlane	73a105f59c	Clean up wasteland.epub and formatting.epub from reader tests. Make them valid according to epubcheck.	2021-10-24 15:11:02 -07:00
John MacFarlane	c282451fb2	Fixed test/epub/img.epub and img_no_cover.epub... so they're valid epubs.	2021-10-24 14:13:04 -07:00
John MacFarlane	6df9dc1262	Fix conformance errors in test/epub/features.epub and test/epub/formatting.epub. See #7586.	2021-10-23 23:23:22 -07:00
John MacFarlane	c712d13b67	Org reader: allow an initial :PROPERTIES: drawer to add to metadata. Closes #7520.	2021-10-22 22:10:25 -07:00
Albert Krewinkel	8523bb01b2	Lua: marshal Attr values as userdata - Adds a new `pandoc.AttributeList()` constructor, which creates the associative attribute list that is used as the third component of `Attr` values. Values of this type can often be passed to constructors instead of `Attr` values. - `AttributeList` values can no longer be indexed numerically.	2021-10-22 11:16:51 -07:00
Albert Krewinkel	e4287e6c95	Lua: marshal Pandoc values as userdata	2021-10-22 11:16:51 -07:00
Albert Krewinkel	9e74826ba9	Switch to hslua-2.0 The new HsLua version takes a somewhat different approach to marshalling and unmarshalling, relying less on typeclasses and more on specialized types. This allows for better performance and improved error messages. Furthermore, new abstractions allow to document the code and exposed functions.	2021-10-22 11:16:51 -07:00
John MacFarlane	0a93acf91a	Markdown reader: don't parse links or bracketed spans as citations. Previously pandoc would parse [link to (@a)](url) as a citation; similarly [(@a)]{#ident} This is undesirable. One should be able to use example references in citations, and even if `@a` is not defined as an example reference, `[@a](url)` should be a link containing an author-in-text citation rather than a normal citation followed by literal `(url)`. Closes #7632.	2021-10-20 10:34:47 -07:00
Milan Bracke	465c28d28e	Docx reader: fix handling of empty fields Some fields only have an instrText and no content, Pandoc didn't understand these, causing other fields to be misunderstood because it seemed like a field was still open when it wasn't.	2021-10-18 19:15:40 -07:00
Milan Bracke	6acc82c5d2	Docx parser: implement PAGEREF fields These fields, often used in tables of contents, can be a hyperlink.	2021-10-18 19:15:40 -07:00
Milan Bracke	193f6bfeba	Docx reader: fix handling of nested fields Fields delimited by fldChar elements can contain other fields. Before, the nested fields would be ignored, except for the end, which would be considered the end of the parent field. To fix this issue, fields needed to be considered containing ParParts instead of Runs, since a Run can't represent complex enough structures. This also impacted Hyperlinks since they can originate from a field.	2021-10-18 19:15:40 -07:00
Emily Bourke	8de261ba4e	pptx: Line up continuation paragraphs This commit changes the `marL` and `indent` values used for plain paragraphs and numbered lists, and changes the spacing defined in the reference doc master for bulleted lists. For paragraphs, there is now a left-indent taken from the `otherStyle` in the master. For numbered lists, the number is positioned where the text would be if this were a plain paragraph, and the text is indented to the next level. This means that continuation paragraphs line up nicely with numbered lists. It also /mostly/ matches the observed PowerPoint behaviour when inserting paragraphs and numbered lists: the only difference is that PowerPoint was using a different margin value for the first level numbered lists – I’ve changed this to match the other levels, as I don’t think it makes the spacing unappealing and it allows continuation paragraphs at any level to line up. With bulleted lists, I’m keeping the observed PowerPoint behaviour of specifying only a level, letting `marL` and `indent` be automatically taken from `bodyStyle`. To that end, this commit changes the `bodyStyle` spacing in the master of the default reference doc, to: - line up the text of the first paragraph in each bullet with any continuation paragraphs - line up nested bullet markers in any continuation paragraphs with the first paragraph, matching lists and plain paragraphs This does mean the continuation paragraphs still won’t line up for anyone using their own reference doc where they haven’t matched the `otherStyle` and `bodyStyle` indent levels, but I think people in that situation will be able to troubleshoot.	2021-10-17 17:24:30 -07:00
Emily Bourke	8af15ab345	pptx: Fix list level numbering In PowerPoint, the content of a top-level list is at the same level as the content of a top-level paragraph – the only difference is that a list style has been applied. At the moment, the pptx writer increments the paragraph level on each list, turning what should be top-level lists into second-level lists. This commit changes that logic, only incrementing the paragraph level on continuation paragraphs of lists. - Fixes https://github.com/jgm/pandoc/issues/4828 - Fixes https://github.com/jgm/pandoc/issues/4663	2021-10-17 17:24:30 -07:00
John MacFarlane	3f489bcb58	Ensure that babel is loaded also with pdflatex. This fixes a regression in #7604, which modernized babel usage but omitted to load babel for pdflatex, with the result that even simple documents could no longer be produced. Closes #7627.	2021-10-16 23:34:53 -07:00
Samuel Tardieu	a41c1fe0bb	asciidoc writer: translate numberLines attribute to linesnum switch AsciiDoctor allows to request line numbering on code blocks by using a switch on the `source` block, such as in: ``` [source%linesnum,haskell] ---- some Haskell code here ---- ```	2021-10-14 13:41:12 -07:00
Samuel Tardieu	628cde48cf	DocBook reader: honor linenumbering attribute The attribute DocBook linenumbering="numbered" attribute on code blocks maps to "numberLines" internally.	2021-10-14 09:04:56 -07:00
John MacFarlane	49c4e1d014	Fix markdown parsing bug for math in bracketed spans and links. This affects math with unbalanced brackets (e.g. `$(0,1]$`) inside links, images, bracketed spans. Closes #7623.	2021-10-13 08:59:37 -07:00
John MacFarlane	4ba4533d70	Update wasteland tests. When we trimmed it down we left out some notes.	2021-10-11 09:09:51 -07:00
Milan Bracke	0f98cbff4b	Avoid blockquote when parent style has more indent When a paragraph has an indentation different from the parent (named) style, it used to be considered a blockquote. But this only makes sense when the paragraph has more indentation. So this commit adds a check for the indentation of the parent style.	2021-10-10 16:27:32 -07:00
John MacFarlane	c72277e986	LaTeX reader: Properly handle `\^` followed by group closing. Closes #7615.	2021-10-10 11:24:28 -07:00
Emily Bourke	aa78765bf9	pptx: Remove excessive layout tests When I added the tests for moved layouts and deleted layouts, I added them to all tests. However, this doesn’t really give a lot more info than having single tests, and the extra tests take up time and disk space. This commit removes the moved-layouts and deleted-layouts tests, in favour of a single test for each of those scenarios.	2021-10-07 08:45:43 -07:00
John MacFarlane	b8d460eeab	Powerpoint writer: consolidate text runs when possible. This slims down the output files by avoiding unnecessary text run elements. Updated golden tests.	2021-10-04 12:24:12 -07:00
John MacFarlane	11baeb8850	OOXML tests: use pretty-printed form to display diffs. Otherwise everything is on one line and the diff is uninformative.	2021-10-04 12:12:16 -07:00
John MacFarlane	82d587493d	Revert "Powerpoint writer: consolidate text run nodes." This reverts commit `62f83aa486`. This was already being done, it seems. I misidentified the problem; it is really with `Str ""` nodes.	2021-10-04 11:50:32 -07:00
John MacFarlane	62f83aa486	Powerpoint writer: consolidate text run nodes. This should reduce the size of the generated files.	2021-10-04 11:45:01 -07:00
John MacFarlane	0088e79cf5	Update tests for babel-related changes in latex template.	2021-10-03 19:27:37 -07:00
John MacFarlane	6ff04ac52d	Fix compareXML helper in Tests.Writers.OOXML. Given how it is used, we were getting "mine" and "good" flipped in the test results.	2021-10-02 06:52:40 -07:00
Ezwal	472b33095e	Docx reader: Add placeholder for word diagram	2021-09-30 12:44:44 -07:00
John MacFarlane	92abe45863	Further test updates for switch to pretty-show.	2021-09-29 08:28:54 -07:00
John MacFarlane	0bdcf415e4	Switch from pretty-simple to pretty-show for native output. Update tests. Reason: it turns out that the native output generated by pretty-simple isn't always readable by the native reader. According to https://github.com/cdepillabout/pretty-simple/issues/99 it is not a design goal of the library that the rendered values be readable using 'read'. This makes it unsuitable for our purposes. pretty-show is a bit slower and it uses 4-space indents (non-configurable), but it doesn't have this serious drawback.	2021-09-28 21:17:53 -07:00
John MacFarlane	665e6d3d94	BibTeX parser: fix expansion of special strings in series... e.g. `newseries` or `library`. Expansion should not happen when these strings are protected in braces, or when they're capitalized. Closes #7591.	2021-09-23 22:21:05 -07:00
John MacFarlane	aa89f6be18	HTML reader: handle empty tbody element in table. Closes #7589.	2021-09-23 09:25:37 -07:00
John MacFarlane	3de0d5a977	test/epub: an excerpt from The Wasteland is enough! Saves over 100K.	2021-09-21 18:54:42 -07:00
John MacFarlane	c6fa0bc271	Revert "Remove unused epub test file features.epub." This reverts commit `83ebb85b64`.	2021-09-21 17:03:25 -07:00
John MacFarlane	a88b0098d6	Make test/epub/wasteland.epub valid.	2021-09-21 16:49:58 -07:00
John MacFarlane	83ebb85b64	Remove unused epub test file features.epub.	2021-09-21 16:42:33 -07:00
John MacFarlane	c266734448	Use pretty-simple to format native output. Previously we used our own homespun formatting. But this produces over-long lines that aren't ideal for diffs in tests. Easier to use something off-the-shelf and standard. Closes #7580. Performance is slower by about a factor of 10, but this isn't really a problem because native isn't suitable as a serialization format. (For serialization you should use json, because the reader is so much faster than native.)	2021-09-21 12:37:42 -07:00

1 2 3 4 5 ...

1972 commits