pandoc

Author	SHA1	Message	Date
John MacFarlane	d0bf7efe95	Man writer: fixed boldfacing of definition terms. Previously the bold-facing would be interrupted by other formatting, because we used `.B`. Closes #5620.	2019-07-13 16:12:28 -07:00
John MacFarlane	1784161946	LaTeX reader: Properly handle \providecommand and environment... They are now ignored if the corresponding command or environment is already defined. Closes #5635.	2019-07-13 15:51:33 -07:00
mb21	6cf5c3f6ac	fix filename and issue reference of previous commit	2019-07-13 12:03:45 +02:00
John MacFarlane	6d30d3e0b3	Pass through aria- attributes to HTML5. Also document addition of data- prefix to unknown attributes. Closes #5646.	2019-07-12 17:03:01 -07:00
Brian Leung	1d9ff85b45	RST reader: keep `name` property in `imgAttr`. (#5637 ) Closes #5619.	2019-07-10 18:35:01 -07:00
Brian Leung	9c4ba81357	Markdown reader: handle inline code more eagerly within lists. (#5628 ) Closes #5627.	2019-07-06 23:14:21 +02:00
oquechy	f0edf60364	Support epigraph command in LaTeX Reader. Closes #3523.	2019-06-21 18:27:26 +02:00
John MacFarlane	bec95c97ac	LaTeX writer: Don't highlight code in headings. This causes compilation errors, and I don't know how to work around them. Closes #5574.	2019-06-11 20:47:29 -07:00
John MacFarlane	3febd81cbc	LaTeX writer: Use mbox to get proper behavior inside `\sout`. Closes #5529.	2019-06-10 15:02:48 -07:00
John MacFarlane	59529e408b	Asciidoc writer: use doubled ## when necessary for spans. Closes #5566.	2019-06-10 14:47:04 -07:00
John MacFarlane	2e12106a90	Asciidoc writer: ensure correct nesting strong/emph. Closes #5565.	2019-06-10 14:42:08 -07:00
John MacFarlane	d1df2b2783	LaTeX reader: pass through unknown listings language as class. Previously if the language was not in the list of listings- supported languages, it would not be added as a class, so custom syntax highlighting could not be used. Closes #5540.	2019-06-08 12:25:34 -07:00
John MacFarlane	d8b4e45be0	LaTeX writer: Include inline code attributes with `--listings`. Closes #5420.	2019-06-07 10:03:10 -07:00
John MacFarlane	10615420de	Include trailing {}s in raw latex commands. Change is in rawLaTeXInline in LaTeX reader, but it affects the markdown reader and other readers that allow raw LaTeX. Previously, trailing `{}` would be included for unknown commands, but not for known commands. However, they are sometimes used to avoid a trailing space after the command. The chances that a `{}` after a LaTeX command is not part of the command are very small. Closes #5439.	2019-06-04 21:20:11 -07:00
John MacFarlane	f82d91eb49	Markdown reader: don't create implicit reference for empty header. Closes #5549.	2019-06-04 08:39:54 -07:00
John MacFarlane	928681ca04	Avoid unwanted interpretation of def list term as other kind of block, e.g. ordered list item, in Markdown writer. Closes #554.	2019-06-03 09:51:19 -07:00
mb21	a58304e00e	HTML writer: output video and audio elements depending on file extension of the image path	2019-05-29 09:43:50 +02:00
John MacFarlane	2ad5dacf87	Remove command test for #5517 . We need a better test that works cross-platform. Until then, removing this. Closes #5528.	2019-05-28 12:45:31 -07:00
Mauro Bieg	3f57f49033	HTML writer: emit empty alt tag in figures (#5518 ) The same text is already in the <figcaption> and screen-readers would read it twice, see #4737	2019-05-28 12:31:41 -04:00
John MacFarlane	8a5b9ac868	Add test for relative file: URI to #5517 .	2019-05-28 09:05:28 -07:00
Mauro Bieg	214da7217b	Fix handling of `file:` URL scheme in `downloadOrRead` (#5522 ) Move up the pattern match to be reachable, closes #5517. Previously `file:/` URLs were handled wrongly and pandoc attempted to make HTTP requests, which failed.	2019-05-28 11:51:21 -04:00
Alexander Krotov	7514277454	HTML reader: trim definition list terms	2019-05-25 18:36:56 +03:00
John MacFarlane	aef71894ce	Markdown writer: Ensure the code fence is long enough. Previously too few backticks were used when the code block contained an indented line of backticks. (Ditto tildes.) Closes #5519.	2019-05-22 15:21:15 -07:00
Jesse Rosenthal	ed73bd28e5	Markdown writer: Handle labels with integer names Previously if labels had integer names, it could produce a conflict with auto-labeled reference links. Now we test for a conflict and find the next available integer. Note that this involves adding a new state variable `stPrevRefs` to keep track of refs used in other document parts when using `--reference-location=block\|section` Closes #5495	2019-05-21 12:19:59 -04:00
Albert Krewinkel	da9638e6a3	Org writer: always indent src blocks content by 2 spaces Emacs always uses two spaces when indenting the content of src blocks, e.g., when exiting a `C-c '` edit-buffer. Pandoc used to indent contents by the space-equivalent of one tab, but now always uses two spaces, too. Closes: #5440	2019-05-12 14:49:52 +02:00
John MacFarlane	a20323033e	Fix footnote in image caption. Regression! The fix for #4683 broke this case.	2019-05-01 16:56:37 -07:00
John MacFarlane	f11d0c9dc8	HTML: prevent gratuitous emojification on iOS. iOS chooses to render a number of Unicode entities, including '↩', as big colorful emoji. This can be defeated by appending Unicode 'VARIATION SELECTOR-15'/'VARIATION SELECTOR-16'. So we now append this character when escaping strings, for both '↩' and '↔'. If other characters prove problematic, they can simply be added to needsVariationSelector. Closes #5469.	2019-04-30 22:32:52 -07:00
John MacFarlane	e409509a68	RST writer: treat Span as transparent. Previously an Emph inside a Span was being treated as nested markup and ignored. With this patch, the Span is just ignored. Closes #5446.	2019-04-15 09:48:11 -07:00
John MacFarlane	23df94e30a	Update command test #5416 to make it windows friendly	2019-04-02 17:59:47 -07:00
Mauro Bieg	0fa6951dc1	Dokuwiki Reader fix: parse single curly brace (#5417 ) fixes #5416	2019-04-01 11:36:47 -06:00
John MacFarlane	93ee73e1dc	LaTeX writer: Avoid inadvertently creating `` ?` `` or `` !` `` ligatures. These are upside down ? and !, resp. Closes #5407.	2019-03-29 10:04:22 -07:00
John MacFarlane	40865958ce	Markdown reader: fenced div takes priority over setext header. For ::: {.cell} --- :::	2019-03-28 17:39:22 -07:00
John MacFarlane	1e60776226	LaTeX writer: Fix footnotes in table caption and cells. This fixes a bug wherein footnotes appeared in the wrong order, and with duplicate numbers, when in table captions and cells. We now use regular `\footnote` commands, even in the table caption and the minipages containing cells. Apparently longtable knows how to handle this. Closes #5367.	2019-03-22 11:55:41 -07:00
John MacFarlane	6be8f4e953	Improved fix to #5340 and added test.	2019-03-18 16:53:36 -07:00
John MacFarlane	3880a23de9	Properly escape attributes in Markdown writer. Closes #5369.	2019-03-17 18:15:47 -07:00
John MacFarlane	ebd7035a2a	Add test case for #5368 .	2019-03-17 18:02:59 -07:00
John MacFarlane	0bed0ab5a3	Use XDG data directory for user data directory. Instead of `$HOME/.pandoc`, the default user data directory is now `$XDG_DATA_HOME/pandoc`, where `XDG_DATA_HOME` defaults to `$HOME/.local/share` but can be overridden by setting the environment variable. If this directory is missing, then `$HOME/.pandoc` is searched instead, for backwards compatibility. However, we recommend moving local pandoc data files from `$HOME/.pandoc` to `$HOME/.local/share/pandoc`. On Windows the default user data directory remains the same. Closes #3582.	2019-03-02 15:03:59 -08:00
John MacFarlane	ba05e1ea02	Shared.compactify: Avoid mixed lists. This improves on the original fix to #5285 by preventing other mixed lists (lists with a mix of Plain and Para elements) that were allowed given the original fix.	2019-02-25 17:33:54 -08:00
John MacFarlane	38c028bd50	JATS reader: fix parsing of figures. This ensures that a figure containing a single image is parsed as a pandoc "implicit figure" (i.e., a Para with a single Image whose title attribute begins with `fig:`). More complex figures will still be parsed as divs. Closes #5321.	2019-02-23 15:40:06 -07:00
John MacFarlane	d7d1c9c8e4	Markdown reader: fix bug parsing fenced code blocks. Previously parsing would break if the code block contained a string of backticks of sufficient length followed by something other than end of line. Closes #5304.	2019-02-15 22:34:32 -08:00
John MacFarlane	47537d26db	Improve tight/loose list handling. Closes #5285. Previously the algorithm allowed list items with a mix of Para and Plain, which is never wanted. compactify in T.P.Shared has been modified so that, if a list's items contain (at the top level) Para elements (aside from perhaps at the very end), ALL Plains are converted to Paras.	2019-02-08 23:16:01 -08:00
John MacFarlane	ccf4e23ee1	Markdown reader: add newline when parsing blocks in YAML. Otherwise last block gets parsed as a Plain rather than a Para. This is a regression in pandoc 2.x. This patch restores pandoc 1.19 behavior. Closes #5271.	2019-02-04 10:22:02 -08:00
John MacFarlane	b74267406b	Update test for last commit.	2019-02-02 16:20:06 -08:00
John MacFarlane	633a9ecfec	LaTeX writer: avoid `{}` after control sequences when escaping. `\ldots{}.` doesn't behave as well as `\ldots.` with the latex ellipsis package. This patch causes pandoc to avoid emitting the `{}` when it is not necessary. Now `\ldots` and other control sequences used in escaping will be followed by either a `{}`, a space, or nothing, depending on context. Thanks to Elliott Slaughter for the suggestion.	2019-02-01 21:17:46 -08:00
John MacFarlane	e752669e50	LaTeX reader: don't let `\egroup` match `{`. `braced` now actually requires nested braces. Otherwise some legitimate command and environment definitions can break (see test/command/tex-group.md).	2019-01-31 22:50:51 -08:00
John MacFarlane	5ddd7b121e	LaTeX reader: support `\endinput`. Closes #5233 .	2019-01-22 21:39:26 -08:00
John MacFarlane	f86ac89383	HTML and markdown: treat textarea as a verbatim environment. We don't want to parse its contents as Markdown or HTML. Closes #5241.	2019-01-21 20:54:12 -08:00
Brian Leung	35971495ab	RST reader: change treatment of `number-lines` directives. (#5207 ) Directives of this type without numeric inputs should not have a `startFrom` attribute; with a blank value, the writers can produce extra whitespace.	2019-01-09 22:19:26 -08:00
John MacFarlane	8673eb079b	Removed superfluous sourceCode class on code blocks. * These were added by the RST reader and, for literate Haskell, by the Markdown and LaTeX readers. There is no point to this class, and it is not applied consistently by all readers. See #5047. * Reverse order of `literate` and `haskell` classes on code blocks when parsing literate Haskell. Better if `haskell` comes first.	2019-01-08 11:36:33 -08:00
Mauro Bieg	f1d83aea12	Implement task lists (#5139 ) Closes #3051	2019-01-02 11:36:37 -08:00
John MacFarlane	ea8af33dab	Commonmark writer: fix handling of SoftBreak with `hard_line_breaks`. This should be rendered as a space. Closes #5195.	2019-01-02 10:31:13 -08:00
John MacFarlane	ffc2192caf	Simplify/fix reading of `--metadata` values on command line. Previously we used HsYAML's decodeStrict to recognize boolean values (treating everything else as a string). This caused problems relating to hvr/HsYAML#7. We now just check for the recognized boolean values `true\|True\|TRUE\|false\|False\|FALSE`, and avoid using HsYAML. Closes #5177.	2018-12-31 21:20:56 -08:00
leungbk	c998b937c1	Org writer: preserve line-numbering for example and code blocks.	2018-12-28 15:07:05 +01:00
John MacFarlane	d5e68d43be	RST writer: don't wrap simple table header lines. Closes #5128.	2018-12-05 17:10:33 -08:00
John MacFarlane	38200c0291	Strip out illegal XML characters in escapeXMLString. Closes #5119.	2018-12-04 09:24:15 -08:00
John MacFarlane	4060df6891	Markdown writer: include needed whitespace after HTML figure. We use HTML for a figure in markdown dialects that can't represent it natively. Closes #5121.	2018-12-03 15:10:13 -08:00
John MacFarlane	83c0789205	Added test for #5053 . Note that the fix for #5099 also fixes #5053, a pandoc 2.4 regression in parsing underscore emphasis after symbols.	2018-11-25 22:50:16 -08:00
John MacFarlane	edc651059e	Fix parsing of citations and quotes after parentheses. Starting with pandoc 2.4, citations and quoted inlines were no longer recognized after parentheses. This is because of commit `9b0bd4ec6f`, which is reverted here. The point of that commit was to allow relocation of soft line breaks to before an abbreviation, so that a nonbreaking space could be added after the abbreviation. Now we simply leave the soft line break in place, even though this means that we won't get a nonbreaking space after "Mr." at the end of a line (and in LaTeX this may result in a longer intersentential space). Those who care about this issue should take care not to end lines with an abbreviation, or to insert nonbreaking spaces manually. Closes #5099.	2018-11-25 22:29:54 -08:00
John MacFarlane	d532eb14eb	HTML reader: allow tfoot before body rows. Closes #5079.	2018-11-16 11:29:15 -08:00
John MacFarlane	e61f632531	HTML reader: parse `<small>` as a Span with class "small". Closes #5080.	2018-11-15 22:36:01 -08:00
John MacFarlane	e61d1d0da9	Asciidoc writer: Render Spans using `[#id .class]#contents#`. See #5080.	2018-11-15 22:29:15 -08:00
John MacFarlane	1a102c11a9	Fix test case for #5014 .	2018-11-13 14:50:26 -08:00
John MacFarlane	1cfdd3662f	HTML reader: allow thead containing a row with td rather than th. See #5014. Note that this doesn't address the original issue in #5014, only an unrelated side-issue.	2018-11-13 14:49:12 -08:00
John MacFarlane	52a57a5362	LaTeX writer: don't emit `[<+->]` unless beamer output, even if `writerIncremental` is True. See #5072.	2018-11-12 09:43:12 -08:00
John MacFarlane	5bc38a741b	Exactly match GitHub's identifier generating algorithm. See #5057.	2018-11-11 20:45:38 -08:00
John MacFarlane	a36d202e86	Text.Pandoc.Shared: add parameter to uniqueIdent, inlineListToIdentifier. The parameter is Extensions. This allows these functions to be sensitive to the settings of `Ext_gfm_auto_identifiers` and `Ext_ascii_identifiers`. This allows us to use `uniqueIdent` in the CommonMark reader, replacing some custom code. It also means that `gfm_auto_identifiers` can now be used in all formats. Semantically, `gfm_auto_identifiers` is now a modifier of `auto_identifiers`; for identifiers to be set, `auto_identifiers` must be turned on, and then the type of identifier produced depends on how `gfm_auto_identifiers` and `ascii_identifiers` are set. Closes #5057.	2018-11-11 13:46:23 -08:00
John MacFarlane	5f030f3c2c	Add command test for #5050 .	2018-11-06 22:57:11 -08:00
quasicomputational	a747268823	CommonMark writer: respect --ascii (#5043 )	2018-11-05 09:33:10 -08:00
John MacFarlane	511d647290	XML: toHtml5Entities: prefer shorter entities... when there are several choices for a particular character.	2018-11-04 22:15:53 -08:00
John MacFarlane	805b9f8a12	Roff reader: Improved handling of custom strings as arguments. Added test.	2018-11-02 21:35:49 -07:00
John MacFarlane	26341c1632	Implement --ascii for Markdown writer.	2018-11-01 16:31:04 -07:00
John MacFarlane	f379edc4ad	HTML writer: use character entities references when possible for HTML5.	2018-11-01 16:08:27 -07:00
John MacFarlane	e0290fd18b	LaTeX writer: add newline if math ends in a comment. This prevents the closing delimiter from being swallowed up in the comment. Closes #4880.	2018-10-31 21:51:20 -07:00
John MacFarlane	c51be5dfc8	LaTeX reader: allow space at end of math after `\`. Closes #5010. Expose trimMath from T.P.Shared.	2018-10-29 22:20:14 -07:00
Albert Krewinkel	096cbe6987	Lua: allow access to pandoc state (#5015 ) * Lua: allow access to pandoc state Lua filters and custom writers now have read-only access to most fields of pandoc's internal state via the global variable `PANDOC_STATE`. * Lua: allow iterating through fields of PANDOC_STATE * Lua filters doc: describe CommonState * Lua filters doc: mention global variable PANDOC_STATE * Lua: add access to logs Log messages can currently only be printed, but not decomposed.	2018-10-25 22:12:14 -07:00
John MacFarlane	8efb8975ed	Groff writer character escaping changes. T.P.GroffChar: replaced `essentialEscapes` with `manEscapes`, which includes all the escapes mentioned in the groff_man manual. T.P.Writers.Groff: removed escapeCode; changed parameter on escapeString from Bool to new type `EscapeMode`. Rewrote `escapeString`.	2018-10-23 21:44:07 -07:00
Brian Leung	7eea5c62ed	LaTeX reader: add support for `nolinkurl` command. (#4992 )	2018-10-22 23:36:44 -07:00
John MacFarlane	efbb329f1a	Groff escaping changes. - `--ascii` is now turned on automatically for man output, for portability. All man output will be escaped to ASCII. - In T.P.Writers.Groff, `escapeChar`, `escapeString`, and `escapeCode` now take a boolean parameter that selects ascii-only output. This is used by the Ms writer for `--ascii`, instead of doing an extra pass after writing the document. - In ms output without `--ascii`, unicode is used whenever possible (e.g. for double quotes). - A few escapes are changed: e.g. `\[rs]` instead of `\\` for backslash, and `\[ga]` instead of `` \` `` for backtick.	2018-10-18 10:21:34 -07:00
John MacFarlane	f48960b75f	Move common groff functions to Text.Pandoc.Writers.Groff (unexported module). These are used in both the man and ms writers. Moved groffEscape out of Text.Pandoc.Writers.Shared [cancels earlier API change from adding it, which was after last release]. This fixes strong/code combination on man (should be `\f[CB]` not `\f[BC]`), mentioned in #4973. Updated tests. Closes #4975.	2018-10-17 17:26:37 -07:00
Alexander Krotov	b3feaba6af	Man writer: use \f[R] instead of \f[] to reset font Fixes #4973	2018-10-17 18:29:07 +03:00
John MacFarlane	6f6ad0514d	LaTeX reader: make macroDef polymorphic and allow in inline context. Otherwise we can't parse something like ``` \lowercase{\def\x{Foo}} ``` I have actually seen tex like this in the wild.	2018-10-15 11:46:31 -07:00
John MacFarlane	22f81f78bd	Added failing test case for macros.	2018-10-15 00:37:17 -07:00
John MacFarlane	88faa45f1d	Markdown writer: ensure blank between raw block and normal content. Otherwise a raw block can prevent a paragraph from being recognized as such. Closes #4629.	2018-10-14 17:12:06 -07:00
John MacFarlane	cf8224045b	Markdown reader: Fix awkward soft break movements before abbreviations. Closes #4635.	2018-10-14 13:02:36 -07:00
John MacFarlane	f5c64c3060	HTML reader: fix htmlTag and isInlineTag to accept processing instructions. Fixes regression #3123 (since 2.0). Added regression test.	2018-10-11 09:58:25 -07:00
John MacFarlane	a92e43575f	LaTeX writer: with `--biblatex`, use `\autocite` when possible. `\autocites{a1}{a2}{a3}` will not collapse the entries. So, if we don't have prefixes and suffixes, we use instead `\autocite{a1;a2;a3}`. Closes #4960.	2018-10-08 20:47:09 -07:00
John MacFarlane	145710c4c3	RST reader: don't allow single-dash separator in headerless table. Closes #4382.	2018-10-07 12:37:08 -07:00
John MacFarlane	b806bff5b4	LaTeX reader: fix bugs omitting raw tex. The default is `-raw_tex`, so no raw tex should result unless we explicitly say `+raw_tex`. Previously some raw commands did make it through. Closes #4527.	2018-10-07 12:21:43 -07:00
John MacFarlane	08fef6b210	RST reader: pass through fields in unknown directives as div attributes. This commit also adds support for `class` and `name` attributes to directives in general. Closes #4715.	2018-10-07 11:44:11 -07:00
Brian Leung	e257b54124	Org reader: fix behavior for successive calls of `#+EXCLUDE_TAGS`. (#4951 ) Calling `#+EXCLUDE_TAGS` multiple times should preserve the status of the previously declared tags.	2018-10-05 22:21:20 -07:00
quasicomputational	6207bdeb68	CommonMark writer: add plain text fallbacks. (#4531 ) Previously, the writer would unconditionally emit HTMLish output for subscripts, superscripts, strikeouts (if the strikeout extension is disabled) and small caps, even with raw_html disabled. Now there are plain-text (and, where possible, fancy Unicode) fallbacks for all of these corresponding (mostly) to the Markdown fallbacks, and the HTMLish output is only used when raw_html is enabled. This commit adds exported functions `toSuperscript` and `toSubscript` to `Text.Pandoc.Writers.Shared`. [API change] Closes #4528.	2018-10-05 21:33:14 -07:00
Brian Leung	a26b3a2d6a	Org reader: Add partial support for `#+EXCLUDE_TAGS` option. (#4950 ) Closes #4284. Headers with the corresponding tags should not appear in the output. If one or more of the specified tags contains a non-tag character like `+`, Org-mode will not treat that as a valid tag, but will nonetheless continue scanning for valid tags. That behavior is not replicated in this patch; entering `cat+dog` as one of the entries in `#+EXCLUDE_TAGS` and running the file through Pandoc will cause the parser to fail and result in the only excluded tag being the default, `noexport`.	2018-10-05 14:28:17 -07:00
John MacFarlane	36f1846cc3	Implement `--ascii` (`writerPreferAscii`) in writers, not App. Now the `write*` functions for Docbook, HTML, ICML, JATS, Man, Ms, OPML are sensitive to `writerPreferAscii`. Previously the to-ascii translation was done in Text.Pandoc.App, and thus not available to those using the writer functions directly. In addition, the LaTeX writer is now sensitive to `writerPreferAscii` and to `--ascii`. 100% ASCII output can't be guaranteed, but the writer will use commands like `\"{a}` and `\l` whenever possible, to avoid emitting a non-ASCII character. A new unexported module, Text.Pandoc.Groff, has been added to store functions used in the different groff-based writers.	2018-09-30 22:32:00 -07:00
John MacFarlane	190ee279c9	LaTeX reader: allow verbatim blocks ending with blank lines. Closes #4624.	2018-09-29 10:57:11 -07:00
leungbk	6e8f31dab1	Force inline code blocks to honor export options. `exportsCode` is moved from `Blocks.hs` to `Shared.hs` and exported accordingly.	2018-09-26 08:49:13 +02:00
Brian Leung	72363cd2fc	Add support for multiprenote and multipostnote arguments in LaTeX. (#4930 ) * Add support for multiprenote and multipostnote arguments. The multiprenotes occur before the first prefix of a multicite, and the multipostnotes follow the last suffix. * Add test for multiprenote and multipostnote.	2018-09-25 20:49:13 -07:00
John MacFarlane	37c6f6adfe	RST reader: fix bug with internal link targets. They were gobbling up indented content underneath. Closes #4919.	2018-09-20 11:15:03 -07:00
John MacFarlane	136bf901aa	Markdown reader: distinguish autolinks in the AST. With this change, autolinks are parsed as Links with the `uri` class. (The same is true for bare links, if the `autolink_bare_uris` extension is enabled.) Email autolinks are parsed as Links with the `email` class. This allows the distinction to be represented in the URI. Formerly the `uri` class was added to autolinks by the HTML writer, but it had to guess what was an autolink and could not distinguish `[http://example.com](http://example.com)` from `<http://example.com>`. It also incorrectly recognized `[pandoc](pandoc)` as an autolink. Now the HTML writer simply passes through the `uri` attribute if it is present, but does not add anything. The Textile writer has been modified so that the `uri` class is not explicitly added for autolinks, even if it is present. Closes #4913.	2018-09-19 14:53:29 -07:00
John MacFarlane	44e4f7b292	Markdown reader: example_lists should work without startnum. Closes #4908.	2018-09-16 20:40:32 -07:00
mb21	5347e9454f	add test for --metadata-file	2018-09-15 17:06:10 +02:00
mb21	bd5500ba7f	add test yaml-metadata-blocks.md	2018-09-15 12:10:10 +02:00
John MacFarlane	fa4ebd71a3	LaTeX reader: resolve `\ref` for figure numbers.	2018-09-09 22:53:18 -07:00
John MacFarlane	a211edc819	HTML reader: parse `<script type="math/tex` tags as math. These are used by MathJax. Closes #4877.	2018-09-07 09:41:17 -07:00
John MacFarlane	85ed24e849	RST reader: don't skip link definitions after comments. Closes #4860.	2018-08-29 14:40:04 -07:00
John MacFarlane	a2c4261b32	HTML reader: allow enabling `raw_tex` extension. This now allows raw LaTeX environments, `\ref`, and `\eqref` to be parsed (which is helpful for translating HTML documents using MathJax). Closes #1126.	2018-08-24 18:04:00 -07:00
Alexander Krotov	937b92cd30	HTML reader: extract spaces inside links instead of trimming them Fixes #4845	2018-08-22 12:43:15 +03:00
John MacFarlane	3b5949e8f2	LaTeX reader: support blockcquote, foreignblockquote from csquotes. Also foreigncblockquote, hyphenblockquote, hyphencblockquote. Closes #4848. But note: currently foreignquote will be parsed as a regular Quoted inline (not using the quotes appropriate to the foreign language).	2018-08-21 21:03:43 -07:00
John MacFarlane	a733068ebf	LaTeX reader: support enquote*, foreignquote, hyphenquote... from csquotes. See #4848. Still TBD: blockquote, blockcquote, foreignblockquote.	2018-08-21 17:39:27 -07:00
John MacFarlane	42f4632e60	LaTeX reader: Support more text-mode accents. Add support for `\\|`, `\b`, `\G`, `\h`, `\d`, `\f`, `\r`, `\t`, `\U`, `\i`, `\j`, `\newtie`, `\textcircled`. Also fall back to combining characters when composed characters are not available. Closes #4652.	2018-08-17 23:19:38 -07:00
Marc Schreiber	175da00295	Add support for latex mintinline (#4365 )	2018-08-17 20:57:36 -07:00
John MacFarlane	1b66865763	LaTeX reader: fix siunitx unit commands... ...they should only be recognized in siunitx contexts. For example, `\l` outside of an siunitx context should be l-slash, not l (for liter)! Closes #4842.	2018-08-17 15:22:47 -07:00
John MacFarlane	13dea94a91	Markdown reader: Use "tex" instead of "latex" for raw tex-ish content. We can't always tell if it's LaTeX, ConTeXt, or plain TeX. Better just to use "tex" always. Also changed: ConTeXt writer: now outputs raw "tex" blocks as well as "context". (Closes #969). RST writer: uses ".. raw:: latex" for "tex" content. (RST doesn't support raw context anyway.) Note that if "context" or "latex" specifically is desired, you can still force that in a markdown document by using the raw attribute (see MANUAL.txt): ```{=latex} \foo ``` Note that this change may affect some filters, if they assume that raw tex parsed by the Markdown reader will be RawBlock (Format "latex"). In most cases it should be trivial to modify the filters to accept "tex" as well.	2018-08-15 10:25:12 -07:00
John MacFarlane	c27ce1e70e	LaTeX reader: handle parameter patterns for `\def`. For example: `\def\foo#1[#2]{#1 and #2}`. Closes #4768. Also fixes #4771. API change: in Text.Pandoc.Readers.LaTeX.Types, new type ArgSpec added. Second parameter of Macro constructor is now `[ArgSpec]` instead of `Int`.	2018-08-14 00:03:55 -07:00
John MacFarlane	919c50162c	RST writer: render Divs with admonition classes as admonitions. Also omit Div with class "admonition-title". These are generated by the RST reader and should be omitted on round-trip. Closes #4833.	2018-08-13 11:17:26 -07:00
John MacFarlane	6d14f53bd9	LaTeX reader: Allow `%` characters in URLs. This affects `\href` and `\url`. Closes #4832.	2018-08-12 16:46:48 -07:00
John MacFarlane	b76203ccf1	Markdown reader: Properly handle boolean values in YAML metadata. This fixes a regression in 2.2.3, which caused boolean values to be parsed as MetaInlines instead of MetaBool. Note also an undocumented (but desirable) change in 2.2.3: numbers are now parsed as MetaInlines rather than MetaString. Closes #4819.	2018-08-07 09:26:58 -07:00
John MacFarlane	94c3753c08	Fix parsing of embedded mappings in YAML metadata. This fixes a regression in 2.2.3 which caused embedded mappings (e.g. mappings in sequences) not to work in YAML metadata. Closes #4817.	2018-08-06 12:32:04 -07:00
John MacFarlane	581a3514ca	RST reader: improve parsing of inline interpreted text roles. * Use a Span with class "title-reference" for the default title-reference role. * Use B.text to split up contents into Spaces, SoftBreaks, and Strs for title-reference. * Use Code with class "interpreted-text" instead of Span and Str for unknown roles. (The RST writer has also been modified to round-trip this properly.) * Disallow blank lines in interpreted text. * Backslash-escape now works in interpreted text. * Backticks followed by alphanumerics no longer end interpreted text. Closes #4811.	2018-08-05 09:56:43 -07:00
John MacFarlane	f7dc3e7487	Added test case for #4669 to repository.	2018-08-05 09:21:45 -07:00
danse	be2d7921cb	RST reader: remove support for nested inlines. RST does not allow nested emphasis, links, or other inline constructs. Closes #4581, double parsing of links with URLs as link text. This supersedes the earlier fix for #4581 in `6419819b46`. Fixes #4561, a bug parsing with URLs inside emphasis. Closes #4792.	2018-07-24 15:35:50 -07:00
John MacFarlane	50e8c3b107	MediaWiki writer: Avoid extra blank line in tables with empty cells. Note that the old output is semantically identical, but the new output looks better. Closes #4794.	2018-07-24 11:38:24 -07:00
John MacFarlane	6419819b46	RST reader: fix double-link bug. Link labels containing raw URLs were parsed as autolinks, but links within links are not allowed. Closes #4581.	2018-07-21 22:53:04 -07:00
John MacFarlane	34b229dd5a	Fix for bug in parsing `\include` in markdown. Starting in 2.2.2, everything after an `\input` (or `\include`) in a markdown file would be parsed as raw LaTeX. This commit fixes the issue and adds a regression test. Closes #4781.	2018-07-19 17:44:16 -07:00
John MacFarlane	af445b34d8	Make markdown and github writers respect the `emoji` extension.	2018-07-15 16:02:46 -07:00
Anders Waldenborg	ec30fb37c1	Wrap emojis in span nodes (#4759 ) Text.Pandoc.Emoji now exports `emojiToInline`, which returns a Span inline containing the emoji character and some attributes with metadata (class `emoji`, attribute `data-emoji` with emoji name). Previously, emojis (as supported in Markdown and CommonMark readers, e.g. "😄") were simply translated into the corresponding unicode code point. By wrapping them in Span nodes, we make it possible to do special handling such as giving them a special font in HTML output. We also open up the possibility of treating them differently when the `--ascii` option is selected (though that is not part of this commit). Closes #4743.	2018-07-15 15:14:40 -07:00
Mauro Bieg	5809d5bef2	AsciiDoc Writer: escape square brackets at start of line (#4708 ) closes #4545	2018-07-12 19:37:37 +02:00
Alexander Krotov	41cf6d540f	More spellcheck	2018-07-02 19:07:28 +03:00
John MacFarlane	016e0a09e2	RST writer: don't treat 'example' as a syntax name. This fixes conversions from org with example blocks. Closes #4748.	2018-06-30 11:45:49 +02:00
Anders Waldenborg	904924d172	CommonMark reader: Handle ascii_identifiers extension (#4733 ) Non-ascii characters were not stripped from identifiers even if the `ascii_identifiers` extension was enabled (which it is by default for gfm). Closes #4742	2018-06-29 10:41:26 +02:00
Mauro Bieg	0459d1be26	TikiWiki reader: improve list parsing (#4723 ) - remove trailing Space from list items - parse lists that have no space after marker (fixes #4722)	2018-06-28 13:35:54 +02:00
John MacFarlane	48a505c5a0	Markdown reader: allow empty code spans. E.g. `` ` ` ``.	2018-06-13 11:12:10 -07:00
Mauro Bieg	7e477db95c	LaTeX Reader: parse figure label into Image id (#4704 ) closes #4700	2018-06-13 10:41:30 -07:00
John MacFarlane	a41222db3e	Adjust command test not to use echo. This is fraught on Windows.	2018-06-11 10:53:56 -07:00
Mauro Bieg	905dee6ee3	beamer output: fix single digit column percentage (#4691 ) fixes #4690	2018-06-07 10:50:14 -07:00
John MacFarlane	d32e866449	LaTeX reader: handle includes without surrounding blanklines. In addition, `\input` can now be used in an inline context, e.g. to provide part of a paragraph, as it can in LaTeX. Closes #4553.	2018-06-01 09:25:10 -07:00
John MacFarlane	7119715a6a	LaTeX reader `rawLaTeXBlock`: handle macros that resolve to a... ...`\begin` or `\end`. Fixes #4667.	2018-05-30 12:49:01 -07:00
John MacFarlane	252ab9b773	Markdown writer: preserve `implicit_figures` with attributes... ...even if `implicit_attributes` is not set, by rendering in raw HTML. Fixes #4677.	2018-05-30 09:24:52 -07:00
John MacFarlane	58447bba98	rawLaTeXBlock: don't expand macros in macro definitions! Closes #4653. Note that this only affected LaTeX in markdown. Added regression test.	2018-05-15 09:19:13 -07:00
John MacFarlane	a00ca6f0d8	Removed inadvertently added .orig files from repository. These were added by `96d10c72cc` Closes #4648.	2018-05-11 17:10:32 -07:00
John MacFarlane	d3be567a73	Fix regression with tex math environments in HTML + MathJax. Closes #4639.	2018-05-09 10:37:04 -07:00
John MacFarlane	81881ce470	Parsing: Lookahead for non-whitespace after single/double quote start. Closes #4637.	2018-05-09 10:00:34 -07:00
John MacFarlane	44f1c72b28	Add test for #4576 . Closes #4576.	2018-05-08 09:14:58 -07:00
John MacFarlane	a96c762a10	RST reader: fix anonymous redirects with backticks. Closes #4598.	2018-04-26 12:23:25 -07:00
John MacFarlane	aba0f7e063	Add tests for #4589 and #4594 (currently failing).	2018-04-25 23:04:08 -07:00
John MacFarlane	dab3330a58	RST reader: allow < 3 spaces indent under directives. Closes #4579.	2018-04-22 12:20:25 -07:00
John MacFarlane	7fbe473b2e	Markdown reader/writer: spacing adjustments in tables. * Markdown writer now includes a blank line at the end of the row in a single-row multiline table, to prevent it from being interpreted as a simple table. Closes #4578. * Markdown reader does a better job computing the relative width of the last column in a multiline table, so we can round-trip tables without constantly shrinking the last column.	2018-04-21 13:06:57 -07:00
John MacFarlane	276894a2f2	RST writer: use more consistent indentation. Previously we used an odd mix of 3- and 4-space indentation. Now we use 3-space indentation, except for ordered lists, where indentation must depend on the width of the list marker. Closes #4563.	2018-04-19 13:47:16 -07:00
John MacFarlane	d5b98c8c6e	Man writer: Don't escape U+2019 as '. Closes #4550.	2018-04-14 10:42:05 -07:00
John MacFarlane	d77e8f45c9	LaTeX reader: properly resolve section numbers with \ref and chapters. Closes #4529.	2018-04-05 10:14:06 -07:00
quasicomputational	13538ce6eb	CommonMark writer: correctly ignore LaTeX raw blocks when not raw_tex (#4533 ) Issue #4527.	2018-04-05 08:53:42 -07:00

491 commits