pandoc

Author	SHA1	Message	Date
John MacFarlane	a20323033e	Fix footnote in image caption. Regression! The fix for #4683 broke this case.	2019-05-01 16:56:37 -07:00
John MacFarlane	f11d0c9dc8	HTML: prevent gratuitious emojification on iOS. iOS chooses to render a number of Unicode entities, including '↩', as big colorful emoji. This can be defeated by appending Unicode VARIATION SELECTOR-15'/'VARIATION SELECTOR-16'. So we now append this character when escaping strings, for both '↩' and '↔'. If other characters prove problematic, they can simply be added to needsVariationSelector. Closes #5469.	2019-04-30 22:32:52 -07:00
John MacFarlane	e409509a68	RST writer: treat Span as transparent. Previously an Emph inside a Span was being treated as nested markup and ignored. With this patch, the Span is just ignored. Closes #5446.	2019-04-15 09:48:11 -07:00
John MacFarlane	23df94e30a	Update command test #5416 to make it windows friendly	2019-04-02 17:59:47 -07:00
Mauro Bieg	0fa6951dc1	Dokuwiki Reader fix: parse single curly brace (#5417 ) fixes #5416	2019-04-01 11:36:47 -06:00
John MacFarlane	93ee73e1dc	LaTeX writer: Avoid inadvertently creating ? `or !` ligatures. These are upside down ? and !, resp. Closes #5407.	2019-03-29 10:04:22 -07:00
John MacFarlane	40865958ce	Markdown reader: fenced div takes priority over setext header. For ::: {.cell} --- :::	2019-03-28 17:39:22 -07:00
John MacFarlane	1e60776226	LaTeX writer: Fix footnotes in table caption and cells. This fixes a bug wherein footnotes appeared in the wrong order, and with duplicate numbers, when in table captions and cells. We now use regular `\footnote` commands, even in the table caption and the minipages containing cells. Apparently longtable knows how to handle this. Closes #5367.	2019-03-22 11:55:41 -07:00
John MacFarlane	6be8f4e953	Improved fix to #5340 and added test.	2019-03-18 16:53:36 -07:00
John MacFarlane	3880a23de9	Properly escape attributes in Markdown writer. Closes #5369.	2019-03-17 18:15:47 -07:00
John MacFarlane	ebd7035a2a	Add test case for #5368 .	2019-03-17 18:02:59 -07:00
John MacFarlane	0bed0ab5a3	Use XDG data directory for user data directory. Instead of `$HOME/.pandoc`, the default user data directory is now `$XDG_DATA_HOME/pandoc`, where `XDG_DATA_HOME` defaults to `$HOME/.local/share` but can be overridden by setting the environment variable. If this directory is missing, then `$HOME/.pandoc` is searched instead, for backwards compatibility. However, we recommend moving local pandoc data files from `$HOME/.pandoc` to `$HOME/.local/share/pandoc`. On Windows the default user data directory remains the same. Closes #3582.	2019-03-02 15:03:59 -08:00
John MacFarlane	ba05e1ea02	Shared.compactify: Avoid mixed lists. This improves on the original fix to #5285 by preventing other mixed lists (lists with a mix of Plain and Para elements) that were allowed given the original fix.	2019-02-25 17:33:54 -08:00
John MacFarlane	38c028bd50	JATS reader: fix parsing of figures. This ensures that a figure containing a single image is parsed as a pandoc "implicit figure" (i.e., a Para with a single Image whose title attribute begins with `fig:`). More complex figures will still be parsed as divs. Closes #5321.	2019-02-23 15:40:06 -07:00
John MacFarlane	d7d1c9c8e4	Markdown reader: fix bug parsing fenced code blocks. Previously parsing would break if the code block contained a string of backticks of sufficient length followed by something other than end of line. Closes #5304.	2019-02-15 22:34:32 -08:00
John MacFarlane	47537d26db	Improve tight/loose list handling. Closes #5285. Previously the algorithm allowed list items with a mix of Para and Plain, which is never wanted. compactify in T.P.Shared has been modified so that, if a list's items contain (at the top level) Para elements (aside from perhaps at the very end), ALL Plains are converted to Paras.	2019-02-08 23:16:01 -08:00
John MacFarlane	ccf4e23ee1	Markdown reader: add newline when parsing blocks in YAML. Otherwise last block gets parsed as a Plain rather than a Para. This is a regression in pandoc 2.x. This patch restores pandoc 1.19 behavior. Closes #5271.	2019-02-04 10:22:02 -08:00
John MacFarlane	b74267406b	Update test for last commit.	2019-02-02 16:20:06 -08:00
John MacFarlane	633a9ecfec	LaTeX writer: avoid `{}` after control sequences when escaping. `\ldots{}.` doesn't behave as well as `\ldots.` with the latex ellipsis package. This patch causes pandoc to avoid emitting the `{}` when it is not necessary. Now `\ldots` and other control sequences used in escaping will be followed by either a `{}`, a space, or nothing, depending on context. Thanks to Elliott Slaughter for the suggestion.	2019-02-01 21:17:46 -08:00
John MacFarlane	e752669e50	LaTeX reader: don't let `\egroup` match `{`. `braced` now actually requires nested braces. Otherwise some legitimate command and environment definitions can break (see test/command/tex-group.md).	2019-01-31 22:50:51 -08:00
John MacFarlane	5ddd7b121e	LaTeX reader: support `\endinput`. Closes #5233 .	2019-01-22 21:39:26 -08:00
John MacFarlane	f86ac89383	HTML and markdown: treat textarea as a verbatim environment. We don't want to parse its contents as Markdown or HTML. Closes #5241.	2019-01-21 20:54:12 -08:00
Brian Leung	35971495ab	RST reader: change treatment of `number-lines` directives. (#5207 ) Directives of this type without numeric inputs should not have a `startFrom` attribute; with a blank value, the writers can produce extra whitespace.	2019-01-09 22:19:26 -08:00
John MacFarlane	8673eb079b	Removed superfluous sourceCode class on code blocks. * These were added by the RST reader and, for literate Haskell, by the Markdown and LaTeX readers. There is no point to this class, and it is not applied consistently by all readers. See #5047. * Reverse order of `literate` and `haskell` classes on code blocks when parsing literate Haskell. Better if `haskell` comes first.	2019-01-08 11:36:33 -08:00
Mauro Bieg	f1d83aea12	Implement task lists (#5139 ) Closes #3051	2019-01-02 11:36:37 -08:00
John MacFarlane	ea8af33dab	Commonmark writer: fix handling of SoftBreak with `hard_line_breaks`. This should be rendered as a space. Closes #5195.	2019-01-02 10:31:13 -08:00
John MacFarlane	ffc2192caf	Simplify/fix reading of `--metadata` values on command line. Previously we used HsYAML's decodeStrict to recognize boolean values (treating everything else as a string). This caused problems relating to hvr/HsYAML#7. We now just check for the recognized boolean values `true\|True\|TRUE\|false\|False\|FALSE`, and avoid using HsYAML. Closes #5177.	2018-12-31 21:20:56 -08:00
leungbk	c998b937c1	Org writer: preserve line-numbering for example and code blocks.	2018-12-28 15:07:05 +01:00
John MacFarlane	d5e68d43be	RST writer: don't wrap simple table header lines. Closes #5128.	2018-12-05 17:10:33 -08:00
John MacFarlane	38200c0291	Strip out illegal XML characters in escapeXMLString. Closes #5119.	2018-12-04 09:24:15 -08:00
John MacFarlane	4060df6891	Markdown writer: include needed whitespace after HTML figure. We use HTML for a figure in markdown dialects that can't represent it natively. Closes #5121.	2018-12-03 15:10:13 -08:00
John MacFarlane	83c0789205	Added test for #5053 . Note that the fix for #5099 also fixes #5053, a pandoc 2.4 regression in parsing underscore emphasis after symbols.	2018-11-25 22:50:16 -08:00
John MacFarlane	edc651059e	Fix parsing of citations and quotes after parentheses. Starting with pandoc 2.4, citations and quoted inlines were no longer recognized after parentheses. This is because of commit `9b0bd4ec6f`, which is reverted here. The point of that commit was to allow relocation of soft line breaks to before an abbreviation, so that a nonbreaking space could be added after the abbreviation. Now we simply leave the soft line break in place, even though this means that we won't get a nonbreaking space after "Mr." at the end of a line (and in LaTeX this may result in a longer intersentential space). Those who care about this issue should take care not to end lines with an abbreviation, or to insert nonbreaking spaces manually. Closes #5099.	2018-11-25 22:29:54 -08:00
John MacFarlane	d532eb14eb	HTML reader: allow tfoot before body rows. Closes #5079.	2018-11-16 11:29:15 -08:00
John MacFarlane	e61f632531	HTML reader: parse `<small>` as a Span with class "small". Closes #5080.	2018-11-15 22:36:01 -08:00
John MacFarlane	e61d1d0da9	Asciidoc writer: Render Spans using `[#id .class]#contents#`. See #5080.	2018-11-15 22:29:15 -08:00
John MacFarlane	1a102c11a9	Fix test case for #5014 .	2018-11-13 14:50:26 -08:00
John MacFarlane	1cfdd3662f	HTML reader: allow thead containing a row with td rather than th. See #5014. Note that this doesn't address the original issue in #5014, only an unrelated side-issue.	2018-11-13 14:49:12 -08:00
John MacFarlane	52a57a5362	LaTeX writer: don't emit `[<+->]` unless beamer output, even if `writerIncremental` is True. See #5072.	2018-11-12 09:43:12 -08:00
John MacFarlane	5bc38a741b	Exactly match GitHub's identifier generating algorithm. See #5057.	2018-11-11 20:45:38 -08:00
John MacFarlane	a36d202e86	Text.Pandoc.Shared: add parameter to uniqueIdent, inlineListToIdentifier. The parameter is Extensions. This allows these functions to be sensitive to the settings of `Ext_gfm_auto_identifiers` and `Ext_ascii_identifiers`. This allows us to use `uniqueIdent` in the CommonMark reader, replacing some custom code. It also means that `gfm_auto_identifiers` can now be used in all formats. Semantically, `gfm_auto_identifiers` is now a modifier of `auto_identifiers`; for identifiers to be set, `auto_identifiers` must be turned on, and then the type of identifier produced depends on `gfm_auto_identifiers` and `ascii_identifiers` are set. Closes #5057.	2018-11-11 13:46:23 -08:00
John MacFarlane	5f030f3c2c	Add command test for #5050 .	2018-11-06 22:57:11 -08:00
quasicomputational	a747268823	CommonMark writer: respect --ascii (#5043 )	2018-11-05 09:33:10 -08:00
John MacFarlane	511d647290	XML: toHtml5Entities: prefer shorter entities... when there are several choices for a particular character.	2018-11-04 22:15:53 -08:00
John MacFarlane	805b9f8a12	Roff reader: Improved handling of custom strings as arguments. Added test.	2018-11-02 21:35:49 -07:00
John MacFarlane	26341c1632	Implement --ascii for Markdown writer.	2018-11-01 16:31:04 -07:00
John MacFarlane	f379edc4ad	HTML writer: use character entities references when possible for HTML5.	2018-11-01 16:08:27 -07:00
John MacFarlane	e0290fd18b	LaTeX writer: add newline if math ends in a comment. This prevents the closing delimiter from being swalled up in the comment. Closes #4880.	2018-10-31 21:51:20 -07:00
John MacFarlane	c51be5dfc8	LaTeX reader: allow space at end of math after `\`. Closes #5010. Expose trimMath from T.P.Shared.	2018-10-29 22:20:14 -07:00
Albert Krewinkel	096cbe6987	Lua: allow access to pandoc state (#5015 ) * Lua: allow access to pandoc state Lua filters and custom writers now have read-only access to most fields of pandoc's internal state via the global variable `PANDOC_STATE`. * Lua: allow iterating through fields of PANDOC_STATE * Lua filters doc: describe CommonState * Lua filters doc: mention global variable PANDOC_STATE * Lua: add access to logs Log messages can currently only be printed, but not decomposed.	2018-10-25 22:12:14 -07:00
John MacFarlane	8efb8975ed	Groff writer character escaping changes. T.P.GroffChar: replaced `essentialEscapes` with `manEscapes`, which includes all the escapes mentioned in the groff_man manual. T.P.Writers.Groff: removed escapeCode; changed parameter on escapeString from Bool to new type `EscapeMode`. Rewrote `escapeString`.	2018-10-23 21:44:07 -07:00
Brian Leung	7eea5c62ed	LaTeX reader: add support for `nolinkurl` command. (#4992 )	2018-10-22 23:36:44 -07:00
John MacFarlane	efbb329f1a	Groff escaping changes. - `--ascii` is now turned on automatically for man output, for portability. All man output will be escaped to ASCII. - In T.P.Writers.Groff, `escapeChar`, `escapeString`, and `escapeCode` now take a boolean parameter that selects ascii-only output. This is used by the Ms writer for `--ascii`, instead of doing an extra pass after writing the document. - In ms output without `--ascii`, unicode is used whenever possible (e.g. for double quotes). - A few escapes are changed: e.g. `\[rs]` instead of `\\` for backslash, and `\ga]` instead of `` \` `` for backtick.	2018-10-18 10:21:34 -07:00
John MacFarlane	f48960b75f	Move common groff functions to Text.Pandoc.Writers.Groff (unexported module). These are used in both the man and ms writers. Moved groffEscape out of Text.Pandoc.Writers.Shared [cancels earlier API change from adding it, which was after last release]. This fixes strong/code combination on man (should be `\f[CB]` not `\f[BC]`), mentioned in #4973. Updated tests. Closes #4975.	2018-10-17 17:26:37 -07:00
Alexander Krotov	b3feaba6af	Man writer: use \f[R] instead of \f[] to reset font Fixes #4973	2018-10-17 18:29:07 +03:00
John MacFarlane	6f6ad0514d	LaTeX reader: make macroDef polymorphic and allow in inline context. Otherwise we can't parse something like ``` \lowercase{\def\x{Foo}} ``` I have actually seen tex like this in the wild.	2018-10-15 11:46:31 -07:00
John MacFarlane	22f81f78bd	Added failing test case for macros.	2018-10-15 00:37:17 -07:00
John MacFarlane	88faa45f1d	Markdown writer: ensure blank between raw block and normal content. Otherwise a raw block can prevent a paragraph from being recognized as such. Closes #4629.	2018-10-14 17:12:06 -07:00
John MacFarlane	cf8224045b	Markdown reader: Fix awkward soft break movements before abbreviations. Closes #4635.	2018-10-14 13:02:36 -07:00
John MacFarlane	f5c64c3060	HTML reader: fix htmlTag and isInlineTag to accept processing instructions. Fixes regression #3123 (since 2.0). Added regression test.	2018-10-11 09:58:25 -07:00
John MacFarlane	a92e43575f	LaTeX writer: with `--biblatex`, use `\autocite` when possible. `\autocites{a1}{a2}{a3}` will not collapse the entries. So, if we don't have prefixes and suffixes, we use instead `\autocite{a1;a2;a3}`. Closes #4960.	2018-10-08 20:47:09 -07:00
John MacFarlane	145710c4c3	RST reader: don't allow single-dash separator in headerless table. Closes #4382.	2018-10-07 12:37:08 -07:00
John MacFarlane	b806bff5b4	LaTeX reader: fix bugs omitting raw tex. The default is `-raw_tex`, so no raw tex should result unless we explicitly say `+raw_tex`. Previously some raw commands did make it through. Closes #4527.	2018-10-07 12:21:43 -07:00
John MacFarlane	08fef6b210	RST reader: pass through fields in unknown directives as div attributes. This commit also adds support for `class` and `name` attributes to directives in general. Closes #4715.	2018-10-07 11:44:11 -07:00
Brian Leung	e257b54124	Org reader: fix behavior for successive calls of `#+EXCLUDE_TAGS`. (#4951 ) Calling `#+EXCLUDE_TAGS` multiple times should preserve the status of the previously declared tags.	2018-10-05 22:21:20 -07:00
quasicomputational	6207bdeb68	CommonMark writer: add plain text fallbacks. (#4531 ) Previously, the writer would unconditionally emit HTMLish output for subscripts, superscripts, strikeouts (if the strikeout extension is disabled) and small caps, even with raw_html disabled. Now there are plain-text (and, where possible, fancy Unicode) fallbacks for all of these corresponding (mostly) to the Markdown fallbacks, and the HTMLish output is only used when raw_html is enabled. This commit adds exported functions `toSuperscript` and `toSubscript` to `Text.Pandoc.Writers.Shared`. [API change] Closes #4528.	2018-10-05 21:33:14 -07:00
Brian Leung	a26b3a2d6a	Org reader: Add partial support for `#+EXCLUDE_TAGS` option. (#4950 ) Closes #4284. Headers with the corresponding tags should not appear in the output. If one or more of the specified tags contains a non-tag character like `+`, Org-mode will not treat that as a valid tag, but will nonetheless continue scanning for valid tags. That behavior is not replicated in this patch; entering `cat+dog` as one of the entries in `#+EXCLUDE_TAGS` and running the file through Pandoc will cause the parser to fail and result in the only excluded tag being the default, `noexport`.	2018-10-05 14:28:17 -07:00
John MacFarlane	36f1846cc3	Implement `--ascii` (`writerPreferAscii`) in writers, not App. Now the `write*` functions for Docbook, HTML, ICML, JATS, Man, Ms, OPML are sensitive to `writerPreferAscii`. Previously the to-ascii translation was done in Text.Pandoc.App, and thus not available to those using the writer functions directly. In addition, the LaTeX writer is now sensitive to `writerPreferAscii` and to `--ascii`. 100% ASCII output can't be guaranteed, but the writer will use commands like `\"{a}` and `\l` whenever possible, to avoid emiting a non-ASCII character. A new unexported module, Text.Pandoc.Groff, has been added to store functions used in the different groff-based writers.	2018-09-30 22:32:00 -07:00
John MacFarlane	190ee279c9	LaTeX reader: allow verbatim blocks ending with blank lines. Closes #4624.	2018-09-29 10:57:11 -07:00
leungbk	6e8f31dab1	Force inline code blocks to honor export options. `exportsCode` is moved from `Blocks.hs` to `Shared.hs` and exported accordingly.	2018-09-26 08:49:13 +02:00
Brian Leung	72363cd2fc	Add support for multiprenote and multipostnote arguments in LaTeX. (#4930 ) * Add support for multiprenote and multipostnote arguments. The multiprenotes occur before the first prefix of a multicite, and the multipostnotes follow the last suffix. * Add test for multiprenote and multipostnote.	2018-09-25 20:49:13 -07:00
John MacFarlane	37c6f6adfe	RST reader: fix bug with internal link targets. They were gobbling up indented content underneath. Closes #4919.	2018-09-20 11:15:03 -07:00
John MacFarlane	136bf901aa	Markdown reader: distinguish autolinks in the AST. With this change, autolinks are parsed as Links with the `uri` class. (The same is true for bare links, if the `autolink_bare_uris` extension is enabled.) Email autolinks are parsed as Links with the `email` class. This allows the distinction to be represented in the URI. Formerly the `uri` class was added to autolinks by the HTML writer, but it had to guess what was an autolink and could not distinguish `[http://example.com](http://example.com)` from `<http://example.com>`. It also incorrectly recognized `[pandoc](pandoc)` as an autolink. Now the HTML writer simply passes through the `uri` attribute if it is present, but does not add anything. The Textile writer has been modified so that the `uri` class is not explicitly added for autolinks, even if it is present. Closes #4913.	2018-09-19 14:53:29 -07:00
John MacFarlane	44e4f7b292	Markdown reader: example_lists should work without startnum. Closes #4908.	2018-09-16 20:40:32 -07:00
mb21	5347e9454f	add test for --metadata-file	2018-09-15 17:06:10 +02:00
mb21	bd5500ba7f	add test yaml-metadata-blocks.md	2018-09-15 12:10:10 +02:00
John MacFarlane	fa4ebd71a3	LaTeX reader: resolve `\ref` for figure numbers.	2018-09-09 22:53:18 -07:00
John MacFarlane	a211edc819	HTML reader: parse `<script type="math/tex` tags as math. These are used by MathJax. Closes #4877.	2018-09-07 09:41:17 -07:00
John MacFarlane	85ed24e849	RSTR reader: don't skip link definitions after comments. Closes #4860.	2018-08-29 14:40:04 -07:00
John MacFarlane	a2c4261b32	HTML reader: allow enabling `raw_tex` extension. This now allows raw LaTeX environments, `\ref`, and `\eqref` to be parsed (which is helpful for translation HTML documents using MathJaX). Closes #1126.	2018-08-24 18:04:00 -07:00
Alexander Krotov	937b92cd30	HTML reader: extract spaces inside links instead of trimming them Fixes #4845	2018-08-22 12:43:15 +03:00
John MacFarlane	3b5949e8f2	LaTeX reader: support blockcquote, foreignblockquote from csquotes. Also foreigncblockquote, hyphenblockquote, hyphencblockquote. Closes #4848. But note: currently foreignquote will be parsed as a regular Quoted inline (not using the quotes appropriate to the foreign language).	2018-08-21 21:03:43 -07:00
John MacFarlane	a733068ebf	LaTeX reader: support enquote*, foreignquote, hypphenquote... from csquotes. See #4848. Still TBD: blockquote, blockcquote, foreignblockquote.	2018-08-21 17:39:27 -07:00
John MacFarlane	42f4632e60	LaTeX reader: Support more text-mode accents. Add support for `\\|`, `\b`, `\G`, `\h`, `\d`, `\f`, `\r`, `\t`, `\U`, `\i`, `\j`, `\newtie`, `\textcircled`. Also fall back to combining characters when composed characters are not available. Closes #4652.	2018-08-17 23:19:38 -07:00
Marc Schreiber	175da00295	Add support for latex mintinline (#4365 )	2018-08-17 20:57:36 -07:00
John MacFarlane	1b66865763	LaTeX reader: fix siunitx unit commands... ...they should only be recognized in siunitx contexts. For example, `\l` outside of an siunitx context should be l-slash, not l (for liter)! Closes #4842.	2018-08-17 15:22:47 -07:00
John MacFarlane	13dea94a91	Markdown reader: Use "tex" instead of "latex" for raw tex-ish content. We can't always tell if it's LaTeX, ConTeXt, or plain TeX. Better just to use "tex" always. Also changed: ConTeXt writer: now outputs raw "tex" blocks as well as "context". (Closes #969). RST writer: uses ".. raw:: latex" for "tex" content. (RST doesn't support raw context anyway.) Note that if "context" or "latex" specifically is desired, you can still force that in a markdown document by using the raw attribute (see MANUAL.txt): ```{=latex} \foo ``` Note that this change may affect some filters, if they assume that raw tex parsed by the Markdown reader will be RawBlock (Format "latex"). In most cases it should be trivial to modify the filters to accept "tex" as well.	2018-08-15 10:25:12 -07:00
John MacFarlane	c27ce1e70e	LaTeX reader: handle parameter patterns for `\def`. For example: `\def\foo#1[#2]{#1 and #2}`. Closes #4768. Also fixes #4771. API change: in Text.Pandoc.Readers.LaTeX.Types, new type ArgSpec added. Second parameter of Macro constructor is now `[ArgSpec]` instead of `Int`.	2018-08-14 00:03:55 -07:00
John MacFarlane	919c50162c	RST writer: render Divs with admonition classes as admonitions. Also omit Div with class "admonition-title". These are generated by the RST reader and should be omitted on round-trip. Closes #4833.	2018-08-13 11:17:26 -07:00
John MacFarlane	6d14f53bd9	LaTeX reader: Allow `%` characters in URLs. This affects `\href` and `\url`. Closes #4832.	2018-08-12 16:46:48 -07:00
John MacFarlane	b76203ccf1	Markdown reader: Properly handle boolean values in YAML metadata. This fixes a regression in 2.2.3, which cause boolean values to be parsed as MetaInlines instead of MetaBool. Note also an undocumented (but desirable) change in 2.2.3: numbers are now parsed as MetaInlines rather than MetaString. Closes #4819.	2018-08-07 09:26:58 -07:00
John MacFarlane	94c3753c08	Fix parsing of embedded mappings in YAML metadata. This fixes a regression in 2.2.3 which caused embedded mappings (e.g. mappings in sequences) not to work in YAML metadata. Closes #4817.	2018-08-06 12:32:04 -07:00
John MacFarlane	581a3514ca	RST reader: improve parsing of inline interpreted text roles. * Use a Span with class "title-reference" for the default title-reference role. * Use B.text to split up contents into Spaces, SoftBreaks, and Strs for title-reference. * Use Code with class "interpreted-text" instead of Span and Str for unknown roles. (The RST writer has also been modified to round-trip this properly.) * Disallow blank lines in interpreted text. * Backslash-escape now works in interpreted text. * Backticks followed by alphanumerics no longer end interpreted text. Closes #4811.	2018-08-05 09:56:43 -07:00
John MacFarlane	f7dc3e7487	Added test case for #4669 to repository.	2018-08-05 09:21:45 -07:00
danse	be2d7921cb	RST reader: remove support for nested inlines. RST does not allow nested emphasis, links, or other inline constructs. Closes #4581, double parsing of links with URLs as link text. This supersedes the earlier fix for #4581 in `6419819b46`. Fixes #4561, a bug parsing with URLs inside emphasis. Closes #4792.	2018-07-24 15:35:50 -07:00
John MacFarlane	50e8c3b107	MediaWiki writer: Avoid extra blank line in tables with empty cells. Note that the old output is semantically identical, but the new output looks better. Closes #4794.	2018-07-24 11:38:24 -07:00
John MacFarlane	6419819b46	RST reader: fix double-link bug. Link labels containing raw URLs were parsed as autolinks, but links within links are not allowed. Closes #4581.	2018-07-21 22:53:04 -07:00
John MacFarlane	34b229dd5a	Fix for bug in parsing `\include` in markdown. Starting in 2.2.2, everything after an `\input` (or `\include`) in a markdown file would be parsed as raw LaTeX. This commit fixes the issue and adds a regression test. Closes #4781.	2018-07-19 17:44:16 -07:00
John MacFarlane	af445b34d8	Make markdown and github writers respect the `emoji` extension.	2018-07-15 16:02:46 -07:00
Anders Waldenborg	ec30fb37c1	Wrap emojis in span nodes (#4759 ) Text.Pandoc.Emoji now exports `emojiToInline`, which returns a Span inline containing the emoji character and some attributes with metadata (class `emoji`, attribute `data-emoji` with emoji name). Previously, emojis (as supported in Markdown and CommonMark readers, e.g "😄") were simply translated into the corresponding unicode code point. By wrapping them in Span nodes, we make it possible to do special handling such as giving them a special font in HTML output. We also open up the possibility of treating them differently when the `--ascii` option is selected (though that is not part of this commit). Closes #4743.	2018-07-15 15:14:40 -07:00
Mauro Bieg	5809d5bef2	AsciiDoc Writer: escape square brackets at start of line (#4708 ) closes #4545	2018-07-12 19:37:37 +02:00
Alexander Krotov	41cf6d540f	More spellcheck	2018-07-02 19:07:28 +03:00
John MacFarlane	016e0a09e2	RST writer: don't treat 'example' as a syntax name. This fixes conversions from org with example blocks. Closes #4748.	2018-06-30 11:45:49 +02:00
Anders Waldenborg	904924d172	CommonMark reader: Handle ascii_identifiers extension (#4733 ) Non-ascii characters were not stripped from identifiers even if the `ascii_identifiers` extension was enabled (which is is by default for gfm). Closes #4742	2018-06-29 10:41:26 +02:00
Mauro Bieg	0459d1be26	TikiWiki reader: improve list parsing (#4723 ) - remove trailing Space from list items - parse lists that have no space after marker (fixes #4722)	2018-06-28 13:35:54 +02:00
John MacFarlane	48a505c5a0	Markdown reader: allow empty code spans. E.g. `` ` ` ``.	2018-06-13 11:12:10 -07:00
Mauro Bieg	7e477db95c	LaTeX Reader: parse figure label into Image id (#4704 ) closes #4700	2018-06-13 10:41:30 -07:00
John MacFarlane	a41222db3e	Adjust command test not to use echo. This is fraught on Windows.	2018-06-11 10:53:56 -07:00
Mauro Bieg	905dee6ee3	beamer output: fix single digit column percentage (#4691 ) fixes #4690	2018-06-07 10:50:14 -07:00
John MacFarlane	d32e866449	LaTeX reader: handle includes without surrounding blanklines. In addition, `\input` can now be used in an inline context, e.g. to provide part of a paragraph, as it can in LaTeX. Closes #4553.	2018-06-01 09:25:10 -07:00
John MacFarlane	7119715a6a	LaTeX reader `rawLaTeXBlock`: handle macros that resolve to a... ...`\begin` or `\end`. Fixes #4667.	2018-05-30 12:49:01 -07:00
John MacFarlane	252ab9b773	Markdown writer: preserve `implicit_figures` with attributes... ...even if `implicit_attributes` is not set, by rendering in raw HTML. Fixes #4677.	2018-05-30 09:24:52 -07:00
John MacFarlane	58447bba98	rawLaTeXBlock: don't expand macros in macro definitions! Closes #4653. Note that this only affected LaTeX in markdown. Added regression test.	2018-05-15 09:19:13 -07:00
John MacFarlane	a00ca6f0d8	Removed inadvertently added .orig files from repository. These were added by `96d10c72cc` Closes #4648.	2018-05-11 17:10:32 -07:00
John MacFarlane	d3be567a73	Fix regression with tex math environments in HTML + MathJax. Closes #4639.	2018-05-09 10:37:04 -07:00
John MacFarlane	81881ce470	Parsing: Lookahead for non-whitespace after single/double quote start. Closes #4637.	2018-05-09 10:00:34 -07:00
John MacFarlane	44f1c72b28	Add test for #4576 . Closes #4576.	2018-05-08 09:14:58 -07:00
John MacFarlane	a96c762a10	RST reader: fix anonymous redirects with backticks. Closes #4598.	2018-04-26 12:23:25 -07:00
John MacFarlane	aba0f7e063	Add tests for #4589 and #4594 (currently failing).	2018-04-25 23:04:08 -07:00
John MacFarlane	dab3330a58	RST reader: allow < 3 spaces indent under directives. Closes #4579.	2018-04-22 12:20:25 -07:00
John MacFarlane	7fbe473b2e	Markdown reader/writer: spacing adjustments in tables. * Markdown writer now includes a blank line at the end of the row in a single-row multiline table, to prevent it from being interpreted as a simple table. Closes #4578. * Markdown reader does a better job computing the relative width of the last column in a multiline table, so we can round-trip tables without constantly shrinking the last column.	2018-04-21 13:06:57 -07:00
John MacFarlane	276894a2f2	RST writer: use more consistent indentation. Previously we used an odd mix of 3- and 4-space indentation. Now we use 3-space indentation, except for ordered lists, where indentation must depend on the width of the list marker. Closes #4563.	2018-04-19 13:47:16 -07:00
John MacFarlane	d5b98c8c6e	Man writer: Don't escape U+2019 as '. Closes #4550.	2018-04-14 10:42:05 -07:00
John MacFarlane	d77e8f45c9	LaTEX reader: properly resolve section numbers with \ref and chapters. Closes #4529.	2018-04-05 10:14:06 -07:00
quasicomputational	13538ce6eb	CommonMark writer: correctly ignore LaTeX raw blocks when not raw_tex (#4533 ) Issue #4527.	2018-04-05 08:53:42 -07:00
Marc Schreiber	16523ea3d1	LaTeX reader: parse sloppypar environment (#4517 )	2018-04-02 09:33:29 -07:00
John MacFarlane	b9602766d8	Textile reader: fixed tables with no body rows. Previously these raised an exception. Closes #4513.	2018-03-30 14:56:36 -07:00
John MacFarlane	5a79948e0c	Mediawiki reader: improve table parsing. This fixes detection of table attributes and also handles `!` characters in cells. Closes #4508.	2018-03-28 08:59:34 -07:00
Marc Schreiber	155a2ac039	Add support to parse unit string of \SI command (closes #4296 ).	2018-03-17 20:59:20 -07:00
Francesco Occhipinti	e5845f33ad	Don't wrap lines in grid tables when `--wrap=none` (#4320 ) * Annotate gridTable code with comments and abstract small functions * Don't wrap lines in tables when `--wrap=none`. Instead, expand cells, even if it results in cells that don't respect relative widths or surpass page column width. * This change affects RST, Markdown, and Haddock writers.	2018-03-17 20:31:43 -07:00
John MacFarlane	b76c0e6a4a	RST reader: Allow unicode bullet characters. Closes #4454.	2018-03-14 17:33:00 -07:00
John MacFarlane	17725a0661	Beamer: put hyperlink after `\begin{frame}`. and not in the title. If it's in the title, then we get a titlebar on slides with the `plain` attribute, when the id is non-null. This fixes a regression from 1.9.x. Closes #4307.	2018-03-13 10:03:51 -07:00
John MacFarlane	adefd86cd4	LaTeX reader: Fix regression in package options including underscore. Closes #4424.	2018-03-02 09:33:18 -08:00
John MacFarlane	377640402f	LaTeX reader: Fixed comments inside citations. Closes #4374 .	2018-02-17 23:06:54 -08:00
Henri Menke	751b5ad010	ConTeXt writer: new section syntax and --section-divs (#4295 ) Fixes #2609. This PR introduces the new-style section headings: `\section[my-header]{My Header}` -> `\section[title={My Header},reference={my-header}]`. On top of this, the ConTeXt writer now supports the `--section-divs` option to write sections in the fenced style, with `\startsection` and `\stopsection`.	2018-01-25 11:56:28 -08:00
John MacFarlane	ac08a887cf	Markdown reader: Fix parsing bug with nested fenced divs. Closes #4281. Previously we allowed "nonindent spaces" before the opening and closing `:::`, but this interfered with list parsing, so now we require the fences to be flush with the margin of the containing block.	2018-01-20 14:44:08 -08:00
John MacFarlane	957c0e110d	RST reader: fix parsing of headers with trailing space. This was a regression in pandoc 2.0. Closes #4280.	2018-01-20 11:10:09 -08:00
John MacFarlane	ca8cd38bdc	Markdown reader: don't coalesce adjacent raw LaTeX blocks... if they are separated by a blank line. See lierdakil/pandoc-crossref#160 for motivation.	2018-01-17 09:22:35 -08:00
John MacFarlane	615a99c2c2	RST reader: add aligned environment when needed in math. rst2latex.py uses an align* environment for math in `.. math::` blocks, so this math may contain line breaks. If it does, we put the math in an `aligned` environment to simulate rst2latex.py's behavior. Closes #4254.	2018-01-14 15:11:11 -08:00
John MacFarlane	d9584d73f9	Markdown reader: Improved inlinesInBalancedBrackets. The change both improves performance and fixes a regression whereby normal citations inside inline notes were not parsed correctly. Closes jgm/pandoc-citeproc#315.	2018-01-14 12:24:21 -08:00
John MacFarlane	e7d95cadf5	LaTeX reader: pass through macro defs in rawLaTeXBlock... even if the `latex_macros` extension is set. This reverts to earlier behavior and is probably safer on the whole, since some macros only modify things in included packages, which pandoc's macro expansion can't modify. Closes #4246.	2018-01-13 22:12:32 -08:00
John MacFarlane	dca0032b0e	LaTeX reader: allow macro definitions inside macros. Previously we went into an infinite loop with ``` \newcommand{\noop}[1]{#1} \noop{\newcommand{\foo}[1]{#1}} \foo{hi} ``` See #4253.	2018-01-13 12:22:25 -08:00
John MacFarlane	49007ded7b	RST reader: better handling for headers with an anchor. Instead of creating a div containing the header, we put the id directly on the header. This way header promotion will work properly. Closes #4240.	2018-01-10 12:07:33 -08:00
John MacFarlane	13f7c2cf83	Fixed a test case so it works on windows too.	2018-01-09 17:45:02 -08:00
John MacFarlane	e3f01235e9	HTML writer: Fixed footnote backlinks with --id-prefix. Closes #4235.	2018-01-09 15:29:27 -08:00
John MacFarlane	3c93ac5cf0	LaTeX reader: be more tolerant of `&` character. This allows us to parse unknown tabular environments as raw LaTeX. Closes #4208.	2017-12-28 08:41:53 -08:00
John MacFarlane	acfa846aab	Merge pull request #4184 from mb21/html-reader-figcaption HTML Reader: be more forgiving about figcaption	2017-12-27 13:33:00 -07:00
John MacFarlane	a888083ee1	HTML reader: parse div with class `line-block` as LineBlock. See #4162.	2017-12-27 12:26:15 -08:00
John MacFarlane	9e1d86638c	LaTeX reader: support `\foreignlanguage` from babel.	2017-12-26 10:57:57 -08:00
John MacFarlane	ee5fe9bf2c	RST reader: allow empty list items (as docutils does). Closes #4193.	2017-12-24 13:02:18 -08:00
mb21	9b54b94612	HTML Reader: be more forgiving about figcaption fixes #4183	2017-12-23 09:42:04 +01:00
John MacFarlane	28b736bf95	`latex_macros` extension changes. Don't pass through macro definitions themselves when `latex_macros` is set. The macros have already been applied. If `latex_macros` is enabled, then `rawLaTeXBlock` in Text.Pandoc.Readers.LaTeX will succeed in parsing a macro definition, and will update pandoc's internal macro map accordingly, but the empty string will be returned. Together with earlier changes, this closes #4179.	2017-12-22 18:03:51 -08:00
John MacFarlane	4a07977715	Markdown reader: improved raw tex parsing. + Preserve original whitespace between blocks. + Recognize `\placeformula` as context.	2017-12-22 18:03:51 -08:00
John MacFarlane	9758720a24	RST writer: fix anchors for headers. We were missing an `_`. See #4188.	2017-12-22 10:36:37 -08:00
Alexander Krotov	d035689a06	Org writer: do not wrap "-" to avoid accidental bullet lists Also add TODO for ordered lists.	2017-12-21 16:36:29 +03:00
Alexander Krotov	1e21cfb251	Muse writer: don't wrap note references to the next line Closes #4172.	2017-12-19 13:30:48 +03:00
Alexander Krotov	ef8430e702	Fix for #4171 fix: don't wrap note references after SoftBreak	2017-12-19 13:30:48 +03:00
John MacFarlane	c0cc9270cb	Org writer: don't allow fn refs to wrap to beginning of line. Otherwise they can be interpreted as footnote definitions. Closes #4171.	2017-12-18 16:33:52 -08:00
John MacFarlane	808f6d3fa1	OPML reader: enable raw HTML and other extensions by default for notes. This fixes a regression in 2.0. Note that extensions can now be individually disabled, e.g. `-f opml-smart-raw_html`. Closes #4164.	2017-12-17 09:52:53 -08:00
John MacFarlane	044d58bb24	Fixed regression in LateX tokenization. This mainly affects the Markdown reader when parsing raw LaTeX with escaped spaces. Closes #4159.	2017-12-15 09:45:29 -08:00
John MacFarlane	b94f1e2045	RST reader: more accurate parsing of references. Previously we erroneously included the enclosing backticks in a reference ID (closes #4156). This change also disables interpretation of syntax inside references, as in docutils. So, there is no emphasis in `my link`_	2017-12-14 12:48:43 -08:00
John MacFarlane	7093a3b44c	Markdown: Improved computation of relative cell widths in pipe tables.	2017-12-12 15:36:29 -08:00
John MacFarlane	67b6abc806	LaTeX reader: fix \ before newline. This should be a nonbreaking space, as long as it's not followed by a blank line. This has been fixed at the tokenizer level. Closes #4134.	2017-12-08 16:34:15 -08:00
John MacFarlane	f6007e7146	Markdown reader: accept processing instructions as raw HTML. Closes #4125.	2017-12-06 16:05:50 -08:00
John MacFarlane	6a2562efb5	Rewrite empty_paragraphs test so it will run on Windows.	2017-12-04 15:41:09 -08:00
John MacFarlane	fac3953abf	Markdown reader: Don't parse native div as table caption. Closes #4119.	2017-12-04 15:04:47 -08:00
John MacFarlane	ae60e0196c	Add `empty_paragraphs` extension. * Deprecate `--strip-empty-paragraphs` option. Instead we now use an `empty_paragraphs` extension that can be enabled on the reader or writer. By default, disabled. * Add `Ext_empty_paragraphs` constructor to `Extension`. * Revert "Docx reader: don't strip out empty paragraphs." This reverts commit `d6c58eb836`. * Implement `empty_paragraphs` extension in docx reader and writer, opendocument writer, html reader and writer. * Add tests for `empty_paragraphs` extension.	2017-12-04 14:56:57 -08:00
John MacFarlane	03496d1810	Test for #4113 . Closes #4113.	2017-12-03 20:15:40 -08:00
John MacFarlane	1193c1a505	LaTeX writer: allow specifying just width or height for image size. Previously both needed to be specified (unless the image was being resized to be smaller than its original size). If height but not width is specified, we now set width to textwidth (and similarly if width but not height is specified). Since we have keepaspectratio, this yields the desired result.	2017-12-01 21:18:29 -08:00
John MacFarlane	b2a190546d	Revert "LaTeX writer: Add keepaspectratio to includegraphics..." This reverts commit `171187a452`.	2017-12-01 13:51:33 -08:00
John MacFarlane	171187a452	LaTeX writer: Add keepaspectratio to includegraphics... ...if only one of height/width is given.	2017-11-30 16:03:28 -08:00
John MacFarlane	03ddac451e	Support beamer `\alert` in LaTeX reader. Closes #4091 .	2017-11-29 21:30:13 -08:00
John MacFarlane	508aab0bd5	Text.Pandoc.Parsing.uri: allow `&` and `=` as word characters. This fixes a bug where pandoc would stop parsing a URI with an empty attribute: for example, `&a=&b=` wolud stop at `a`. (The uri parser tries to guess which punctuation characters are part of the URI and which might be punctuation after it.) Closes #4068.	2017-11-14 22:08:14 -08:00
John MacFarlane	51897937cd	LaTeX reader: allow optional arguments on `\footnote`. Closes #4062.	2017-11-13 21:19:38 -08:00
John MacFarlane	8d6e0e516a	Markdown writer: fix bug with doubled footnotes in grid tables. Closes #4061.	2017-11-13 21:12:04 -08:00
John MacFarlane	eeaa3b048c	LaTeX reader: support column specs like `*{2}{r}`. This is equivalent to `rr`. We now expand it like a macro. Closes #4056.	2017-11-12 14:46:29 -08:00
John MacFarlane	7ba0ae8b4d	LaTeX reader: allow optional args for parbox. See #4056.	2017-11-12 14:19:58 -08:00
John MacFarlane	fb5ba1bb00	Fixed YAML metadata with "chomp" (`\|-`). Previously if a YAML block under `\|-` contained a blank line, pandoc would not parse it as metadata.	2017-11-11 10:17:53 -05:00
John MacFarlane	1592d38821	Allow fenced code blocks to be indented 1-3 spaces. This brings our handling of them into alignment with CommonMark's. Closes #??.	2017-11-09 23:22:44 -05:00
John MacFarlane	fef5770591	Fix regression with --metadata. It should replace a metadata value set in the document itself, rather than creating a list including a new value. Closes #4054.	2017-11-08 21:54:23 -08:00
John MacFarlane	8e53489cbc	Fix strikethrough in gfm writer. Previously we got a crash, because we were trying to print a native cmark STRIKETHROUGH node, and the commonmark writer in cmark-github doesn't support this. Work around this by using a raw node to add the strikethrough delimiters. Closes #4038.	2017-11-04 10:35:52 -07:00
John MacFarlane	642d603666	Improved support for columns in HTML. * Move as much as possible to the CSS in the template. * Ensure that all the HTML-based templates (including epub) contain the CSS for columns. * Columns default to 50% width unless they are given a width attribute. Closes #4028.	2017-11-02 20:57:05 -07:00
John MacFarlane	6d00e6e8c3	Fixed revealjs slide column width issues. * Remove "width" attribute which is not allowed on div. * Remove space between `<div class="column">` elements, since this prevents columns whose widths sum to 100% (the space takes up space). Closes #4028.	2017-11-02 10:23:04 -07:00
John MacFarlane	ed3d466384	Really fix #3989 . The previous fix only worked in certain cases. Other cases with `>` in an HTML attribute broke.	2017-11-01 09:27:51 -07:00
John MacFarlane	f1ebdb8145	Updated command test for #3989 . We didn't fix it completely before.	2017-11-01 09:15:15 -07:00
John MacFarlane	fb6e5812bc	Fixed regression in parsing of HTML comments in markdown... and other non-HTML formats (`Text.Pandoc.Readers.HTML.htmlTag`). The parser stopped at the first `>` character, even if it wasn't the end of the comment. Closes #4019.	2017-10-31 21:14:38 -07:00
John MacFarlane	0e57b8b85d	Add Millimeter constructor to Dimension in ImageSize. Minor API change. Now sizes given in 'mm' are no longer converted to 'cm'. Closes #4012.	2017-10-31 11:58:43 -07:00
John MacFarlane	5f9f458df3	LaTeX reader: handle `%` comment right after command. For example \emph% {hi}	2017-10-31 11:31:35 -07:00
John MacFarlane	556c6c2c6d	Markdown reader: make sure fenced div closers work in lists. Previously the following failed: ::: {.class} 1. one 2. two ::: and you needed a blank line before the closing `:::`.	2017-10-31 10:57:20 -07:00
John MacFarlane	81610144f9	Make `fenced_divs` affect the Markdown writer. If `fenced_divs` is enabled, fenced divs will be used.	2017-10-31 10:57:20 -07:00
John MacFarlane	244b42dbaf	Added failing command test for #4007 .	2017-10-30 11:04:40 -07:00
John MacFarlane	513b16a71b	Fenced divs: ensure that paragraph at end doesn't become Plain. Added test case.	2017-10-24 09:53:29 -07:00
John MacFarlane	ecb5475a2a	Back to using [WARNING] and [INFO] to mark messages.	2017-10-23 23:01:37 -07:00
John MacFarlane	fda0c0119f	Implemented fenced Divs. + Added Ext_fenced_divs to Extensions (default for pandoc Markdown). + Document fenced_divs extension in manual. + Implemented fenced code divs in Markdown reader. + Added test. Closes #168.	2017-10-23 22:45:28 -07:00
John MacFarlane	896803b0d5	HTML reader: `htmlTag` improvements. We previously failed on cases where an attribute contained a `>` character. This patch fixes the bug. Closes #3989.	2017-10-23 17:29:32 -07:00
John MacFarlane	1a82ecbb68	More pleasing presentation of warnings and info messages. !! warning -- info	2017-10-23 15:00:11 -07:00
John MacFarlane	cecf02e326	Fixed test for change in log level.	2017-10-23 11:20:22 -07:00
mb21	e2123a4033	LaTeX Reader: support \lettrine	2017-10-22 20:33:30 +02:00
John MacFarlane	28bb5d610d	LaTeX reader: support `\expandafter`. Closes #3983.	2017-10-19 13:23:50 -07:00
John MacFarlane	61641f996f	Revised command test 3971 to work with Windows.	2017-10-16 22:51:25 -07:00
John MacFarlane	c40857b389	Improved handling of include files in LaTeX reader. Previously `\include` wouldn't work if the included file contained, e.g., a begin without a matching end. We've changed the Tok type so that it stores a full SourcePos, rather than just a line and column. So tokens keeep track of the file they came from. This allows us to use a simpler method for includes, which doesn't require parsing the included document as a whole. Closes #3971.	2017-10-16 22:05:34 -07:00
John MacFarlane	9cf9a64923	RST writer: correctly handle inline code containing backticks. (Use a :literal: role.) Closes #3974.	2017-10-16 20:54:43 -07:00
John MacFarlane	cba18c19a6	RST writer: don't backslash-escape word-internal punctuation. Closes #3978.	2017-10-16 20:39:19 -07:00
John MacFarlane	75d8c99c73	ConTeXt writer: Use identifiers for chapters. Closes #3968.	2017-10-11 20:21:55 -07:00
John MacFarlane	8cd1e00bbc	Add test - closes #3958 .	2017-10-08 21:57:26 -07:00
John MacFarlane	492f496842	Markdown reader: Fixed bug with indented code following raw LaTeX. Closes #3947.	2017-10-02 21:28:14 -07:00
John MacFarlane	2314534d4d	RST writer: add header anchors when header has non-standard id. Closes #3937.	2017-09-27 20:42:04 -07:00
John MacFarlane	b1ee747a24	Added `--strip-comments` option, `readerStripComments` in `ReaderOptions`. * Options: Added readerStripComments to ReaderOptions. * Added `--strip-comments` command-line option. * Made `htmlTag` from the HTML reader sensitive to this feature. This affects Markdown and Textile input. Closes #2552.	2017-09-17 13:01:27 -07:00
John MacFarlane	4177ee8626	Textile reader: allow 'pre' code in list item. Closes #3916.	2017-09-12 08:58:47 -07:00
John MacFarlane	5fc4980216	Markdown writer: Escape pipe characters when `pipe_tables` enabled. Closes #3887.	2017-09-07 22:10:13 -07:00
Albert Krewinkel	6a6c3858b4	Org writer: stop using raw HTML to wrap divs Div's are difficult to translate into org syntax, as there are multiple div-like structures (drawers, special blocks, greater blocks) which all have their advantages and disadvantages. Previously pandoc would use raw HTML to preserve the full div information; this was rarely useful and resulted in visual clutter. Div-rendering was changed to discard the div's classes and key-value pairs if there is no natural way to translate the div into an org structure. Closes: #3771	2017-09-01 00:08:12 +02:00
John MacFarlane	8fcf66453c	RST reader: Fixed `..include::` directive. Closes #3880.	2017-08-27 17:09:55 -07:00
John MacFarlane	1b3431a165	LaTeX reader: improved support for \hyperlink, \hypertarget. Closes #2549.	2017-08-25 22:04:57 -07:00
John MacFarlane	d70b89c0d9	Use pandoc-types 1.17.1. Tests updated for new simpleTable behavior... with empty headers.	2017-08-20 23:24:51 -07:00
John MacFarlane	9cc128b579	LaTeX reader: Set identifiers on Spans used for \label.	2017-08-20 16:52:03 -07:00
John MacFarlane	a31241a08b	Markdown reader: use CommonMark rules for list item nesting. Closes #3511. Previously pandoc used the four-space rule: continuation paragraphs, sublists, and other block level content had to be indented 4 spaces. Now the indentation required is determined by the first line of the list item: to be included in the list item, blocks must be indented to the level of the first non-space content after the list marker. Exception: if are 5 or more spaces after the list marker, then the content is interpreted as an indented code block, and continuation paragraphs must be indented two spaces beyond the end of the list marker. See the CommonMark spec for more details and examples. Documents that adhere to the four-space rule should, in most cases, be parsed the same way by the new rules. Here are some examples of texts that will be parsed differently: - a - b will be parsed as a list item with a sublist; under the four-space rule, it would be a list with two items. - a code Here we have an indented code block under the list item, even though it is only indented six spaces from the margin, because it is four spaces past the point where a continuation paragraph could begin. With the four-space rule, this would be a regular paragraph rather than a code block. - a code Here the code block will start with two spaces, whereas under the four-space rule, it would start with `code`. With the four-space rule, indented code under a list item always must be indented eight spaces from the margin, while the new rules require only that it be indented four spaces from the beginning of the first non-space text after the list marker (here, `a`). This change was motivated by a slew of bug reports from people who expected lists to work differently (#3125, #2367, #2575, #2210, #1990, #1137, #744, #172, #137, #128) and by the growing prevalance of CommonMark (now used by GitHub, for example). Users who want to use the old rules can select the `four_space_rule` extension. * Added `four_space_rule` extension. * Added `Ext_four_space_rule` to `Extensions`. * `Parsing` now exports `gobbleAtMostSpaces`, and the type of `gobbleSpaces` has been changed so that a `ReaderOptions` parameter is not needed.	2017-08-19 15:45:01 -07:00
John MacFarlane	5ab1162def	Markdown reader: fixed parsing of fenced code after list... ...when there is no intervening blank line. Closes #3733.	2017-08-18 21:46:55 -07:00
John MacFarlane	bfbdfa646a	LaTeX reader: implement \newtoggle, \iftoggle, \toggletrue\|false from etoolbox. Closes #3853.	2017-08-18 10:13:41 -07:00
John MacFarlane	d1444b4ecd	RST reader/writer: support unknown interpreted text roles... ...by parsing them as Span with "role" attributes. This way they can be manipulated in the AST. Closes #3407.	2017-08-17 16:01:44 -07:00
John MacFarlane	b1f6fb4af5	HTML reader: support column alignments. These can be set either with a `width` attribute or with `text-width` in a `style` attribute. Closes #1881.	2017-08-17 12:08:32 -07:00
John MacFarlane	db715ca847	LaTeX reader: use Link instead of Span for `\ref`. This makes more sense semantically and avoids unnecessary Span [Link] nestings when references are resolved.	2017-08-16 10:56:12 -07:00
schrieveslaach	cf4b40162d	LaTeX reader: add Support for `glossaries` and `acronym` package (#3589 ) Acronyms are not resolved by the reader, but acronym and glossary information is put into attributes on Spans so that they can be processed in filters.	2017-08-16 10:24:46 -07:00
John MacFarlane	68434957d6	Fixed command test #2994 on Windows.	2017-08-16 09:47:25 -07:00
John MacFarlane	892a4edeb1	Implement multicolumn support for slide formats. The structure expected is: <div class="columns"> <div class="column" width="40%"> contents... </div> <div class="column" width="60%"> contents... </div> </div> Support has been added for beamer and all HTML slide formats. Closes #1710. Note: later we could add a more elegant way to create this structure in Markdown than to use raw HTML div elements. This would come for free with a "native div syntax" (#168). Or we could devise something specific to slides	2017-08-14 23:17:44 -07:00
John MacFarlane	319d7ed6ff	Changed command test for #2994 so it actually tests the writer.	2017-08-14 00:00:50 -07:00
schrieveslaach	2845ab5976	Put content of \ref, \label commands into span… (#3639 ) * Put content of `\ref` and `\label` commands into Span elements so they can be used in filters. * Add support for `\eqref`	2017-08-13 10:58:45 -07:00
John MacFarlane	8f65590ce9	CommonMark writer: prefer pipe tables to HTML tables... ...even if it means losing relative column width information. See #3734.	2017-08-13 10:43:43 -07:00
John MacFarlane	506866ef73	Markdown writer: Use pipe tables if `raw_html` disabled... and `pipe_tables` enabled, even if the table has relative width information. Closes #3734.	2017-08-13 10:37:24 -07:00
John MacFarlane	418bda8128	Docx writer: pass through comments. We assume that comments are defined as parsed by the docx reader: I want <span class="comment-start" id="0" author="Jesse Rosenthal" date="2016-05-09T16:13:00Z">I left a comment.</span>some text to have a comment <span class="comment-end" id="0"></span>on it. We assume also that the id attributes are unique and properly matched between comment-start and comment-end. Closes #2994.	2017-08-12 22:59:53 -07:00
John MacFarlane	be9957bddc	Escape MetaString values (as added with --metadata flag). Previously they would be transmitted to the template without any escaping. Note that `--M title='foo'` yields a different result from --- title: foo --- In the latter case, we have emphasis; in the former case, just a string with literal asterisks (which will be escaped in formats, like Markdown, that require it). Closes #3792.	2017-08-12 20:27:42 -07:00
John MacFarlane	0ab8670a0e	LaTeX reader: Fixed space after \figurename etc.	2017-08-12 13:40:28 -07:00
John MacFarlane	467ca2a1ad	Fixed data-dir on translations tests.	2017-08-12 10:39:25 -07:00
John MacFarlane	dbb81f513c	More translation tests.	2017-08-11 23:59:27 -07:00
John MacFarlane	9abb688f29	Added simple test for translations.	2017-08-11 23:57:28 -07:00
John MacFarlane	dee4cbc854	RST reader: implement csv-table directive. Most attributes are supported, including `:file:` and `:url:`. A (probably insufficient) test case has been added. Closes #3533.	2017-08-10 15:01:14 -07:00
John MacFarlane	09b7df472d	LaTeX reader: Use `label` instead of `data-label` for label in caption. See `d441e656db`, #3639.	2017-08-09 09:15:50 -07:00
John MacFarlane	1ad9679dc9	CommonMark writer: avoid excess blank lines at end of output.	2017-08-08 14:00:13 -07:00
John MacFarlane	3752298d91	Thread options through CommonMark reader. This is more efficient than doing AST traversals for emojis and hard breaks. Also make behavior sensitive to `raw_html` extension.	2017-08-08 13:55:19 -07:00
John MacFarlane	b6f7c4930b	CommonMark writer: support `hard_line_breaks`, `smart`. Add tests.	2017-08-08 13:18:27 -07:00
John MacFarlane	2c0e989f9d	Markdown reader: fixed spurious parsing as citation as reference def. We now disallow reference keys starting with `@` if the `citations` extension is enabled. Closes #3840.	2017-08-07 21:00:57 -07:00
John MacFarlane	c806ef1b15	LaTeX reader: Support simple `\def` macros. Note that we still don't support macros with fancy parameter delimiters, like \def\foo#1..#2{...}	2017-08-07 16:06:19 -07:00
John MacFarlane	9e6b9cdc5f	LaTeX reader: Support `\let`. Also, fix regular macros so they're expanded at the point of use, and NOT also the point of definition. `\let` macros, by contrast, are expanded at the point of definition. Added an `ExpansionPoint` field to `Macro` to track this difference.	2017-08-07 13:38:15 -07:00
John MacFarlane	ced834076d	DokuWiki reader: better handling for code block in list item. Closes #3824.	2017-08-02 10:33:08 -07:00
John MacFarlane	303d10d07b	Small tweak in test (add --wrap=preserve).	2017-07-26 12:55:15 +02:00
John MacFarlane	e0ab09611a	HTML writer: render raw inline environments when --mathjax used. We previously did this only with raw blocks, on the assumption that math environments would always be raw blocks. This has changed since we now parse them as inline environments. Closes #3816.	2017-07-26 12:50:36 +02:00
John MacFarlane	d441e656db	HTML writer: insert data- in front of unsupported attributes. Thus, a span with attribute 'foo' gets written to HTML5 with 'data-foo', so it is valid HTML5. HTML4 is not affected. This will allow us to use custom attributes in pandoc without producing invalid HTML.	2017-07-25 13:13:24 +02:00
John MacFarlane	2b039acb4e	Merge branch 'textcolor-support' of https://github.com/schrieveslaach/pandoc into schrieveslaach-textcolor-support	2017-07-25 11:42:10 +02:00
John MacFarlane	329b61ff5c	LaTeX reader: support etoolbox's ifstrequal.	2017-07-24 11:20:59 +02:00
John MacFarlane	439ffc2e7f	Added a test case with `markdown-latex_macros`.	2017-07-24 00:02:55 +02:00
John MacFarlane	be14e2b501	LaTeX reader: some improvements in macro parsing. Fixed applyMacros so that it operates on the whole string, not just the first token! Don't remove macro definitions from the output, even if Ext_latex_macros is set, so that macros will be applied. Since they're only applied to math in Markdown, removing the macros can have bad effects. Even for math macros, keeping them should be harmless.	2017-07-24 00:02:55 +02:00
Mauro Bieg	7d9b782f73	HTML Reader: parse figure and figcaption (#3813 )	2017-07-22 19:22:56 +02:00
John MacFarlane	7191fe1f29	LaTeX reader: handle optional args in raw `\titleformat`. Closes #3804.	2017-07-21 09:28:36 +02:00
John MacFarlane	56f63af3f6	LaTeX reader: fixed regression with starred environment names. Closes #3803.	2017-07-19 17:30:22 +02:00
schrieveslaach	911b63dfc3	Add LaTeX xspace support (#3797 )	2017-07-13 20:56:59 +02:00
Marc Schreiber	f93d7d06f6	Merge branch 'master' of https://github.com/jgm/pandoc into textcolor-support	2017-07-13 11:51:40 +02:00
John MacFarlane	013fd1c6b6	Make sure \write18 is parsed as raw LaTeX. The change is in the LaTeX reader's treatment of raw commands, but it also affects the Markdown reader.	2017-07-12 14:50:49 +02:00
John MacFarlane	41209ea676	HTML reader: Ensure that paragraphs are closed properly... when the parent block element closes, even without `</p>`. Closes #3794.	2017-07-11 15:52:38 +02:00
John MacFarlane	0feb7504b1	Rewrote LaTeX reader with proper tokenization. This rewrite is primarily motivated by the need to get macros working properly. A side benefit is that the reader is significantly faster (27s -> 19s in one benchmark, and there is a lot of room for further optimization). We now tokenize the input text, then parse the token stream. Macros modify the token stream, so they should now be effective in any context, including math. Thus, we no longer need the clunky macro processing capacities of texmath. A custom state LaTeXState is used instead of ParserState. This, plus the tokenization, will require some rewriting of the exported functions rawLaTeXInline, inlineCommand, rawLaTeXBlock. * Added Text.Pandoc.Readers.LaTeX.Types (new exported module). Exports Macro, Tok, TokType, Line, Column. [API change] * Text.Pandoc.Parsing: adjusted type of `insertIncludedFile` so it can be used with token parser. * Removed old texmath macro stuff from Parsing. Use Macro from Text.Pandoc.Readers.LaTeX.Types instead. * Removed texmath macro material from Markdown reader. * Changed types for Text.Pandoc.Readers.LaTeX's rawLaTeXInline and rawLaTeXBlock. (Both now return a String, and they are polymorphic in state.) * Added orgMacros field to OrgState. [API change] * Removed readerApplyMacros from ReaderOptions. Now we just check the `latex_macros` reader extension. * Allow `\newcommand\foo{blah}` without braces. Fixes #1390. Fixes #2118. Fixes #3236. Fixes #3779. Fixes #934. Fixes #982.	2017-07-07 12:36:00 +02:00
John MacFarlane	e574d50b1c	Markdown writer: Ensure that `+` and `-` are escaped properly... so they don't cause spurious lists. Previously they were only if succeeded by a space, not if they were at end of line. Closes #3773.	2017-06-30 17:41:25 +02:00
John MacFarlane	33a29fbf87	RST reader: support anchors. E.g. `hello` .. _hello: paragraph This is supported by putting "paragraph" in a Div with id `hello`. Closes #262.	2017-06-27 15:03:16 +02:00
John MacFarlane	563c9c8687	RST reader: Handle chained link definitions. For example, .. _hello: .. _goodbye: example.com Here both `hello` and `goodbye` should link to `example.com`. Fixes the first part of #262.	2017-06-27 14:35:03 +02:00
John MacFarlane	5812ac0390	Markdown reader: interpret YAML metadata as Inlines when possible. If the metadata field is all on one line, we try to interpret it as Inlines, and only try parsing as Blocks if that fails. If it extends over one line (including possibly the `\|` or `>` character signaling an indented block), then we parse as Blocks. This was motivated by some German users finding that date: '22. Juin 2017' got parsed as an ordered list. Closes #3755.	2017-06-23 22:31:08 +02:00
John MacFarlane	2b34337a9c	Text.Pandoc.Extensions: Added `Ext_raw_attribute`. Documented in MANUAL.txt. This is enabled by default in pandoc markdown and multimarkdown.	2017-06-23 00:37:13 +02:00
John MacFarlane	6a077ac9c7	Fixed footnotes in table captions. Note that if the table has a first page header and a continuation page header, the notes will appear only on the first occurrence of the header. Closes #2378.	2017-06-20 11:21:32 +02:00
schrieveslaach	635f299b44	Merge branch 'master' into textcolor-support	2017-06-12 15:52:29 +02:00
John MacFarlane	8a000e3ecc	Markdown writer: don't allow soft break in header. Closes #3736.	2017-06-12 09:23:30 +02:00
John MacFarlane	b466152d61	Don't allow backslash + newline to affect block structure. Note that as a result of this change, the following, which formerly produced a header with two lines separated by a line break, will now produce a header followed by a paragraph: # Hi\ there This may affect some existing documents that relied on this undocumented and unintended behavior. This change makes pandoc more consistent with other Markdown implementations, and with itself (since the two-space version of a line break doesn't work inside ATX headers, and neither version works inside Setext headers). Closes #3730.	2017-06-11 22:24:20 +02:00
schrieveslaach	f36de77a25	Support for \faCheck and \faClose (#3727 )	2017-06-11 07:47:42 +02:00
John MacFarlane	8218bdb95c	HTML writer: Avoid two class attributes when adding 'uri' class. Closes #3716.	2017-06-01 18:41:54 +02:00
John MacFarlane	c366fab2cb	Markdown writer: Avoid inline surround-marking with empty content. E.g. we don't want `<strong></strong>` to become `****`. Similarly for emphasis, super/subscript, strikeout. Closes #3715.	2017-06-01 12:30:58 +02:00
John MacFarlane	9396f1fb67	LaTeX reader: handle some width specifiers on table columns. Currently we only handle the form `0.9\linewidth`. Anything else would have to be converted to a percentage, using some kind arbitrary assumptions about line widths. See #3709.	2017-06-01 12:08:28 +02:00
Marc Schreiber	181c56d400	Add \colorbox support	2017-06-01 09:50:51 +02:00
Albert Krewinkel	7852cd5603	Org reader: recognize babel result blocks with attributes Babel result blocks can have block attributes like captions and names. Result blocks with attributes were not recognized and were parsed as normal blocks without attributes. Fixes: #3706	2017-05-31 20:01:04 +02:00
John MacFarlane	5ec384eb60	LaTeX reader: handle escaped & inside table cell. Closes #3708.	2017-05-29 22:47:04 +02:00
John MacFarlane	8614902234	Markdown writer: changes to `--reference-links`. With `--reference-location` of `section` or `block`, pandoc will now repeat references that have been used in earlier sections. The Markdown reader has also been modified, so that exactly repeated references do not generate a warning, only references with the same label but different targets. The idea is that, with references after every block, one might want to repeat references sometimes. Closes #3701.	2017-05-27 23:18:45 +02:00
John MacFarlane	cb7b0a6985	Allow em for image height/width in HTML, LaTeX. - Export `inEm` from ImageSize [API change]. - Change `showFl` and `show` instance for `Dimension` so extra decimal places are omitted. - Added `Em` as a constructor of `Dimension` [API change]. - Allow `em`, `cm`, `in` to pass through without conversion in HTML, LaTeX. Closes #3450.	2017-05-25 22:48:27 +02:00
John MacFarlane	708973a33a	Added `spaced_reference_links` extension. This is now the default for pandoc's Markdown. It allows whitespace between the two parts of a reference link: e.g. [a] [b] [b]: url This is now forbidden by default. Closes #2602.	2017-05-25 12:57:31 +02:00
John MacFarlane	e34131502a	Update command tests to include stderr output.	2017-05-25 11:52:09 +02:00
John MacFarlane	41db9e826e	MediaWiki reader: don't do curly quotes inside `<tt>` contexts. Even if `+smart`. See #3585.	2017-05-25 09:35:25 +02:00
John MacFarlane	b9a30ef959	Markdown reader: fixed smart quotes after emphasis. E.g. in foo's 'foo' Closes #2228.	2017-05-24 23:23:08 +02:00
John MacFarlane	bc6aac7b47	Parsing: Provide parseFromString'. This is a verison of parseFromString specialied to ParserState, which resets stateLastStrPos at the end. This is almost always what we want. This fixes a bug where `_hi_` wasn't treated as emphasis in the following, because pandoc got confused about the position of the last word: - [o] _hi_ Closes #3690.	2017-05-24 22:41:47 +02:00
Marc Schreiber	b1d0c61f2d	Add another test to make sure that textcolor parsing is working in the inside of a paragraph	2017-05-23 17:36:27 -03:00
Marc Schreiber	29a4bdc681	Add suggestions of @jgm: parse bracketed stuff as inlines	2017-05-23 17:31:42 -03:00
keiichiro shikano	c0c54b7906	RST Reader: parse list table directive (#3688 ) Closes #3432.	2017-05-23 20:53:04 +02:00
Marc Schreiber	03cb05f4c6	Improve SVG image size code. The old code made some unwise assumptions about how the svg file would look. See #3580.	2017-05-20 23:09:08 +02:00
John MacFarlane	ca77f0a95e	RST writer: add empty comments when needed... to avoid including a blocquote in the indented content of a preceding block. Closes #3675.	2017-05-19 21:05:15 +02:00
John MacFarlane	818d5c2f35	Markdown: allow attributes in reference links to start on next line. This addresses a subsidiary issue in #3674.	2017-05-18 13:20:32 +02:00
John MacFarlane	7b3aaee15a	Markdown writer: Fixed duplicated reference links with `--reference-links` and `--reference-location=section`. Also ensure that there are no empty link references `[]`. Closes #3674.	2017-05-17 16:23:33 +02:00
Albert Krewinkel	af4bf91c59	Org reader: add basic file inclusion mechanism Support for the `#+INCLUDE:` file inclusion mechanism was added. Recognized include types are example, export, src, and normal org file inclusion. Advanced features like line numbers and level selection are not implemented yet. Closes: #3510	2017-05-14 12:45:31 +02:00
John MacFarlane	37189667cc	Textile reader: fix bug for certain links in table cells. Closes #3667.	2017-05-15 20:36:11 +02:00
Albert Krewinkel	4b9fb7a128	Combine grid table parsers The grid table parsers for markdown and rst was combined into one single parser, slightly changing parsing behavior of both parsers: - The markdown parser now compactifies block content cell-wise: pure text blocks in cells are now treated as paragraphs only if the cell contains multiple paragraphs, and as plain blocks otherwise. Before, this was true only for single-column tables. - The rst parser now accepts newlines and multiple blocks in header cells. Closes: #3638	2017-05-11 00:17:56 +02:00
John MacFarlane	82cc7fb0d4	Markdown reader: improved parsing of indented raw HTML blocks. Previously we inadvertently interpreted indented HTML as code blocks. This was a regression. We now seek to determine the indentation level of the contents of an HTML block, and (optionally) skip that much indentation. As a side effect, indentation may be stripped off of raw HTML blocks, if `markdown_in_html_blocks` is used. This is better than having things interpreted as indented code blocks. Closes #1841.	2017-05-06 22:56:16 +02:00
John MacFarlane	f20c89e243	LaTeX reader: Better handling of comments inside math environments. This solves a problem with commented out `\end{eqnarray}` inside an eqnarray (among other things). Closes #3113.	2017-05-06 22:16:43 +02:00
schrieveslaach	ddf2524477	Fix keyval funtion: pandoc did not parse options in braces correctly.… (#3642 ) * Fix keyval funtion: pandoc did not parse options in braces correctly. Additionally, dot, dash, and colon were no valid characters * Add \| as possible option value * Improved code	2017-05-06 15:09:29 +02:00
Albert Krewinkel	da8c153a68	Org reader: support macros Closes: #3401	2017-05-06 11:00:32 +02:00
Marc Schreiber	4ed6d91656	\textcolor will be parse as span at the beginning of a paragraph	2017-05-04 16:48:27 +02:00
Albert Krewinkel	57cba3f1d5	Org reader: support table.el tables Closes #3314	2017-05-03 22:43:34 +02:00
Marc Schreiber	1728d4e609	\textcolor works as inline and block command	2017-05-03 13:39:38 +02:00
Marc Schreiber	d9439808f2	Add block version of \textcolor	2017-05-03 12:00:30 +02:00
David A Roberts	79855ef934	Markdown writer: better escaping for links (#3628 ) Previously the Markdown writer would sometimes create links where there were none in the source. This is now avoided by selectively escaping bracket characters when they occur in a place where a link might be created. Closes #3619.	2017-05-03 12:19:45 +02:00
schrieveslaach	6e55e6837a	LaTeX reader: Add support for tabularx environment (#3632 )	2017-05-03 12:16:48 +02:00
Mauro Bieg	e02cfcdeac	Markdown Writer: put space before reference link definitions Fixes #3630 (#3631). Previously the attributes in link reference definitions did not have a space preceding.	2017-05-03 12:13:25 +02:00
Marc Schreiber	49336ee6ee	Add basic \textcolor support to LaTeX reader	2017-05-02 10:48:57 +02:00
David A Roberts	c0192132cf	Markdown writer: Case-insensitive reference links. (#3616 ) Ensure that we do not generate reference links whose labels differ only by case. Also allow implicit reference links when the link text and label are identical up to case. Closes #3615.	2017-05-02 09:00:37 +02:00
John MacFarlane	730796ee31	LaTeX writer: Fix problem with escaping in lstinline. Previously the LaTeX writer created invalid LaTeX when `--listings` was specified and a code span occured inside emphasis or another construction. This is because the characters `%{}\` must be escaped in lstinline when the listinline occurs in another command, otherwise they must not be escaped. To deal with this, adoping Michael Kofler's suggestion, we always wrap lstinline in a dummy command `\passthrough`, now defined in the default template if `--listings` is specified. This way we can consistently escape the special characters. Closes #1629.	2017-04-29 11:05:44 +02:00
John MacFarlane	e76b672414	LaTeX writer: don't use lstinline it \item[..]. If you do, the contents of item disappear or are misplaced. Use `\texttt` instead. Closes #645.	2017-04-28 12:03:59 +02:00
schrieveslaach	a29fa15a7b	LaTeX reader: Add basic support for hyphenat package (#3603 )	2017-04-26 12:05:13 +02:00
schrieveslaach	81548960d5	LaTeX reader: Add support for \vdots (#3607 )	2017-04-26 12:03:07 +02:00
John MacFarlane	ee160d7c4c	LaTeX writer: fix error with line breaks after empty content. LaTeX requires something before a line break, so we insert a `~` if no printable content has yet been emitted. Closes #2874.	2017-04-25 15:00:27 +02:00
John MacFarlane	d17f0dab84	LaTeX reader: better support for subfigure package. A figure with two subfigures turns into two pandoc figures; the subcaptions are used and the main caption ignored, unless there are no subcaptions. Closes #3577.	2017-04-24 23:39:14 +02:00
John MacFarlane	51a46b7e31	HTML reader: Revise treatment of li with id attribute. Previously we always added an empty div before the list item, but this created problems with spacing in tight lists. Now we do this: If the list item contents begin with a Plain block, we modify the Plain block by adding a Span around its contents. Otherwise, we add a Div around the contents of the list item (instead of adding an empty Div to the beginning, as before). Closes #3596.	2017-04-23 11:03:48 +02:00
schrieveslaach	020dc63e23	Add siunitx Support (#3588 ) For example: ```latex \SI[round-precision=2]{1}{m} is equal to \SI{1000}{mm}. \SI[round-precision=2]{1}[\$]{} is equal to \SI{0.938094}{\euro} ```	2017-04-22 21:57:21 +02:00
John MacFarlane	bcc848d773	Avoid parsing "Notes:*" as a bare URI. This avoids parsing bare URIs that start with a scheme + colon + ``, `_`, or `]`. Closes #3570.	2017-04-15 13:32:28 +02:00
John MacFarlane	31a36cf186	Man writer: Fix handling of nested font commands. Previously pandoc emitted incorrect markup for bold + italic, for example, or bold + code. Closes #3568.	2017-04-12 12:23:29 +02:00
John MacFarlane	12ae1df5bf	Allow raw latex commands starting with `\start` in Markdown. Previously these weren't allowed because they were interpreted as starting ConTeXt environments, even without a corresponding \stop... Closes #3558.	2017-04-06 11:30:03 +02:00
schrieveslaach	5fe734d452	lstinline with braces can be used (verb cannot be used with braces) (#3535 ) * Fix lstinline handling: lstinline with braces can be used (verb cannot be used with braces) * Use codeWith and determine the language from lstinline * Improve code * Add another test: convert lstinline without language option	2017-03-29 14:49:46 +02:00
schrieveslaach	49d72444d7	LaTeX reader: add support for LaTeX subfiles package. Closes #3530.	2017-03-27 21:20:27 +02:00
John MacFarlane	fddd6ffdd0	Add blank lines to #3531 command test.	2017-03-26 20:48:54 +02:00
John MacFarlane	358dfba8f4	MediaWiki writer: don't softbreak lines inside list items. Closes #3531.	2017-03-26 20:41:09 +02:00
John MacFarlane	438e8686cf	Markdown writer: don't emit a simple table if `simple_tables` disabled. Closes #3529.	2017-03-24 16:11:56 +01:00
John MacFarlane	a939cfe769	Pipe tables: impose minimum cell size. This might help with #3526. At any rate, it fixes another bug (see test/command/3526.md).	2017-03-23 16:54:47 +01:00
John MacFarlane	286b320fb0	Added to issue 3516 command test to debug test failure on appveyor.	2017-03-22 14:36:12 +01:00
John MacFarlane	430e2db9ba	Improve rendering of superscript in plain output. We now handle a few non digit characters (+, -, =, parentheses) for which there are superscripted unicode characters. Closes #3518.	2017-03-21 14:43:14 +01:00
John MacFarlane	daf8d1db18	RST writer: improve grid table output, fix bug with empty rows. Uses the new gridTable in Writers.Shared, which is here improved to better handle 0-width cells. Closes #3516.	2017-03-21 14:16:46 +01:00
John MacFarlane	48c88d566d	Add `space_in_atx_header` extension. This is enabled by default in pandoc and GitHub markdown but not the other flavors. This requirse a space between the opening #'s and the header text in ATX headers (as CommonMark does but many other implementations do not). This is desirable to avoid falsely capturing things ilke #hashtag or #5 Closes #3512.	2017-03-20 21:55:30 +01:00
John MacFarlane	fff3489bf3	Removed failing part of 3348 test. This was failing because of a small discrepancy in markdown table header line lengths on appveyor. It's a minor issue, I can't see what is causing it, and it's irrelevant to the issue this is testing, so we'll just write native for this test.	2017-03-19 20:37:39 +01:00
John MacFarlane	87f99f3fdf	HTML reader: Better sanity checks on raw HTML. This also affects the Markdown reader. Closes #3257.	2017-03-18 22:43:57 +01:00
John MacFarlane	435221a9f3	Added test case to 3348 to try to figure out why appveyor build fails.	2017-03-17 17:10:43 +01:00
John MacFarlane	8f90b83fee	Adjust command test 3348.md to specify column width. This is meant to address a test failure on appveyor.	2017-03-17 16:19:51 +01:00
John MacFarlane	090165d714	Added test for #256 .	2017-03-16 22:31:36 +01:00
John MacFarlane	6bf3f89d69	Better handling of \part in LaTeX. Closes #1905. Removed stateChapters from ParserState. Now we parse chapters as level 0 headers, and parts as level -1 headers. After parsing, we check for the lowest header level, and if it's less than 1 we bump everything up so that 1 is the lowest header level. So `\part` will always produce a header; no command-line options are needed.	2017-03-13 22:11:10 +01:00
John MacFarlane	c8b906256d	Improved behavior of `auto_identifiers` when there are explicit ids. Previously only autogenerated ids were added to the list of header identifiers in state, so explicit ids weren't taken into account when generating unique identifiers. Duplicated identifiers could result. This simple fix ensures that explicitly given identifiers are also taken into account. Fixes #1745. Note some limitations, however. An autogenerated identifier may still coincide with an explicit identifier that is given for a header later in the document, or with an identifier on a div, span, link, or image. Fixing this would be much more difficult, because we need to run `registerHeader` before we have the complete parse tree (so we can't get a complete list of identifiers from the document by walking the tree). However, it might be worth issuing warnings for duplicate header identifiers; I think we can do that. It is not common for headers to have the same text, and the issue can always be worked around by adding explicit identifiers, if the user is aware of it.	2017-03-12 21:30:04 +01:00
John MacFarlane	62becc1536	Changed test case labeled 3384.md to 3348.md. The last commit referred to #3384, but should have closed #3348.	2017-03-11 23:29:57 +01:00
John MacFarlane	d66b046c8a	Markdown writer: fixed bugs in simple/multiline list output. * Previously we got overlong lists with `--wrap=none`. This is fixed. * Previously a multiline list could become a simple list (and would always become one with `--wrap=none`). Closes #3384.	2017-03-11 23:24:14 +01:00
John MacFarlane	c46febaaee	Expand \newenvironment macros. Closes #987. Depends on still unreleased texmath 0.9.3.	2017-03-10 09:46:32 +01:00
Albert Krewinkel	c91f168fc9	Org reader: disallow tables on list marker lines Fixes: #3499	2017-03-08 15:45:00 +01:00
John MacFarlane	bcfb77e2ab	Markdown writer: Avoid spurious blanklines at end of document... after tables and list, for example.	2017-03-08 12:47:39 +01:00
John MacFarlane	b6e7bfaf1d	Markdown writer: ensure space before list at top level. Closes #3487.	2017-03-08 12:42:01 +01:00
John MacFarlane	410991ec6e	Org reader: don't allow tables inside list items. Closes #3499.	2017-03-08 12:28:13 +01:00
John MacFarlane	2c67101c7d	Added test case for #3497 .	2017-03-08 12:23:01 +01:00
John MacFarlane	8c55b7b564	Markdown reader: Treat certain environments as inline when they occur without space surrounding them. E.g. equation, math. This avoids incorrect vertical space around equations. Closes #3309. Closes #2171. See also rstudio/bookdown#358.	2017-03-07 15:00:32 +01:00
John MacFarlane	74afd2974a	Markdown writer: better handling of tables with empty columns. E.g. an HTML table with two cells in the first row and one in the second (but no row/colspan). We now calculate the number of columns based on the longest row (or the length of aligns or widths). Closes #3337.	2017-03-06 22:51:28 +01:00
John MacFarlane	9e87114234	LaTeX reader: allow newpage, clearpage, pagebreak in inline contexts as well as block contexts. Closes #3494.	2017-03-06 21:49:06 +01:00
John MacFarlane	e20f55618f	Markdown reader: fixed internal header links. Closes #2397. This patch also adds `shortcut_reference_links` to the list of mmd extensions.	2017-03-05 16:34:47 +01:00
John MacFarlane	2fee07795c	Added a markdown abbrevation test case.	2017-03-05 10:44:25 +01:00
John MacFarlane	7fc6919f90	Markdown reader: Fixed regression on left-biased union for metadata. When multiple YAML metadata blocks are used, and two define the same field, the value defined first takes precedence, according to the manual. This was changed briefly in `ba3ee62323`. This commit reverts to the original behavior and adds a test case.	2017-03-05 09:28:44 +01:00
John MacFarlane	ba3ee62323	Parse YAML metadata in a context that sees footnotes... defined in the body of the document. Closes #1279.	2017-03-05 01:36:40 +01:00
John MacFarlane	0517cf0bc0	Fixed some loose ends in #1592 . Added test cases. Fixed HTML reader to parse a span with class "smallcaps" as SmallCaps. Fixed Markdown writer to render SmallCaps as a native span when native spans are enabled.	2017-03-04 23:01:29 +01:00
John MacFarlane	ce9d49ef04	OpenDocument writer: fixed dropped elements in some ordered lists. Closes #2434.	2017-03-03 22:48:37 +01:00
John MacFarlane	fb47d1d909	RST reader: support RST-style citations. The citations appear at the end of the document as a definition list in a special div with id `citations`. Citations link to the definitions. Added stateCitations to ParserState. Closes #853.	2017-03-03 22:23:01 +01:00
John MacFarlane	4d25bba5f7	RST reader: Handle multiline cells in simple tables. Closes #1166.	2017-03-02 16:48:53 +01:00
John MacFarlane	ea619bfcb4	Markdown writer: Fixed grid tables embedded in grid tables. Closes #2834.	2017-03-01 17:41:14 +01:00
John MacFarlane	d1b50a6c5d	RST reader: implemented implicit internal header links. Cloess #3475.	2017-02-28 10:32:36 +01:00
John MacFarlane	99b39ffc17	RST reader: support scale and align attributes of images. Closes #2662.	2017-02-26 23:40:31 +01:00
John MacFarlane	65c4efeb59	Added test case for variables/metadata in Markdown writer.	2017-02-25 23:54:30 +01:00
John MacFarlane	7d0082aa0b	LaTeX reader: allow hspace and vspace to count as raw block or inline. Previously we would refuse to parse anything as raw inline if it was in the blockCommands list. Now we allow exceptions if they're listed under ignoreInlines in inlineCommands. This should make it easier e.g. to include an \hspace between two side-by-side raw LaTeX tables.	2017-02-25 12:43:00 +01:00
John MacFarlane	f4a452f891	When parsing raw LaTeX commands, include trailing space. Otherwise things like `\noindent foo` break and turn into `\noindentfoo`. Affects `-f latex+raw_tex` and `-f markdown` (and other formats that allow `raw_tex`). Closes #1773.	2017-02-22 21:15:25 +01:00
John MacFarlane	5d71e37f26	MediaWiki reader: ensure that list starts begin at left margin. Including when they're in tables or other list items. Closes #2606.	2017-02-21 23:41:32 +01:00
John MacFarlane	5269724ad3	MediaWiki reader: fixed more table issues. Closes #2649.	2017-02-21 21:28:24 +01:00
John MacFarlane	575014975e	Fix indirect hyperlink targets. Closes #512 .	2017-02-15 17:36:16 +01:00
John MacFarlane	cfdbe85e71	LaTeX reader: properly handle column prefixes/suffixes. For example, in \begin{tabular}{>{$}l<{$}>{$}l<{$} >{$}l<{$}} each cell will be interpreted as if it has a `$` before its content and a `$` after (math mode).	2017-02-13 22:39:59 +01:00
John MacFarlane	1a23bc65b8	Fixed small bug in RST list parsing. See #3432. Previously the parser didn't handle properly this case: * - a - b * - c - d	2017-02-11 20:55:13 +01:00
John MacFarlane	47a16065c4	Removed --parse-raw and readerParseRaw. These were confusing. Now we rely on the +raw_tex or +raw_html extension with latex or html input. Thus, instead of --parse-raw -f latex we use -f latex+raw_tex and instead of --parse-raw -f html we use -f html+raw_html	2017-02-06 23:33:23 +01:00
John MacFarlane	c93ecfc3c5	Handle language in inline code with --listings. Closes #3422.	2017-02-05 22:22:42 +01:00
John MacFarlane	396d304167	More smart escaping tests.	2017-02-04 22:09:19 +01:00
John MacFarlane	ce9ec67970	Added first command test to cabal metadata and repo.	2017-02-04 21:56:32 +01:00

... 6 7 8 9 10 ...

716 commits