pandoc

Author	SHA1	Message	Date
John MacFarlane	5aa73bd0a2	LaTeX reader: handle table cells containing `&` in `\verb`. Closes #7129.	2021-03-07 15:49:02 -08:00
John MacFarlane	a9cc5d2616	Update tests for changes to https URLs.	2021-02-26 18:00:45 -08:00
John MacFarlane	f0a991a22b	T.P.CSV: fix parsing of unquoted values. Previously we didn't allow unescaped quotes in unquoted values, but they are allowed. Closes #7112.	2021-02-22 21:18:04 -08:00
John MacFarlane	d84a6041e1	HTML reader: fix bad handling of empty src attribute in iframe. - If src is empty, we simply skip the iframe. - If src is invalid or cannot be fetched, we issue a warning and skip instead of failing with an error. - Closes #7099.	2021-02-13 13:08:34 -08:00
John MacFarlane	6e73273916	T.P.Error: export `renderError`. Refactor `handleError` to use `renderError`. This allows us render error messages without exiting.	2021-02-13 13:08:34 -08:00
John MacFarlane	3be066b7d3	Fix command test 5686	2021-02-12 19:04:14 -08:00
John MacFarlane	59875185b3	Add command test for #7092	2021-02-12 19:04:14 -08:00
John MacFarlane	8ca191604d	Add new unexported module T.P.XMLParser. This exports functions that uses xml-conduit's parser to produce an xml-light Element or [Content]. This allows existing pandoc code to use a better parser without much modification. The new parser is used in all places where xml-light's parser was previously used. Benchmarks show a significant performance improvement in parsing XML-based formats (especially ODT and FB2). Note that the xml-light types use String, so the conversion from xml-conduit types involves a lot of extra allocation. It would be desirable to avoid that in the future by gradually switching to using xml-conduit directly. This can be done module by module. The new parser also reports errors, which we report when possible. A new constructor PandocXMLError has been added to PandocError in T.P.Error [API change]. Closes #7091, which was the main stimulus. These changes revealed the need for some changes in the tests. The docbook-reader.docbook test lacked definitions for the entities it used; these have been added. And the docx golden tests have been updated, because the new parser does not preserve the order of attributes. Add entity defs to docbook-reader.docbook. Update golden tests for docx.	2021-02-10 22:04:11 -08:00
John MacFarlane	8e9131db4e	Markdown reader: improved handling of mmd link attributes in references. Previously they only worked for links that had titles. Closes #7080.	2021-02-06 21:52:12 -08:00
John MacFarlane	02d3c71e72	BibTeX writer: use doclayout and doctemplate. This change allows bibtex/biblatex output to wrap as other formats do, depending on the settings of `--wrap` and `--columns`. It also introduces default templates for bibtex and biblatex, which allow for using the variables `header-include`, `include-before` or `include-after` (or alternatively the command line options `--include-in-header`, `--include-before-body`, `--include-after-body`) to insert content into the generated bibtex/biblatex. This change requires a change in the return type of the unexported `T.P.Citeproc.writeBibTeXString` from `Text` to `Doc Text`. Closes #7068.	2021-02-01 18:05:20 -08:00
John MacFarlane	b239c89a82	BibTeX writer fixes. Closes #7067 . + Require citeproc 0.3.0.7, which correctly titlecases when titles contain non-ASCII characters. + Correctly handle 'pages' (= 'page' in CSL). + Correctly handle BibLaTeX 'langid' (= 'language' in CSL). + In BibTeX output, protect foreign titles since there's no language field.	2021-02-01 11:23:07 -08:00
John MacFarlane	d1875b69ec	RST reader: fix handling of header in CSV tables. The interpretation of this line is not affected by the delim option. Closes #7064.	2021-01-31 12:05:46 -08:00
John MacFarlane	9223788a05	Markdown writer: handle math right before digit. We insert an HTML comment to avoid a `$` right before a digit, which pandoc will not recognize as a math delimiter.	2021-01-29 18:29:17 -08:00
John MacFarlane	98c2a52b4e	Clean up BibTeX parsing. Previously there was a messy code path that gave strange results in some cases, not passing through raw tex but trying to extract a string content. This was an artefact of trying to handle some special bibtex-specific commands in the BibTeX reader. Now we just handle these in the LaTeX reader and simplify parsing in the BibTeX reader. This does mean that more raw tex will be passed through (and currently this is not sensitive to the `raw_tex` extension; this should be fixed). Closes #7049.	2021-01-26 22:45:57 -08:00
John MacFarlane	198ce0cde9	ImageSize: use viewBox for svg if no length, width. This change allows pandoc to extract size information from more SVGs. Closes #7045.	2021-01-22 20:49:41 -08:00
Albert Krewinkel	b4b3560191	JATS writer: allow to use element-citation	2021-01-22 19:35:08 +01:00
John MacFarlane	3d6ebc9051	Use dev version of citeproc. Change a citation test which had wrong disambiguation (see jgm/citeproc#44).	2021-01-15 11:51:07 -08:00
Albert Krewinkel	68fa437999	JATS writer: fix citations (#7018 ) * JATS writer: keep code lines at 80 chars or below * JATS writer: fix citations	2021-01-10 15:35:48 -08:00
John MacFarlane	327e1428c5	gfm/commonmark writer: implement start number on ordered lists. Previously they always started at 1, but according to the spec the start number is respected. Closes #7009.	2021-01-07 16:42:05 -08:00
John MacFarlane	c0d8b186d1	T.P.Parsing: modify gridTableWith' for headerless tables. If the table lacks a header, the header row should be an empty list. Previously we got a list of empty cells, which caused an empty header to be emitted instead of no header. In LaTeX/PDF output that meant we got a double top line with space between. @tarleb @despres - please let me know if this is problematic for some reason I'm not grasping.	2021-01-07 11:07:03 -08:00
John MacFarlane	15ba184e6e	HTML writer: fix implicit_figure at end of footnotes. Closes #7006.	2021-01-05 12:07:02 -08:00
David Martschenko	385b6a3b21	Implement defaults file inheritance (#6924 ) Allow defaults files to inherit options from other defaults files by specifying them with the following syntax: `defaults: [list of defaults files or single defaults file]`.	2021-01-05 10:15:59 -08:00
John MacFarlane	ea479bf28a	LaTeX reader: handle filecontents environment. Closes #7003.	2021-01-04 14:05:03 -08:00
John MacFarlane	9a18cf4b59	LaTeX writer: revert table line height increase in 2.11.3. In 2.11.3 we started adding `\addlinespace`, which produced less dense tables. This wasn't an intentional change; I misunderstood a comment in the discussion leading up to the change. This commit restores the earlier default table appearance. Note that if you want a less dense table, you can use something like `\def\arraystretch{1.5}` in your header. Closes #6996.	2021-01-02 07:56:07 -08:00
John MacFarlane	23f964b907	Mediawiki reader: allow space around storng/emph delimiters. Closes #6993.	2020-12-30 21:31:28 -08:00
John MacFarlane	a7a162ea55	Update test for new citeproc and require it in cabal.	2020-12-28 14:40:23 -08:00
timo-a	668596cc89	Add support for writing nested tables to asciidoc (#6972 ) Added field to WriterState that denotes the current nesting level for traversing tables. Depending on the value of that field nested tables are recognized and written. Asciidoc supports one level of nesting. If deeper tables are to be written, they are omitted and a warning is issued.	2020-12-27 18:42:28 -08:00
John MacFarlane	95b15fe6d3	Remove some test files that are no longer used.	2020-12-18 11:22:29 -08:00
John MacFarlane	b4b4e32307	Properly handle boolean values in writing YAML metadata. (Markdown writer.) This requires doctemplates >= 0.9. Closes #6388.	2020-12-15 23:45:34 -08:00
John MacFarlane	7d799bfcda	Allow both inline and external references to be used with `--citeproc`. This fixes a regression, since pandoc-citeproc allowed these to be combined. Closes #6951.	2020-12-15 08:51:43 -08:00
John MacFarlane	c43e2dc0f4	RST writer: better image handling. - An image alone in its paragraph (but not a figure) is now rendered as an independent image, with an `alt` attribute if a description is supplied. - An inline image that is not alone in its paragraph will be rendered, as before, using a substitution. Such an image cannot have a "center", "left", or "right" alignment, so the classes `align-center`, `align-left`, or `align-right` are ignored. However, `align-top`, `align-middle`, `align-bottom` will generate a corresponding `align` attribute. Closes #6948.	2020-12-13 15:25:46 -08:00
mb21	208cb96196	ICML writer: fix image bounding box for custom widths/heights fixes #6936	2020-12-12 14:49:11 +01:00
John MacFarlane	0a502e5ff5	HTML reader: retain attribute prefixes and avoid duplicates. Previously we stripped attribute prefixes, reading `xml:lang` as `lang` for example. This resulted in two duplicate `lang` attributes when `xml:lang` and `lang` were both used. This commit causes the prefixes to be retained, and also avoids invald duplicate attributes. Closes #6938.	2020-12-10 15:44:10 -08:00
Nils Carlson	c161893f44	OpenDocument writer: Allow references for internal links (#6774 ) This commit adds two extensions to the OpenDocument writer, `xrefs_name` and `xrefs_number`. Links to headings, figures and tables inside the document are substituted with cross-references that will use the name or caption of the referenced item for `xrefs_name` or the number for `xrefs_number`. For the `xrefs_number` to be useful heading numbers must be enabled in the generated document and table and figure captions must be enabled using for example the `native_numbering` extension. In order for numbers and reference text to be updated the generated document must be refreshed. Co-authored-by: Nils Carlson <nils.carlson@ludd.ltu.se>	2020-12-05 10:00:04 -08:00
John MacFarlane	ddb76cb356	LaTeX reader: don't apply theorem default styling to a figure inside. If we put an image in italics, then when rendering to Markdown we no longer get an implicit figure. Closes #6925.	2020-12-05 09:53:39 -08:00
John MacFarlane	dc3ef5201f	Markdown writer: ensure that a new csl-block begins on a new line. This just looks better and doesn't affect the semantics. See #6921.	2020-12-04 10:55:48 -08:00
John MacFarlane	7199d68ba0	EPUB writer: include title page in landmarks. Closes #6919. Note that the toc is also included if `--toc` is specified.	2020-12-03 21:39:44 -08:00
Albert Krewinkel	a9c766291f	HTML reader: support body headers, row head columns Closes: #6312	2020-11-27 10:36:13 +01:00
Igor Pashev	630b1bff2b	LaTeX reader: preserve center environment (#6852 ) The contents of the `center` environment are put in a `Div` with class `center`.	2020-11-26 12:04:31 -08:00
John MacFarlane	70a7c2446e	Update tests for LaTeX table changes.	2020-11-25 15:49:17 -08:00
John MacFarlane	815976d537	Fix truncation of `[Citation]` list in `Cite` inside footnotes... This affected author-in-text citations in footnotes. It didn't cause problems for the printed output, but for filters that expected the citation id and other information. Closes #6890.	2020-11-25 09:10:10 -08:00
Albert Krewinkel	c9f98e2bf5	HTML reader: support row or column-spanning table cells	2020-11-24 14:17:35 +01:00
Nils Carlson	75c881e2d9	OpenDocument Writer: Implement Div and Span ident support (#6755 ) Spans and Divs containing an ident in the Attr will become bookmarks or sections with idents in OpenDocument format.	2020-11-22 22:23:30 -08:00
John MacFarlane	b5b5ef92cb	LaTeX writer: Improve table spacing. + Remove the `\strut` that was added at the end of minipage environments in cells. + Replace `\tabularnewline` with `\\ \addlinespace`. Closes #6842, closes #6860.	2020-11-22 10:54:42 -08:00
Nils Carlson	ae52918faa	OpenDocument writer: Table text width support (#6792 ) Support for table width as a percentage of text width by summing width of columns and verifying that the sum is > 0 and <= 1.	2020-11-21 12:42:43 -08:00
John MacFarlane	7db2cf5d2f	LaTeX reader: more robust parsing of bracketed options. Improves on `9a40976`. Closes #6873.	2020-11-21 12:24:37 -08:00
Nils Carlson	56ceaf49dc	DocBook reader: Table text width support (#6791 ) Table width in relation to text width is not natively supported by docbook but is by the docbook fo stylesheets through an XML processing instruction, <?dbfo table-width="50%"?> . Implement support for this instruction in the DocBook reader.	2020-11-20 16:05:56 -08:00
John MacFarlane	9a4097640f	Improve LaTeX option parsing... in cases where we run into trouble parsing inlines til the closing `]`, e.g. quotes, we return a plain string with the option contents. Previously we mistakenly included the brackets in this string. Closes #6869.	2020-11-20 13:40:26 -08:00
John MacFarlane	c647948ff1	`commonmark_x`: replace `auto_identifiers` with `gfm_auto_identifiers`. `commonmark_x` never actually supported `auto_identifiers` (it didn't do anything), because the underlying library implements gfm-style identifiers only. Attempts to add the `autolink_identifiers` extension to `commonmark` will now fail with an error. Closes #6863.	2020-11-20 09:17:14 -08:00
John MacFarlane	0962b30d84	Man reader: improve handling of .IP. We now better handle `.IP` when it is used with non-bullet, non-numbered lists, creating a definition list. We also skip blank lines like groff itself. Closes #6858.	2020-11-18 22:44:32 -08:00
TEC	0306eec5fa	Replace org #+KEYWORDS with #+keywords As of ~2 years ago, lower case keywords became the standard (though they are handled case insensitive, as always): `13424336a6` Upper case keywords are exclusive to the manual: - https://orgmode.org/list/871s50zn6p.fsf@nicolasgoaziou.fr/ - https://orgmode.org/list/87tuuw3n15.fsf@nicolasgoaziou.fr/	2020-11-18 14:48:56 +01:00
John MacFarlane	bf3fea0a8c	Markdown reader: fix regression with example list references. This affects example list references followed by dashes. Introduced by commit `b8d17f7`. Closes #6855.	2020-11-17 20:36:59 -08:00
John MacFarlane	5271c6b3fb	Improve fix to siunitx numbers with minus. - use real minus sign - use tests contributed by Igor Pashev.	2020-11-16 16:36:16 -08:00
John MacFarlane	734b4c26a9	LaTeX reader: Fix negative numbers in siunitx commands. The commit `a157e1a` broke negative numbers, e.g. `\SI{-33}{\celcius}` or `\num{-3}`. This fixes the regression.	2020-11-16 14:08:29 -08:00
John MacFarlane	d7f905fb63	Markdown reader: fix detection of locators following in-text citations. Prevously, if we had `@foo [p. 33; @bar]`, the `p. 33` would be incorrectly parsed as a prefix of `@bar` rather than a suffix of `@foo`.	2020-11-15 17:51:03 -08:00
Aner Lucero	f63b76e169	Markdown writer: default to using ATX headings. Previously we used Setext (underlined) headings by default. The default is now ATX (`##` style). * Add the `--markdown-headings=atx\|setext` option. * Deprecate `--atx-headers`. * Add constructor 'ATXHeadingInLHS` constructor to `LogMessage` [API change]. * Support `markdown-headings` in defaults files. * Document new options in MANUAL. Closes #6662.	2020-11-14 21:33:32 -08:00
John MacFarlane	b8d17f7ae8	Markdown reader: don't increment stateNoteNumber for example refs. Background: syntactically, references to example list items can't be distinguished from citations; we only know which they are after we've parsed the whole document (and this is resolved in the `runF` stage). This means that pandoc's calculation of `citationNoteNum` can sometimes be wrong when there are example list references. This commit partially addresses #6836, but only for the case where the example list references refer to list items defined previously in the document.	2020-11-14 15:00:17 -08:00
John MacFarlane	68b298ed9a	Improve period suppression algorithm for citations in notes... in note citation styles. See #6835.	2020-11-13 10:52:21 -08:00
John MacFarlane	efe74746d8	DokuWiki writer: translate language names for code elements... ...and improve whitespace. Closes #6807.	2020-11-04 22:38:53 -08:00
John MacFarlane	8f75a53542	Properly support optional cite argument for `\blockquote`. (LaTeX reader) Closes #6802.	2020-11-03 10:25:56 -08:00
John MacFarlane	6cbe5efd56	LaTeX reader: fix bug parsing macro arguments. If `\cL` is defined as `\mathcal{L}`, and `\til` as `\tilde{#1}`, then `\til\cL` should expand to `\tilde{\mathcal{L}}`, but pandoc was expanding it to `\tilde\mathcal{L}`. This is fixed by parsing the arguments in "verbatim mode" when the macro expands arguments at the point of use. Closes #6796.	2020-11-02 15:04:16 -08:00
John MacFarlane	6051c751ce	Citeproc: use comma for in-text citations inside footnotes. When an author-in-text citation like `@foo` occurs in a footnote, we now render it with: `AUTHOR NAME + COMMA + SPACE + REST`. Previously we rendered: `AUTHOR NAME + SPACE + "(" + REST + ")"`. This gives better results. Note that normal citations are still rendered in parentheses.	2020-11-01 10:48:47 -08:00
John MacFarlane	3e6d009c6b	Use new citeproc; do note capitalization here, not in citeproc.	2020-10-29 21:53:02 -07:00
John MacFarlane	bd7c9eb32b	LaTeX writer: Improved calculation of table column widths. We now have LaTeX do the calculation, using `\tabcolsep`. So we should now have accurate relative column widths no matter what the text width. The default template has been modified to load the calc package if tables are used.	2020-10-29 12:10:05 -07:00
John MacFarlane	517c55dae7	Use latest citeproc. Closes #6783 .	2020-10-27 22:21:03 -07:00
Nils Carlson	dd3d920ba0	DocBook Reader: fix duplicate bibliography bug (#6773 ) Also add unit test to ensure the behavior stays consistent.	2020-10-26 12:49:03 -07:00
John MacFarlane	efc6994c8a	Commonmark writer: fix regression with fenced divs. Starting with 2.10.1, fenced divs no longer render with HTML div tags in commonmark output. This is a regression due to our transition from cmark-gfm. This commit fixes it. Closes #6768.	2020-10-23 09:25:07 -07:00
John MacFarlane	b876793910	Use latest citeproc. This fixes a problem with author-in-text citations for references including both an author and an editor. Previously, both were included in the text, but only the author should be. Closes #6765. Added a test.	2020-10-21 23:14:17 -07:00
John MacFarlane	f9c6167ad1	citeproc - improved removal of final period... ...in citations inside notes in note-based styles. These citations are put in parentheses, but the final period must be removed. See jgm/citeproc#20	2020-10-21 22:23:21 -07:00
John MacFarlane	eb3307da4e	Fix handling of xdata in bibtex/biblatex bibliographies. Closes #6752.	2020-10-15 17:41:45 -07:00
Albert Krewinkel	90af138443	Fix typos in comments, doc strings, error messages, and tests Typos reported by https://fossies.org/linux/test/pandoc-master.tar.gz/codespell.html See: #6738	2020-10-14 22:26:51 +02:00
John MacFarlane	229e763646	Depend on latest citeproc. This fixes the citation number issue with ieee.csl and other styles that do not explicitly sort bibliographies. (Pandoc was numbering them by their order in the bibliography file, rather than the order cited, as required by the CSL spec.) Closes #6741.	2020-10-13 14:52:09 -07:00
John MacFarlane	6fce81fb61	Use latest citeproc (better grouping/collapsing behavior with prefixes).	2020-10-13 11:06:02 -07:00
John MacFarlane	12ff835a8a	Commonmark reader: add pipe_table extension after defaults. Otherwise we get bad results for non-table, non-paragraph lines containing pipe characters. Closes #6739. See also jgm/commonmark-hs#52.	2020-10-12 21:24:26 -07:00
John MacFarlane	2007cff203	Markdown writer: Fix autolinks rendering for gfm. Previously, autolinks rendered as raw HTML, due to the `class="uri"` added by pandoc's markdown reader. Closes #6740.	2020-10-12 18:57:04 -07:00
John MacFarlane	0b5e2601f5	LaTeX reader: allow blank lines inside `\author`.	2020-10-10 16:28:52 -07:00
John MacFarlane	9a6c42590f	LaTeX reader: Fix parsing of "show name" in newtheorem. Previously we were just treating it as a string and ignoring accents and formatting. See #6734.	2020-10-08 22:47:32 -07:00
John MacFarlane	2d4214fa31	Extend fix to #6719 to JATS reader	2020-10-08 21:36:08 -07:00
John MacFarlane	f19286cf12	DocBook reader: don't squelch space at end of emphasis element. Instead, include it after the emphasis. Closes #6719. Same fix was made for other inline elements, e.g. strikethrough.	2020-10-08 21:27:52 -07:00
John MacFarlane	0cfba4e36e	Fixed some bibtex comments in tests (closing }).	2020-10-08 20:42:36 -07:00
John MacFarlane	641849b70a	Be less aggressive about using quotes for YAML values. We need quotes if `[` or `{` or `'` is at the beginning of the line, but not otherwise.	2020-10-08 10:54:53 -07:00
John MacFarlane	1be0f0fba8	Use double quotes for YAML metadata. Closes #6727.	2020-10-07 23:05:51 -07:00
John MacFarlane	1742821c3e	Fix URL prefixes in citations also when they occur in notes. Update chicago-fullnote-bibliography.csl and adjust tests. Closes #6723.	2020-10-07 11:23:28 -07:00
John MacFarlane	fd3809c33f	Unescape entities in writing CSL JSON. The renderCslJson function escapes `<`, `>`, and `&` as entities. This is appropriate when generating HTML, but in CSL JSON these are supposed to appear unescaped. Closes jgm/citeproc#17.	2020-10-06 22:29:25 -07:00
John MacFarlane	7d54e79091	Use latest citeproc. Update chicago-fullnute-bibliography test, which is now correct.	2020-10-03 16:07:55 -07:00
John MacFarlane	27b4c21f72	Update to lastest citeproc	2020-10-01 22:07:55 -07:00
Nils Carlson	ae4dcc0d4a	OpenDocument Writer: Implement table cell alignment (#6700 ) Co-authored-by: Mauro Bieg <mb21@users.noreply.github.com>	2020-09-27 11:21:53 -07:00
John MacFarlane	a822067903	Fix short-title. We were getting null short-titles generated, and that was creating wrong citations in some cases. Close #6702.	2020-09-26 14:28:28 -07:00
John MacFarlane	188c444990	RST reader: apply `.. class::` directly to following Header. rather than creating a surrounding Div. Closes #6699.	2020-09-25 09:06:15 -07:00
Nils Carlson	1ad7a047d5	DocBook reader: Implement table cell alignment (#6698 )	2020-09-24 17:43:43 -07:00
John MacFarlane	810ea6fdf8	Citeproc: Insert space after csl-left-margin span contents... if they come before csl-right-inline. This ensures that the citation number or label will be separated from the rest by a space, even in formats (like plain) that don't yet have special handling for the display spans.	2020-09-24 09:57:55 -07:00
Nils Carlson	4f13c0e25e	OpenDocument writer: New table cell support with row and column spans (#6682 ) Unit tests only verify column spans at this point. Co-authored-by: Nils Carlson <nils.carlson@ludd.ltu.se>	2020-09-24 09:31:47 -07:00
John MacFarlane	e0984a43a9	Add built-in citation support using new citeproc library. This deprecates the use of the external pandoc-citeproc filter; citation processing is now built in to pandoc. * Add dependency on citeproc library. * Add Text.Pandoc.Citeproc module (and some associated unexported modules under Text.Pandoc.Citeproc). Exports `processCitations`. [API change] * Add data files needed for Text.Pandoc.Citeproc: default.csl in the data directory, and a citeproc directory that is just used at compile-time. Note that we've added file-embed as a mandatory rather than a conditional depedency, because of the biblatex localization files. We might eventually want to use readDataFile for this, but it would take some code reorganization. * Text.Pandoc.Loging: Add `CiteprocWarning` to `LogMessage` and use it in `processCitations`. [API change] * Add tests from the pandoc-citeproc package as command tests (including some tests pandoc-citeproc did not pass). * Remove instructions for building pandoc-citeproc from CI and release binary build instructions. We will no longer distribute pandoc-citeproc. * Markdown reader: tweak abbreviation support. Don't insert a nonbreaking space after a potential abbreviation if it comes right before a note or citation. This messes up several things, including citeproc's moving of note citations. * Add `csljson` as and input and output format. This allows pandoc to convert between `csljson` and other bibliography formats, and to generate formatted versions of CSL JSON bibliographies. * Add module Text.Pandoc.Writers.CslJson, exporting `writeCslJson`. [API change] * Add module Text.Pandoc.Readers.CslJson, exporting `readCslJson`. [API change] * Added `bibtex`, `biblatex` as input formats. This allows pandoc to convert between BibLaTeX and BibTeX and other bibliography formats, and to generated formatted versions of BibTeX/BibLaTeX bibliographies. * Add module Text.Pandoc.Readers.BibTeX, exporting `readBibTeX` and `readBibLaTeX`. [API change] * Make "standalone" implicit if output format is a bibliography format. This is needed because pandoc readers for bibliography formats put the bibliographic information in the `references` field of metadata; and unless standalone is specified, metadata gets ignored. (TODO: This needs improvement. We should trigger standalone for the reader when the input format is bibliographic, and for the writer when the output format is markdown.) * Carry over `citationNoteNum` to `citationNoteNumber`. This was just ignored in pandoc-citeproc. * Text.Pandoc.Filter: Add `CiteprocFilter` constructor to Filter. [API change] This runs the processCitations transformation. We need to treat it like a filter so it can be placed in the sequence of filter runs (after some, before others). In FromYAML, this is parsed from `citeproc` or `{type: citeproc}`, so this special filter may be specified either way in a defaults file (or by `citeproc: true`, though this gives no control of positioning relative to other filters). TODO: we need to add something to the manual section on defaults files for this. * Add deprecation warning if `upandoc-citeproc` filter is used. * Add `--citeproc/-C` option to trigger citation processing. This behaves like a filter and will be positioned relative to filters as they appear on the command line. * Rewrote the manual on citatations, adding a dedicated Citations section which also includes some information formerly found in the pandoc-citeproc man page. * Look for CSL styles in the `csl` subdirectory of the pandoc user data directory. This changes the old pandoc-citeproc behavior, which looked in `~/.csl`. Users can simply symlink `~/.csl` to the `csl` subdirectory of their pandoc user data directory if they want the old behavior. * Add support for CSL bibliography entry formatting to LaTeX, HTML, Ms writers. Added CSL-related CSS to styles.html.	2020-09-21 10:15:50 -07:00
John MacFarlane	a59ae96062	Markdown reader: Set citationNoteNum accurately in citations. This also changes stateLastNoteNumber -> stateNoteNumber.	2020-09-21 10:10:37 -07:00
John MacFarlane	a26ec96d89	LaTeX writer: fix spacing issue with list in definition list. When a list occurs at the beginning of a definition list definition, it can start on the same line as the label, which looks bad. Fix that by starting such lists with an `\item[]`.	2020-09-15 17:59:03 -07:00
Leonard Rosenthol	55e5ad2d8f	Changed default link state to invisible (#6676 )	2020-09-10 22:58:53 -07:00
John MacFarlane	623ce89e0e	Improved uncertainty handling in slunitx.	2020-09-10 14:48:35 -07:00
John MacFarlane	a03160fb0d	LaTeX reader: support parenthesized uncertainties in siunitx.	2020-09-10 13:07:31 -07:00
Leonard Rosenthol	ef4f514359	Implement support for internal document links in ICML (#6606 ) Closes #5541.	2020-09-10 09:40:35 -07:00
Nils Carlson	96a0f3c7af	docbook reader: Implement column span support for tables (#6492 ) Implement column span support for tables in the DocBook reader. Co-authored-by: Nils Carlson <nils.carlson@ludd.ltu.se>	2020-09-10 09:11:52 -07:00
John MacFarlane	529eb696dc	LaTeX reader: Support squared, cubed, tothe in siunitx. Closes #6657.	2020-09-02 11:06:26 -07:00
Albert Krewinkel	3c07b1d9b6	Fix tests for skylighting 0.10	2020-08-31 21:22:03 +02:00
Emerson Harkin	6cfb31bbe2	Change SIRange to SIrange (#6617 )	2020-08-14 11:30:17 -07:00
Emerson Harkin	1b8f161198	Minimal support for SIRange in LaTeX reader (#6418 ) Add support for `\SIRange{firstnumber}{secondnumber}{unit}` provided by siunitx. An en-dash is used instead of localized "to".	2020-07-23 16:47:32 -07:00
John MacFarlane	a0e3172a0b	Further improvements to ams theorem support, and a test. See #1608.	2020-07-23 11:11:28 -07:00
John MacFarlane	942e3ee1f9	RST reader: fix csv tables with multiline cells. Closes #6549.	2020-07-21 10:20:15 -07:00
John MacFarlane	d6b7b1dc77	Remove use of cmark-gfm for commonmark/gfm rendering. Instead rely on the markdown writer with appropriate extensions. Export writeCommonMark variant from Markdown writer. This changes a few small things in rendering markdown, e.g. w/r/t requiring backslashes before spaces inside super/subscripts.	2020-07-19 22:51:59 -07:00
Albert Krewinkel	b894de6426	HTML writer: improve alt-text/caption handling for HTML5 Screen readers read an image's `alt` attribute and the figure caption, both of which come from the same source in pandoc. The figure caption is hidden from screen readers with the `aria-hidden` attribute. This improves accessibility. For HTML4, where `aria-hidden` is not allowed, pandoc still uses an empty `alt` attribute to avoid duplicate contents. Closes: #6491	2020-07-01 14:54:52 +02:00
John MacFarlane	9b7282bb0f	LaTeX reader: Retain the Div around tables with attributes. We'll need this to store table attributes until all writers are adjusted to react to attributes on the Table element.	2020-06-23 11:12:40 -07:00
John MacFarlane	90b2c5a5e4	Add test for #6481 .	2020-06-23 08:27:19 -07:00
John MacFarlane	f6dfacf9d6	Add "summary" to list of block-level HTML tags. Closes #6385. (The summary element needs to be the first child of details and should not be enclosed by p tags.) NOTE: you need to include a blank line before the closing `</details>`, if you want the last part of the content to be parsed as a paragraph.	2020-05-20 07:45:14 -07:00
Lila	f4185fcef0	Use CSS in favor of <br> for display math (#6372 ) Some CSS to ensure that display math is displayed centered and on a new line is now included in the default HTML-based templates; this may be overridden if the user wants a different behavior.	2020-05-18 22:45:44 -07:00
John MacFarlane	be9e93d4ae	LaTeX writer: create hypertarget for links with identifier. Closes #6360.	2020-05-12 14:37:07 -07:00
John MacFarlane	46179d5b3e	Use latest skylighting. This adds `aria-hidden="true"` to the empty a elements, which helps people who use screen readers.	2020-05-12 14:37:07 -07:00
andrebauer	97fe2ea16c	LaTeX Writer: Add support for customizable alignment of columns in beamer (#6331 ) Add support for customizable alignment of columns in beamer. Closes #4805, closes #4150.	2020-05-02 17:08:16 -07:00
John MacFarlane	f268ae3035	RST writer: properly handle images with same alt text. Previously we created duplicate references for these in rendering RST. Closes #6194.	2020-04-24 16:54:52 -07:00
John MacFarlane	6baacb51bb	AsciiDoc writer: add blank line after Div. Closes #6308.	2020-04-22 23:04:43 -07:00
John MacFarlane	9a809d4d01	Markdown writer: avoid unnecessary escapes before intraword `_` when `intraword_underscores` extension is enabled. Closes #6296.	2020-04-17 22:42:21 -07:00
John MacFarlane	8f40b4ba14	LaTeX reader: don't put surrounding Div around Table. This reverts a change in the last release; the Div is no longer needed, because we can now put the id right in the Table's attributes. However, writers may still need to be modified to do something with the id in a Table (e.g. create an anchor), so in the short term we may lose the ability to link to tables in some writers.	2020-04-17 13:04:15 -07:00
despresc	c7814f31e1	Use the new builders, modify readers to preserve empty headers The Builder.simpleTable now only adds a row to the TableHead when the given header row is not null. This uncovered an inconsistency in the readers: some would unconditionally emit a header filled with empty cells, even if the header was not present. Now every reader has the conditional behaviour. Only the XWiki writer depended on the header row being always present; it now pads its head as necessary.	2020-04-15 23:03:22 -04:00
despresc	d368536a4e	Adapt to the removal of the RowSpan, ColSpan, RowHeadColumns accessors	2020-04-15 23:03:22 -04:00
despresc	4e34d366df	Adapt to the newest Table type, fix some previous adaptation issues - Writers.Native is now adapted to the new Table type. - Inline captions should now be conditionally wrapped in a Plain, not a Para block. - The toLegacyTable function now lives in Writers.Shared.	2020-04-15 23:03:22 -04:00
despresc	7254a2ae0b	Implement the new Table type	2020-04-15 23:03:22 -04:00
John MacFarlane	71c4857464	JATS reader: handle "label" element in section title. Closes #6288.	2020-04-15 09:23:04 -07:00
John MacFarlane	9187b4bca9	LaTeX writer: ensure that `-M csquotes` works even in fragment mode. Closes #6265.	2020-04-11 10:40:59 -07:00
Tristan de Cacqueray	dd06d63540	HTML reader: support <bdo> (#6271 ) See https://developer.mozilla.org/en-US/docs/Web/HTML/Element/bdo Closes #5794	2020-04-11 09:57:59 -07:00
John MacFarlane	11df2a3c0f	LaTeX reader: better handling of `\lettrine`. - SmallCaps instead of Span for the part after the initial capital. - Ensure that both arguments are parsed, so that in Markdown both are treated as raw LateX. (Closes #6258.)	2020-04-07 09:25:52 -07:00
John MacFarlane	0edc084c50	Revert "Allow specifying string value in metadata using `!!literal` tag." This reverts commit `3493d6afaa`. This might be worth considering in the future, but let's not do it yet...the additional complexity needs a better justification.	2020-02-17 15:58:21 -08:00
John MacFarlane	3493d6afaa	Allow specifying string value in metadata using `!!literal` tag. This is experimental. Normally metadata values are interpreted as markdown, but if the !!literal tag is used they will be interpreted as plain strings. We need to consider whether this can still be implemented if we switch back from HsYAML to yaml for performance reasons.	2020-02-17 09:53:36 -08:00
Ethan Riley	daf770c1e9	Fixes: group biblatex citations even with prefix and suffix (#6058 ) Closes #5849. Previously biblatex citations were only grouped if there was no prefix. This patch allows them to be grouped in subgroups split by prefixes and suffixes, which allows better citation sorting.	2020-02-14 08:44:40 -08:00
John MacFarlane	3a79f37d88	LaTeX reader: improve caption and label parsing. - Don't emit empty Span elements for labels. - Put tables with labels in a surrounding Div.	2020-02-12 17:43:55 -08:00
John MacFarlane	3fbee8c6ed	LaTeX reader: resolve `\ref` to table numbers. Closes #6137.	2020-02-11 22:28:06 -08:00
John MacFarlane	114d77c2ab	Fix spurious dots in markdown_mmd metadata output Closes #6133 (regression).	2020-02-10 09:00:21 -08:00
John MacFarlane	5f0bd52221	reveal.js: ensure that pauses work even in title slides. Closes #5819.	2020-02-08 09:38:07 -08:00
John MacFarlane	0a4f49d370	MediaWiki writer: prevent triple `[[[`. This confuses mediawiki's parser. So we insert a `<nowiki/>` no-op between a literal `[` and a link. Closes #6119.	2020-02-05 10:08:18 -08:00
John MacFarlane	9c4dc8b49b	LaTeX reader: skip comments in more places where this is needed. Closes #6114.	2020-02-05 09:48:42 -08:00
John MacFarlane	d9b1776336	Fix duplicate frame classes in LaTeX/Beamer output. Close #6107.	2020-02-03 08:52:07 -08:00
John MacFarlane	fb3df6cf19	csv reader: allow empty cells.	2020-01-31 21:43:19 -08:00
John MacFarlane	f9514ccb9e	Add Text.Pandoc.Readers.CSV (readCSV). This adds csv as an input format. The CSV table is converted into a pandoc simple table. Closes #6100.	2020-01-31 21:14:21 -08:00
John MacFarlane	157936c927	HTML writer: fix duplicate attributes on headings. Another regression from 2.7.x. Closes #6062.	2020-01-12 15:18:37 -08:00
John MacFarlane	42b915e656	LaTeX reader: allow beamer overlays for all commands in all raw tex. This affecs parsing of raw tex in LaTeX and in Markdown and other formats. Closes #6043.	2020-01-10 08:26:33 -08:00
John MacFarlane	fc78be1140	LaTeX reader: improve parsing of raw environments. If parsing fails in a raw environment (e.g. due to special characters like unescaped `_`), try again as a verbatim environment, which is less sensitive to special characters. This allows us to capture special environments that change catcodes as raw tex when `-f latex+raw_tex` is used. Closes #6034.	2020-01-08 08:43:51 -08:00
John MacFarlane	ad5f7ecfce	Fix regression in handling of columns in beamer slides. Columns in title slides were causing problems with slide division. Closes #6033.	2020-01-07 11:11:23 -08:00
John MacFarlane	2a67b7aea9	Reveal.js writer: restore old behavior for 2D nesting. The fix to #6030 actually changed behavior, so that the 2D nesting occurred at slide level N-1 and N, instead of at the top-level section. This commit restores the 2.7.3 behavior. If there are more than 2 levels, the top level is horizontal and the rest are collapsed to vertical. Closes #6032.	2020-01-07 10:11:46 -08:00
John MacFarlane	66bca029c7	Fix regression in beamer slide structure with certain slide levels. Closes #6030.	2020-01-05 11:10:07 -08:00
John MacFarlane	0dff9b3cd2	Fix revealjs slide structure regression with certain slide levels. Partially addresses #6030.	2020-01-05 09:42:35 -08:00
John MacFarlane	82f44592ae	Fix parsing bug affected indented code after raw HTML. Closes #6009, #5360.	2019-12-27 13:08:22 -08:00
Albert Krewinkel	5bd2d28b19	Org reader: wrap named table in div, using name as id Closes: #5984	2019-12-18 22:28:26 +01:00
John MacFarlane	5029233293	Adjust test to work with Windows (I hope).	2019-12-17 14:13:05 -08:00
John MacFarlane	d0918627ca	Improved --toc generation.	2019-12-17 11:59:52 -08:00
John MacFarlane	20cf4e47b0	Improved makeSections so we don't get doubled attributes. Closes #5986.	2019-12-17 11:59:52 -08:00
John MacFarlane	b0c5ecbb1a	Added test for #5986 .	2019-12-17 11:59:52 -08:00
John MacFarlane	204c7bb943	Add section-divs command test (failing).	2019-12-17 11:59:52 -08:00
Albert Krewinkel	75dc013036	Org reader: add table labels to caption if both are present The table `#+NAME:` or `#+LABEL:` is added to the table's caption in the form of an empty span with the label set as the span's ID. Closes: #5984	2019-12-13 16:22:04 +01:00
John MacFarlane	0b54d6282b	Fix --toc-depth regression in 2.8. Closes #5967.	2019-12-07 14:20:41 -08:00
John MacFarlane	992f77c17c	Roll back part of of `--shift-heading-level-by` change. With positive heading shifts, starting in 2.8 this option caused metadata titles to be removed and changed to regular headings. This behavior is incompatible with the old behavior of `--base-header-level` and breaks old workflows, so with this commit we are rolling back this change. Now, there is an asymmetry in positive and negative heading level shifts: + With positive shifts, the metadata title stays the same and does not get changed to a heading in the body. + With negative shifts, a heading can be converted into the metadata title. I think this is a desirable combination of features, despite the asymmetry. One might, e.g., want to have a document with level-1 section headigs, but render it to HTML with level-2 headings, retaining the metadata title (which pandoc will render as a level-1 heading with the default template). Closes #5957. Revises #5615.	2019-12-05 09:59:50 -08:00
John MacFarlane	79a10388da	HTML writer: add task-list class to ul if all elements are task list items. This will allow styling unordered task lists in a way that omits the bullet.	2019-12-05 09:32:40 -08:00
John MacFarlane	4489283b03	Fix makeSections so it doesn't turn column divs into sections.	2019-12-05 09:17:28 -08:00
John MacFarlane	ce0a4f8c47	RST writers: Use grid tables for 1-column tables. With simple tables, we have a clash with heading syntax. Closes #5936.	2019-11-25 07:31:28 -08:00
John MacFarlane	c1b51b1282	Improve markdown escaping in list items. Closes #5918.	2019-11-19 22:33:14 -08:00
John MacFarlane	1f69162ffd	RST writer: Improve spacing for tables with no width information. If a simple table would be too wide, we use a grid table. The code for generating grid tables has been adjusted to give more intelligent column widths when widths aren't given. (This also affects the markdown writer.) Closes #5899.	2019-11-15 23:09:53 -08:00
John MacFarlane	81fae63a54	Change optInputFiles to a `Maybe [FilePath]`. `Nothing` means: nothing specified. `Just []` means: an empty list specified (e.g. in defaults). Potentially these could lead to different behavior: see #5888.	2019-11-14 18:42:55 -08:00
John MacFarlane	a60eb60a3d	Allow combining `-Vheader-includes` and `--include-in-header`. Closes #5904.	2019-11-14 07:48:19 -08:00
John MacFarlane	3645f9babe	Fixed some test locations and put test data files in extra-source-files.	2019-11-14 06:21:00 -08:00
John MacFarlane	a1f69b1c7d	Fix regression preventing header-includes from being set using -V. See #5904.	2019-11-14 05:48:20 -08:00
John MacFarlane	cbcaf19174	Add test for #5881 .	2019-11-13 17:07:44 -08:00
John MacFarlane	5c0b3743be	Ensure there's a blank line before RST tables. Closes #5898.	2019-11-13 10:10:55 -08:00
John MacFarlane	741b1f7fb4	Markdown reader: fix small super/subscript issue. Superscripts and subscripts cannot contain spaces, but newlines were previously allowed (unintentionally). This led to bad interactions in some cases with footnotes. E.g. ``` foo^[note] bar^[note] ``` With this change newlines are also not allowed inside super/subscripts. Closes #5878.	2019-11-11 09:08:52 -08:00
Florian Beeres	bf2eb4f288	Change the implementation of `htmlSpanLikeElements` and implement `<dfn>` (#5882 ) * Add HTML Reader support for `<dfn>`, parsing this as a Span with class `dfn`. * Change `htmlSpanLikeElements` implementation to retain classes, attributes and inline content.	2019-11-11 08:55:58 -08:00
John MacFarlane	3bf5362898	DocBook reader: Fix bug with entities in mathphrase element. Closes #5885.	2019-11-07 23:08:05 -08:00
John MacFarlane	9c7f75afb5	Change merge behavior for metadata. Previously, if a document contained two YAML metadata blocks that set the same field, the conflict would be resolved in favor of the first. Now it is resolved in favor of the second (due to a change in pandoc-types). This makes the behavior more uniform with other things in pandoc (such as reference links and `--metadata-file`).	2019-11-07 10:48:38 -08:00
Dmitry Pogodin	270ffe6ab5	Place caption before table in OpenDocument format. (#5869 ) Closes #5681.	2019-11-03 07:17:05 -08:00
John MacFarlane	65593043c3	LaTeX reader: Fixed dollar-math parsing... ...to ensure that space is left between a control seq and a following word that would otherwise change its meaning. Closes #5836.	2019-11-02 12:44:48 -07:00
John MacFarlane	f39c44f0ba	Add test for #5836 .	2019-11-02 12:27:15 -07:00
John MacFarlane	6c9a20b2d3	Test for macro definitions in LaTeX preamble.	2019-11-02 11:08:26 -07:00
John MacFarlane	724fd655e7	Add test case for #5845 .	2019-11-02 09:40:08 -07:00
John MacFarlane	63cfd45406	T.P.W.Shared: Changed gridTables so it does better at... ...keeping the widths of columns. See #4320. Adjust test case for #4320.	2019-10-29 22:21:35 -07:00
John MacFarlane	a796b655cd	Shared.makeSections: better behavior in some corner cases. When a div surrounds multiple sections at the same level, or a section of highre level followed by one of lower level, then we just leave it as a div and create a new div for the section. Closes #5846, closes #5761.	2019-10-29 21:24:58 -07:00
John MacFarlane	47566817c5	Shared: improve isTight. If a list has an empty item, this should not count against its being a tight list. Closes #5857.	2019-10-28 21:14:01 -07:00
Florian B	a933ad30fb	Add support for reading & writing <mark> elements Parse <mark> elements from HTML as HTML span like elements, with a single class matching the tag name `mark`. Mark elements are rendered to HTML using the native <mark> element. Fixes https://github.com/jgm/pandoc/issues/5797.	2019-10-16 09:08:12 -07:00
Daniele D'Orazio	1425bf9a65	Add support for reading and writing <kbd> elements * Text.Pandoc.Shared: export `htmlSpanLikeElements` [API change] This commit also introduces a mapping of HTML span like elements that are internally represented as a Span with a single class, but that are converted back to the original element by the html writer. As of now, only the kbd element is handled this way. Ideally these elements should be handled as plain AST values, but since that would be a breaking change with a large impact, we revert to this stop-gap solution. Fixes https://github.com/jgm/pandoc/issues/5796.	2019-10-15 09:49:09 -07:00
John MacFarlane	9e3e195dd4	Fix `gfm_auto_identifiers` behavior with emojis. Closes #5813. Note that we also now use emoji names for emojis when `ascii_identifiers` is enabled.	2019-10-11 10:00:33 -07:00
John MacFarlane	7c2dd0359b	Markdown writer: prefer pipe_tables to raw html... ...even when we must lose width information. All in all this seems to be people's preferred behavior, even though it is slightly lossier. Closes #2608. Closes #4497.	2019-10-11 09:46:58 -07:00
John MacFarlane	a3cd74c29b	`--metadata-file`: when multiple files specified, second takes precedence... on conflicting fields. This changes earlier behavior (but not in a release), where first took precedence. Note that this may seem inconsistent with the behavior of multiple YAML blocks within a document, where the first takes precedence. Still, it is convenient to be able to override defaults with options later on the command line.	2019-10-10 10:00:45 -07:00
John MacFarlane	68b09a6d81	Make some writers sensitive to 'unlisted' class on headings. If this is present on a heading with the 'unnumbered' class, the heading won't appear in the TOC. This class has no effect if 'unnumbered' is not also specified. This affects HTML-based writers (including slide shows and epub), LateX (including beamer), RTF, and PowerPoint. Other writers do not yet support `unlisted`. Closes #1762.	2019-10-10 09:15:40 -07:00
John MacFarlane	a3729ef2da	RST writer: proper handling of :align: on figures, images. When the image has the `align-right` (etc.) class, we now use an `:align:` attribute. Closes #4420.	2019-10-09 15:05:22 -07:00
Nils Carlson	8028de3322	odt: Add external option for native numbering This adds an external options +native_numbering to the ODT writer enabling enumeration of figures and tables in ODT output.	2019-09-24 15:23:59 -07:00
John MacFarlane	ba14649945	Improve test #5753	2019-09-22 22:00:20 -07:00
John MacFarlane	9abed45879	RST reader: Fixed parsing of indented blocks. We were requiring consistent indentation, but this isn't required by RST, as long as each nonblank line of the block has some indentation. Closes #5753.	2019-09-22 12:01:45 -07:00
John MacFarlane	d247e9f72e	Make `plain` output plainer. Previously we used the following Project Gutenberg conventions for plain output: - extra space before and after level 1 and 2 headings - all-caps for strong emphasis `LIKE THIS` - underscores surrounding regular emphasis `_like this_` This commit makes `plain` output plainer. Strong and Emph inlines are rendered without special formatting. Headings are also rendered without special formatting, and with only one blank line following. To restore the former behavior, use `-t plain+gutenberg`. API change: Add `Ext_gutenberg` constructor to `Extension`. See #5741.	2019-09-22 11:33:09 -07:00
John MacFarlane	5a85789185	Remove admonition-title remnants. Completes `8e01ccb41d`	2019-09-19 16:09:38 -07:00
John MacFarlane	88dc6fac5d	Add --shift-heading-level-by option. Deprecate --base-heading-level. The new option does everything the old one does, but also allows negative shifts. It also promotes the document metadata (if not null) to a level-1 heading with a +1 shift, and demotes an initial level-1 heading to document metadata with a -1 shift. This supports converting documents that use an initial level-1 heading for the document title. Closes #5615.	2019-09-10 23:16:13 -07:00
John MacFarlane	4778d03473	LaTeX reader: Fix parsing of optional arguments that contain braced text. Closes #5740.	2019-09-09 21:33:16 -07:00
Brian Leung	0558ea9836	Org reader: modify handling of example blocks. (#5717 ) * Org reader: allow the `-i` switch to ignore leading spaces. * Org reader: handle awkwardly-aligned code blocks within lists. Code blocks in Org lists must have their #+BEGIN_ aligned in a reasonable way, but their other components can be positioned otherwise.	2019-09-08 22:34:10 -07:00
John MacFarlane	9f984ff26a	Replace Element and makeHierarchical with makeSections. Text.Pandoc.Shared: + Remove `Element` type [API change] + Remove `makeHierarchicalize` [API change] + Add `makeSections` [API change] + Export `deLink` [API change] Now that we have Divs, we can use them to represent the structure of sections, and we don't need a special Element type. `makeSections` reorganizes a block list, adding Divs with class `section` around sections, and adding numbering if needed. This change also fixes some longstanding issues recognizing section structure when the document contains Divs. Closes #3057, see also #997. All writers have been changed to use `makeSections`. Note that in the process we have reverted the change `c1d058aeb1` made in response to #5168, which I'm not completely sure was a good idea. Lua modules have also been adjusted accordingly. Existing lua filters that use `hierarchicalize` will need to be rewritten to use `make_sections`.	2019-09-08 22:20:19 -07:00
John MacFarlane	1ccff3339d	Revert changes to hierarchicalizeWithIds. Revert "hierarchicalize: ensure that sections get ids..." This reverts commit `212406a61d`. Revert "Improve detection of headings in Divs by hierarchicalize." This reverts commit `6e2cfd6c97`. Revert "Shared.hierarchicalize: improve handling of div and section structure." This reverts commit `345b33762e`.	2019-09-08 21:56:42 -07:00
John MacFarlane	212406a61d	hierarchicalize: ensure that sections get ids... even if they're in divs. Improves #3057.	2019-09-06 09:05:52 -07:00
John MacFarlane	6e2cfd6c97	Improve detection of headings in Divs by hierarchicalize. The structure ``` <h1>one</h1> <div> <h1>two</h1> </div> ``` should create two coordinate sections, not a section with a subsection. Now it does. Extends #3057.	2019-09-06 08:44:59 -07:00
John MacFarlane	345b33762e	Shared.hierarchicalize: improve handling of div and section structure. Previously Divs were opaque to hierarchicalize, so headings inside divs didn't get into the table of contents, for example (#3057). Now hierarchicalize treats Divs as sections when appropriate. For example, these structures both yield a section and a subsection: ``` html <div> <h1>one</h1> <div> <h2>two</h2> </div> </div> ``` ``` html <div> <h1>one</h1> <div> <h1>two</h1> </div> </div> ``` Note that ``` html <h1>one</h1> <div> <h2>two</h2> </div> <h1>three</h1> ``` gets parsed as the structure one two three which may not always be desirable. Closes #3057.	2019-09-05 22:37:13 -07:00
John MacFarlane	e4cca4cf67	Roff readers: better parsing of groups. We now allow groups where the closing `\\}` isn't at the beginning of a line. Closes #5410.	2019-09-04 09:24:42 -07:00
John MacFarlane	513058a24e	XML: change toEntities to emit numerical hex character references. Previously decimal references were used. But Polyglot Markup prefers hex. See #5718. This affects the output of pandoc with `--ascii`.	2019-09-03 11:28:20 -07:00
John MacFarlane	6b286a1d74	LaTeX reader: don't try to parse includes if raw_tex is set. When the `raw_tex` extension is set, we just carry through `\usepackage`, `\input`, etc. verbatim as raw LaTeX. Closes #5673.	2019-09-02 21:03:05 -07:00
John MacFarlane	d79242796b	HTML writer: use numeric character references with `--ascii`. Previously we used named character references with html5 output. But these aren't valid XML, and we aim to produce html5 that is also valid XHTML (polyglot markup). (This is also needed for epub3.) Closes #5718.	2019-09-02 20:36:57 -07:00
John MacFarlane	5e708eb8ce	LaTeX reader: properly handle optional arguments for macros. Closes #5682.	2019-09-02 18:48:37 -07:00
John MacFarlane	fba1296fd1	LaTeX reader: fix `\\` in `\parbox` inside a table cell. Closes #5711.	2019-08-27 10:48:02 -07:00
John MacFarlane	167fc4bc87	Markdown reader: Headers: don't parse content over newline boundary. Closes #5714.	2019-08-27 10:15:00 -07:00
John MacFarlane	180f534d21	Add test for issue #5708 .	2019-08-26 15:20:22 -07:00
John MacFarlane	1ee6e0e087	Use new doctemplates, doclayout. + Remove Text.Pandoc.Pretty; use doclayout instead. [API change] + Text.Pandoc.Writers.Shared: remove metaToJSON, metaToJSON' [API change]. + Text.Pandoc.Writers.Shared: modify `addVariablesToContext`, `defField`, `setField`, `getField`, `resetField` to work with Context rather than JSON values. [API change] + Text.Pandoc.Writers.Shared: export new function `endsWithPlain` [API change]. + Use new templates and doclayout in writers. + Use Doc-based templates in all writers. + Adjust three tests for minor template rendering differences. + Added indentation to body in docbook4, docbook5 templates. The main impact of this change is better reflowing of content interpolated into templates. Previously, interpolated variables were rendered independently and intepolated as strings, which could lead to overly long lines. Now the templates interpolated as Doc values which may include breaking spaces, and reflowing occurs after template interpolation rather than before.	2019-08-25 14:24:31 -07:00
Owen McGrath	92debe4b9e	Change optMetadataFile type from Maybe to List (#5702 ) Changed optMetadataFile from `Maybe FilePath` to `[FilePath]`. This allows for multiple YAML metadata files to be added. The new default value has been changed from `Nothing` to `[]`. To account for this change in `Text.Pandoc.App`, `metaDataFromFile` now operates on two `mapM` calls (for `readFileLazy` and `yamlToMeta`) and a fold. Added a test (command/5700.md) which tests this functionality and updated MANUAL.txt, as per the contributing guidelines. With the current behavior, using `foldr1 (<>)`, values within files specified first will be used over those in later files. (If the reverse of this behavior would be preferred, it should be fixed by changing foldr1 to foldl1.)	2019-08-24 09:41:25 -07:00
John MacFarlane	9d581428f9	Add test for #5690 .	2019-08-23 10:15:42 -07:00
John MacFarlane	1c71bd1ff5	Ensure proper nesting when we have long ordered list markers. Closes #5705.	2019-08-23 09:16:54 -07:00
John MacFarlane	79a3449eeb	LaTeX reader: improve withRaw so it can handle cases where... the token string is modified by a parser (e.g. accent when it only takes part of a Word token). Closes #5686. Still not ideal, because we get the whole `\t0BAR` and not just `\t0` as a raw latex inline command. But I'm willing to let this be an edge case, since you can easily work around this by inserting a space, braces, or raw attribute. The important thing is that we no longer drop the rest of the document after a raw latex inline command that gobbles only part of a Word token!	2019-08-14 14:34:44 -07:00
John MacFarlane	eb23527121	Rename test for 5685 -> 5684 (typo in last commit). Closes #5684. (Note that #5685 is NOT closed by previous commit.)	2019-08-14 11:13:18 -07:00
John MacFarlane	0b2fb9b8f9	Add thin space when needed in LaTeX quote ligatures. Closes #5685.	2019-08-14 11:07:02 -07:00
Philip Pesca	01a52e2300	HTML writer: ensure TeX formulas are rendered correctly (#5658 ) The web service passed in to `--webtex` may render formulas using inline or display style by default. Prefixing formulas with the appropriate command ensures they are rendered correctly. This is a followup to the discussion in #5656.	2019-07-24 10:10:36 -07:00
Philip Pesca	bc508534a6	HTML writer: render inline formulas correctly with --webtex (#5656 ) We add `\textstyle` to the beginning of the formula to ensure it will be rendered in inline style. Closes #5655.	2019-07-23 12:21:31 -07:00
John MacFarlane	db5f6dd4fe	Fix error introduced in change to test for 4669.	2019-07-22 15:32:49 -07:00
John MacFarlane	d9960244d8	LaTeX reader: support tex `\tt` command. Closes #5654.	2019-07-22 15:29:07 -07:00
John MacFarlane	91d4283263	LaTeX writer: fix line breaks at start of paragraph. Previously we just omitted these. Now we render them using `\hfill\break` instead of `\\`. This is a revision of a PR by @sabine (#5591) who should be credited with the idea. Closes #3324.	2019-07-20 17:12:53 -07:00
John MacFarlane	465eeece6b	LaTeX reader: search for image with list of extensions... like latex does, if an extension is not provided. Closes #4933.	2019-07-20 10:17:49 -07:00
John MacFarlane	339392bf54	Markdown: Ensure that expanded latex macros end with space if original did. Closes #4442.	2019-07-19 10:32:59 -07:00
John MacFarlane	28cad16517	Markdown writer: prefer using raw_attribute when enabled. The `raw_attribute` will be used to mark raw bits, even HTML and LaTeX, and even when `raw_html` and `raw_tex` are enabled, as they are by default. To get the old behavior, disable `raw_attribute` in the writer. Closes #4311.	2019-07-18 22:31:03 -07:00
John MacFarlane	5c655e86d5	HTML writer: ensure that line numbers in code blocks get id-prefix. Closes #5650.	2019-07-18 22:08:37 -07:00
John MacFarlane	0d72237e27	Dokuwiki writer: handle mixed lists without HTML fallback. Closes #5107.	2019-07-16 13:14:37 -07:00
Vasily Alferov	f6c92c7523	Fix #4499 : add mbox and hbox handling to LaTeX reader (#5586 ) When `+raw_tex` is enabled, these are passed through literally. Otherwise, they are handled in a way that emulates LaTeX's behavior.	2019-07-13 16:55:41 -07:00
John MacFarlane	4a5e727c8c	Man writer: Improved definition list term output. Now we boldface code but not other things. This matches the most common style in man pages (particularly option lists). Also, remove a regression in the last commit in which 'nowrap' was removed.	2019-07-13 16:41:43 -07:00
John MacFarlane	d0bf7efe95	Man writer: fixed boldfacing of definition terms. Previously the bold-facing would be interrupted by other formatting, because we used `.B`. Closes #5620.	2019-07-13 16:12:28 -07:00
John MacFarlane	1784161946	LaTeX reader: Properly handle \providecommand and environment... They are now ignored if the corresponding command or environment is already defined. Closes #5635.	2019-07-13 15:51:33 -07:00
mb21	6cf5c3f6ac	fix filename and issue reference of previous commit	2019-07-13 12:03:45 +02:00
John MacFarlane	6d30d3e0b3	Pass through aria- attributes to HTML5. Also document addition of data- prefix to unknown attributes. Closes #5646.	2019-07-12 17:03:01 -07:00
Brian Leung	1d9ff85b45	RST reader: keep `name` property in `imgAttr`. (#5637 ) Closes #5619.	2019-07-10 18:35:01 -07:00
Brian Leung	9c4ba81357	Markdown reader: handle inline code more eagerly within lists. (#5628 ) Closes #5627.	2019-07-06 23:14:21 +02:00
oquechy	f0edf60364	Support epigraph command in LaTeX Reader. Closes #3523.	2019-06-21 18:27:26 +02:00
John MacFarlane	bec95c97ac	LaTeX writer: Don't highlight code in headings. This causes compilation errors, and I don't know how to work around them. Closes #5574.	2019-06-11 20:47:29 -07:00
John MacFarlane	3febd81cbc	LaTeX writer: Use mbox to get proper behavior inside `\sout`. Closes #5529.	2019-06-10 15:02:48 -07:00
John MacFarlane	59529e408b	Asciidoc writer: use doubled ## when necessary for spans. Closes #5566.	2019-06-10 14:47:04 -07:00
John MacFarlane	2e12106a90	Asciidoc writer: ensure correct nesting strong/emph. Closes #5565.	2019-06-10 14:42:08 -07:00
John MacFarlane	d1df2b2783	LaTeX reader: pass through unknown listings language as class. Previously if the language was not in the list of listings- supported languages, it would not be added as a class, so custom syntax highlighting could not be used. Closes #5540.	2019-06-08 12:25:34 -07:00
John MacFarlane	d8b4e45be0	LaTeX writer: Include inline code attributes with `--listings`. Closes #5420.	2019-06-07 10:03:10 -07:00
John MacFarlane	10615420de	Include trailing {}s in raw latex commands. Change is in rawLaTeXInline in LaTeX reader, but it affects the markdown reader and other readers that allow raw LaTeX. Previously, trailing `{}` would be included for unknown commands, but not for known commands. However, they are sometimes used to avoid a trailing space after the command. The chances that a `{}` after a LaTeX command is not part of the command are very small. Closes #5439.	2019-06-04 21:20:11 -07:00
John MacFarlane	f82d91eb49	Markdown reader: don't create implicit reference for empty header. Closes #5549.	2019-06-04 08:39:54 -07:00
John MacFarlane	928681ca04	Avoid unwanted interpretation of def list term as other kind of block, e.g. ordered list item, in Markdown writer. Closes #554.	2019-06-03 09:51:19 -07:00
mb21	a58304e00e	HTML writer: output video and audio elements depending on file extension of the image path	2019-05-29 09:43:50 +02:00
John MacFarlane	2ad5dacf87	Remove command test for #5517 . We need a better test that works cross-platform. Until then, removing this. Closes #5528.	2019-05-28 12:45:31 -07:00
Mauro Bieg	3f57f49033	HTML writer: emit empty alt tag in figures (#5518 ) The same text is already in the <figcaption> and screen-readers would read it twice, see #4737	2019-05-28 12:31:41 -04:00
John MacFarlane	8a5b9ac868	Add test for relative file: URI to #5517 .	2019-05-28 09:05:28 -07:00
Mauro Bieg	214da7217b	Fix handling of `file:` URL scheme in `downloadOrRead` (#5522 ) Move up the pattern match to be reachable, closes #5517. Previously `file:/` URLs were handled wrongly and pandoc attempted to make HTTP requests, which failed.	2019-05-28 11:51:21 -04:00
Alexander Krotov	7514277454	HTML reader: trim definition list terms	2019-05-25 18:36:56 +03:00
John MacFarlane	aef71894ce	Markdown writer: Ensure the code fence is long enough. Previously too few backticks were used when the code block contained an indented line of backticks. (Ditto tildes.) Cloess #5519.	2019-05-22 15:21:15 -07:00
Jesse Rosenthal	ed73bd28e5	Markdown writer: Handle labels with integer names Previously if labels had integer names, it could produce a conflict with auto-labeled reference links. Now we test for a conflict and find the next available integer. Note that this involves adding a new state variable `stPrevRefs` to keep track of refs used in other document parts when using `--reference-location=block\|section` Closes #5495	2019-05-21 12:19:59 -04:00

... 3 4 5 6 7 ...

817 commits