Commit graph

13437 commits

Author SHA1 Message Date
John MacFarlane
490f34dee5 Markdown writer: move asciify out of escapeString.
Otherwise unsmartify doesn't catch quotes that have
already been turned to entities.
2020-07-19 22:51:59 -07:00
John MacFarlane
d6b7b1dc77 Remove use of cmark-gfm for commonmark/gfm rendering.
Instead rely on the markdown writer with appropriate extensions.

Export writeCommonMark variant from Markdown writer.
This changes a few small things in rendering markdown,
e.g. w/r/t requiring backslashes before spaces inside
super/subscripts.
2020-07-19 22:51:59 -07:00
John MacFarlane
a63105ffff Markdown writer: use unicode super/subscript characters...
when possible if the superscript or subscript extension or
raw_html aren't available.
2020-07-19 22:51:59 -07:00
John MacFarlane
8d0676ec4d Markdown writer: render caption as following paragraph...
when `Ext_table_caption` not enabled.
2020-07-19 22:51:59 -07:00
John MacFarlane
0df5ea87cf Use released versions of commonmark libraries. 2020-07-19 22:51:59 -07:00
John MacFarlane
0aed9dd589 Add commonmark_x output format...
commonmark with a number of useful extensions (more than gfm).
2020-07-19 22:51:59 -07:00
John MacFarlane
3a22fbd11b Trim down githubMarkdownExtensions.
Previously it included all of the following, which make
sense for the legacy markdown_github but not for gfm,
since they are part of base commonmark and thus
can't be turned off in gfm:

- `Ext_all_symbols_escapable`
- `Ext_backtick_code_blocks`
- `Ext_fenced_code_blocks`
- `Ext_space_in_atx_header`
- `Ext_intraword_underscores`
- `Ext_lists_without_preceding_blankline`
- `Ext_shortcut_reference_links`
`
These have been removed from `githubMarkdownExtensions`, though
they're still turned on for legacy `markdown_github`.
2020-07-19 22:51:59 -07:00
John MacFarlane
8d523d80d4 Add generic attributes extension.
This allows attributes to be added to any block or inline
element, in principle.  (Though in many cases this will be
done by adding a Div or Span container, since pandoc's
AST doesn't have a slot for attributes for most elements.)

Currently this is only possible with the commonmark and gfm
readers.

Add `Ext_attributes` constructor for `Extension` [API change].
2020-07-19 22:51:59 -07:00
John MacFarlane
0db4702042 Use commonmark-hs to parse commonmark/gfm...
...instead of cmark-gfm (a wrapper around a C library).

We can now support many more pandoc extensions for
commonmark and gfm.

Add fenced_code_attributes to gfm/commonmark extensions.
2020-07-19 22:51:59 -07:00
John MacFarlane
c2cc4a0504 CI: Drop testing on ghc 8.0.2. 2020-07-19 22:51:59 -07:00
John MacFarlane
7dbf03f527 MANUAL.txt: rewrite Raw HTML/TeX section...
...to avoid duplicate headings for the extensions.
2020-07-19 22:51:59 -07:00
John MacFarlane
8ede05161f
Merge pull request #6495 from tarleb/html5-figure-accessiblity
HTML writer: improve alt-text/caption handling for HTML5
2020-07-19 11:24:54 -07:00
John MacFarlane
89b8624269
Merge pull request #6547 from Blake-Eryx/master
Remove duplicate 'titlepage' in docs
2020-07-19 11:15:11 -07:00
Blake Eryx
920732078c Remove duplicate 'titlepage' in docs 2020-07-19 14:03:18 -04:00
John MacFarlane
3b563cfe8f DocBook reader: parse releaseinfo as metadata.
Closes #6542.

Note that you'll need to put releaseinfo somewhere in your
template if you want this to be part of the converted output.
2020-07-18 12:32:31 -07:00
Albert Krewinkel
36fede2b02
Jira writer: keep image caption as alt attribute
Fixes #6529
2020-07-17 16:02:40 +02:00
John MacFarlane
06d834caaa Use selnolig to selectively suppress ligatures with lualatex.
Closes #6534
2020-07-15 13:28:44 -07:00
John MacFarlane
302543af08 Docbook reader: remove misleading comment...
suggesting that releaseinfo is handled. It isn't.
2020-07-14 10:10:43 -07:00
John MacFarlane
2cc0d68ca0
Merge pull request #6527 from lierdakil/fix-6514
[Docx Reader] Only use bCs/iCs on runs with rtl or cs property
2020-07-13 10:57:04 -07:00
John MacFarlane
86d465e572
Merge pull request #6528 from lierdakil/fix-ci-haskell-setup
[CI] Use setup-haskell action
2020-07-13 10:36:20 -07:00
Nikolay Yakimov
22c373370c [Docx Reader] Only use bCs/iCs on runs with rtl or cs property
Fixes #6514
2020-07-13 19:50:06 +03:00
Nikolay Yakimov
f52337ccdc [CI] Use setup-haskell action 2020-07-13 19:49:36 +03:00
John MacFarlane
c3b170be1c
Merge pull request #6513 from brisad/master
Escape starting periods in ms writer code blocks
2020-07-12 17:02:06 -07:00
John MacFarlane
651a3d96c4
Merge pull request #6506 from bewuethr/fix-manual-typo
Fix typo in MANUAL.txt
2020-07-12 16:56:13 -07:00
John MacFarlane
7be86b148e
Merge pull request #6509 from lierdakil/docx-smush-inlines-refactor
[Docx Reader] Refactor/update Text.Pandoc.Readers.Docx.Combine.smushInlines
2020-07-12 16:55:35 -07:00
John MacFarlane
20aea4c73e
Merge pull request #6508 from lierdakil/bench-bs-readers
[Bench] Benchmark bytestring readers
2020-07-12 16:53:47 -07:00
John MacFarlane
9ae792b0d4 Ms writer: fix code highlighting with blank lines.
Previously blank lines were simply omitted from highligted code.
2020-07-12 14:51:21 -07:00
John MacFarlane
e3217c3862 RST reader: fix spurious newlines in some attributes from directives. 2020-07-12 14:42:41 -07:00
John MacFarlane
37e68a818b RST reader: avoid extra newline in included code blocks. 2020-07-12 13:53:10 -07:00
tajmone
bf7c624c80
Fix Typos in Lua Filters Doc 2020-07-09 18:48:52 +02:00
Michael Hoffmann
09ea10e2b1 Escape starting periods in ms writer code blocks
If a line of ms code block output starts with a period (.), it should
be prepended by '\&' so that it is not interpreted as a roff command.

Fixes #6505
2020-07-08 23:52:28 +02:00
Nikolay Yakimov
f09e18753b [Docx Reader] Use null instead of isEmpty in Readers.Docx.Combine 2020-07-07 14:50:26 +03:00
Nikolay Yakimov
c28cff66b3 [Bench] Benchmark bytestring readers 2020-07-07 14:44:57 +03:00
Nikolay Yakimov
5a1e1db526 [Docx Reader] Remove unused LANGUAGE from Readers.Docx.Combine 2020-07-07 13:23:14 +03:00
Nikolay Yakimov
1ae4d76d42 [Docx Reader] Remove no-op stack/unstackInlines in Readers.Docx.Combine 2020-07-07 12:28:38 +03:00
Nikolay Yakimov
27465638a1 [Docx Reader] Get rid of unused NullModifier in Readers.Docx.Combine 2020-07-07 11:32:17 +03:00
Nikolay Yakimov
48cef91d18 [Docx Reader] Refactor/update smushInlines 2020-07-07 09:04:38 +03:00
Benjamin Wuethrich
e965926c1a Fix typo in MANUAL.txt 2020-07-05 23:50:04 -04:00
John MacFarlane
804e8eeed2 Revert "Ipnyb: allow lossless round-tripping of markdown cell content."
This reverts commit efbc205031.
2020-07-02 09:03:56 -07:00
John MacFarlane
9afa192c3a Revert "Ipynb reader: fix duplication of 'source' attribute."
This reverts commit 2d009366ce.
2020-07-02 09:03:26 -07:00
John MacFarlane
2d009366ce Ipynb reader: fix duplication of 'source' attribute.
See #5408.
2020-07-02 09:01:06 -07:00
John MacFarlane
761e62c678 CONTRIBUTING.md: Advertise the official nightlies in GitHub actions.
Replaces #6500, thanks to @ickc.
2020-07-02 08:51:38 -07:00
Albert Krewinkel
b894de6426
HTML writer: improve alt-text/caption handling for HTML5
Screen readers read an image's `alt` attribute and the figure caption,
both of which come from the same source in pandoc. The figure caption is
hidden from screen readers with the `aria-hidden` attribute. This
improves accessibility.

For HTML4, where `aria-hidden` is not allowed, pandoc still uses an
empty `alt` attribute to avoid duplicate contents.

Closes: #6491
2020-07-01 14:54:52 +02:00
Albert Krewinkel
ccf9889c2c
Org reader: respect tables-excluding export setting
Tables can be removed from the final document with the `#+OPTION:
|:nil` export setting.
2020-07-01 09:28:24 +02:00
Albert Krewinkel
d6711bd7d9
Org reader: respect export setting disabling footnotes
Footnotes can be removed from the final document with the `#+OPTION:
f:nil` export setting.
2020-06-30 22:30:15 +02:00
John MacFarlane
efbc205031 Ipnyb: allow lossless round-tripping of markdown cell content.
The reader now parses the contents of the markdown cell to a Pandoc
structure, but *also* stores the raw markdown in a `source`
attribute on the cell Div.  When we convert back to markdown,
this attribute is stripped off and the original source is used.
When we convert to other formats, the attribute is usually
ignored (though it will come through in HTML as a `data-source`
attribute, not unhelpfully).

I'll note some potential drawbacks of this approach:

- It makes it impossible to use pandoc to clean up or
  change the contents of markdown cells, e.g.
  going from `+smart` to `-smart`.

- There may be formats where the addition of the `source`
  attribute is problematic.  I can't think of any, though.

Closes #5408.
2020-06-30 12:32:44 -07:00
John MacFarlane
35a18bb654 Fix date on changelog 2020-06-30 08:47:49 -07:00
Albert Krewinkel
7c207c3051
Org reader: respect export setting which disables entities
MathML-like entities, e.g., `\alpha`, can be disabled with the
`#+OPTION: e:nil` export setting.
2020-06-30 11:39:32 +02:00
John MacFarlane
c212886d2b Add Vaibhav Sagar to authors. 2020-06-29 22:08:56 -07:00
John MacFarlane
2cd3694818 Fix 'make pandoc-templates'. 2020-06-29 21:57:53 -07:00