pandoc

Author	SHA1	Message	Date
John MacFarlane	68b09a6d81	Make some writers sensitive to 'unlisted' class on headings. If this is present on a heading with the 'unnumbered' class, the heading won't appear in the TOC. This class has no effect if 'unnumbered' is not also specified. This affects HTML-based writers (including slide shows and epub), LateX (including beamer), RTF, and PowerPoint. Other writers do not yet support `unlisted`. Closes #1762.	2019-10-10 09:15:40 -07:00
John MacFarlane	a3729ef2da	RST writer: proper handling of :align: on figures, images. When the image has the `align-right` (etc.) class, we now use an `:align:` attribute. Closes #4420.	2019-10-09 15:05:22 -07:00
John MacFarlane	5ec9044288	Update s5 test for mathjax 3 change.	2019-10-09 14:32:30 -07:00
John MacFarlane	aceee9ca48	Options.WriterOptions: Change type of writerVariables to Context Text. This will allow structured values. [API change]	2019-10-09 11:01:33 -07:00
Alexander Krotov	6a9cafc67a	hlint Muse reader tests	2019-10-04 18:28:53 +03:00
John MacFarlane	7caaa3d5d6	Minor ghc 8.8 fixups.	2019-10-03 22:41:24 -07:00
Nils Carlson	8028de3322	odt: Add external option for native numbering This adds an external options +native_numbering to the ODT writer enabling enumeration of figures and tables in ODT output.	2019-09-24 15:23:59 -07:00
John MacFarlane	f223196c35	Man writer: suppress non-absolute link URLs. Motivation: in a man page there's not much use for relative URLs, which you can't follow. Absolute URLs are still useful. We previously suppressed relative URLs starting with '#' (purely internal links), but it makes sense to go a bit farther. Closes #5770.	2019-09-23 17:46:39 -07:00
John MacFarlane	e99050283e	ConTeXt unit tests - tweak code property. Inline code will never have two consecutive newlines. We get a counterexample in this case https://pipelines.actions.githubusercontent.com/bMXCpShstkkHbFPgw9hBRMWw2w9plyzdVM8r7CRPFBHFvidaAG/5cf52d2d-3804-412d-ae65-4f8c059b0fb7/_apis/pipelines/1/runs/116/signedlogcontent/39?urlExpires=2019-09-23T17%3A38%3A05.8358735Z&urlSigningMethod=HMACV1&urlSignature=Qtd6vnzqgSwXpAkIyp9DJY4Kn7GJzYMR8UDkLR%2FsMQY%3D so for simplicity we just weed out code with newlines.	2019-09-23 15:03:26 -07:00
John MacFarlane	ba14649945	Improve test #5753	2019-09-22 22:00:20 -07:00
John MacFarlane	9abed45879	RST reader: Fixed parsing of indented blocks. We were requiring consistent indentation, but this isn't required by RST, as long as each nonblank line of the block has some indentation. Closes #5753.	2019-09-22 12:01:45 -07:00
John MacFarlane	d247e9f72e	Make `plain` output plainer. Previously we used the following Project Gutenberg conventions for plain output: - extra space before and after level 1 and 2 headings - all-caps for strong emphasis `LIKE THIS` - underscores surrounding regular emphasis `_like this_` This commit makes `plain` output plainer. Strong and Emph inlines are rendered without special formatting. Headings are also rendered without special formatting, and with only one blank line following. To restore the former behavior, use `-t plain+gutenberg`. API change: Add `Ext_gutenberg` constructor to `Extension`. See #5741.	2019-09-22 11:33:09 -07:00
Nikolay Yakimov	5c5d1a65d9	[Docx Reader] Update tests Notice this commit updates lists.docx. The old test file contained references to "ListParagraph" style, which should never leak outside of pandoc, so I'm not sure what that was supposed to test for exactly.	2019-09-21 11:37:21 -07:00
Nikolay Yakimov	c113ca6717	[Docx Reader] Use style names, not ids, for assigning semantic meaning Motivating issues: #5523, #5052, #5074 Style name comparisons are case-insensitive, since those are case-insensitive in Word. w:styleId will be used as style name if w:name is missing (this should only happen for malformed docx and is kept as a fallback to avoid failing altogether on malformed documents) Block quote detection code moved from Docx.Parser to Readers.Docx Code styles, i.e. "Source Code" and "Verbatim Char" now honor style inheritance Docx Reader now honours "Compact" style (used in Pandoc-generated docx). The side-effect is that "Compact" style no longer shows up in docx+styles output. Styles inherited from "Compact" will still show up. Removed obsolete list-item style from divsToKeep. That didn't really do anything for a while now. Add newtypes to differentiate between style names, ids, and different style types (that is, paragraph and character styles) Since docx style names can have spaces in them, and pandoc-markdown classes can't, anywhere when style name is used as a class name, spaces are replaced with ASCII dashes `-`. Get rid of extraneous intermediate types, carrying styleId information. Instead, styleId is saved with other style data. Use RunStyle for inline style definitions only (lacking styleId and styleName); for Character Styles use CharStyle type (which is basicaly RunStyle with styleId and StyleName bolted onto it).	2019-09-21 11:18:15 -07:00
Ben Steinberg	7389919bb4	Preserve built-in styles in DOCX with custom style (#5670 ) This commit prevents custom styles on divs and spans from overriding styles on certain elements inside them, like headings, blockquotes, and links. On those elements, the "native" style is required for the element to display correctly. This change also allows nesting of custom styles; in order to do so, it removes the default "Compact" style applied to Plain blocks, except when inside a table.	2019-09-20 22:13:29 -07:00
John MacFarlane	5a85789185	Remove admonition-title remnants. Completes `8e01ccb41d`	2019-09-19 16:09:38 -07:00
Albert Krewinkel	d0261d7387	Lua filters: allow passing of HTML-like tables instead of Attr (#5750 ) Attr values can now be given as normal Lua tables; this can be used as a convenient alternative to define Attr values, instead of constructing values with `pandoc.Attr`. Identifiers are taken from the id field, classes must be given as space separated words in the class field. All remaining fields are included as misc attributes. With this change, the following lines now create equal elements: pandoc.Span('test', {id = 'test', class = 'a b', check = 1}) pandoc.Span('test', pandoc.Attr('test', {'a','b'}, {check = 1})) This also works when using the attr setter: local span = pandoc.Span 'text' span.attr = {id = 'test', class = 'a b', check = 1} Furthermore, the attributes field of AST elements can now be a plain key-value table even when using the `attributes` accessor: local span = pandoc.Span 'test' span.attributes = {check = 1} -- works as expected now Closes: #5744	2019-09-15 12:11:58 -07:00
John MacFarlane	45b7636307	Revert "FB2 reader test: better diagnostics on failure." This reverts commit `c65af7d1a2`.	2019-09-15 10:27:19 -07:00
John MacFarlane	c65af7d1a2	FB2 reader test: better diagnostics on failure.	2019-09-15 09:06:38 -07:00
John MacFarlane	88a0327579	FB2 reader test: Another attempt to fix test failure on GitHub CI.	2019-09-14 10:37:19 -07:00
John MacFarlane	7ecae69e27	Revert "FB2 reader test: filter CRs." This reverts commit `e35147d715`.	2019-09-13 22:08:42 -07:00
John MacFarlane	e35147d715	FB2 reader test: filter CRs. This may help with the test failure on GitHub CI. `b59e6d0376/checks`	2019-09-13 16:50:00 -07:00
John MacFarlane	88dc6fac5d	Add --shift-heading-level-by option. Deprecate --base-heading-level. The new option does everything the old one does, but also allows negative shifts. It also promotes the document metadata (if not null) to a level-1 heading with a +1 shift, and demotes an initial level-1 heading to document metadata with a -1 shift. This supports converting documents that use an initial level-1 heading for the document title. Closes #5615.	2019-09-10 23:16:13 -07:00
John MacFarlane	4778d03473	LaTeX reader: Fix parsing of optional arguments that contain braced text. Closes #5740.	2019-09-09 21:33:16 -07:00
Brian Leung	0558ea9836	Org reader: modify handling of example blocks. (#5717 ) * Org reader: allow the `-i` switch to ignore leading spaces. * Org reader: handle awkwardly-aligned code blocks within lists. Code blocks in Org lists must have their #+BEGIN_ aligned in a reasonable way, but their other components can be positioned otherwise.	2019-09-08 22:34:10 -07:00
John MacFarlane	9f984ff26a	Replace Element and makeHierarchical with makeSections. Text.Pandoc.Shared: + Remove `Element` type [API change] + Remove `makeHierarchicalize` [API change] + Add `makeSections` [API change] + Export `deLink` [API change] Now that we have Divs, we can use them to represent the structure of sections, and we don't need a special Element type. `makeSections` reorganizes a block list, adding Divs with class `section` around sections, and adding numbering if needed. This change also fixes some longstanding issues recognizing section structure when the document contains Divs. Closes #3057, see also #997. All writers have been changed to use `makeSections`. Note that in the process we have reverted the change `c1d058aeb1` made in response to #5168, which I'm not completely sure was a good idea. Lua modules have also been adjusted accordingly. Existing lua filters that use `hierarchicalize` will need to be rewritten to use `make_sections`.	2019-09-08 22:20:19 -07:00
John MacFarlane	1ccff3339d	Revert changes to hierarchicalizeWithIds. Revert "hierarchicalize: ensure that sections get ids..." This reverts commit `212406a61d`. Revert "Improve detection of headings in Divs by hierarchicalize." This reverts commit `6e2cfd6c97`. Revert "Shared.hierarchicalize: improve handling of div and section structure." This reverts commit `345b33762e`.	2019-09-08 21:56:42 -07:00
John MacFarlane	212406a61d	hierarchicalize: ensure that sections get ids... even if they're in divs. Improves #3057.	2019-09-06 09:05:52 -07:00
John MacFarlane	6e2cfd6c97	Improve detection of headings in Divs by hierarchicalize. The structure ``` <h1>one</h1> <div> <h1>two</h1> </div> ``` should create two coordinate sections, not a section with a subsection. Now it does. Extends #3057.	2019-09-06 08:44:59 -07:00
John MacFarlane	345b33762e	Shared.hierarchicalize: improve handling of div and section structure. Previously Divs were opaque to hierarchicalize, so headings inside divs didn't get into the table of contents, for example (#3057). Now hierarchicalize treats Divs as sections when appropriate. For example, these structures both yield a section and a subsection: ``` html <div> <h1>one</h1> <div> <h2>two</h2> </div> </div> ``` ``` html <div> <h1>one</h1> <div> <h1>two</h1> </div> </div> ``` Note that ``` html <h1>one</h1> <div> <h2>two</h2> </div> <h1>three</h1> ``` gets parsed as the structure one two three which may not always be desirable. Closes #3057.	2019-09-05 22:37:13 -07:00
John MacFarlane	381654a704	Add div.hanging-indent CSS to HTML templates.	2019-09-05 12:42:23 -07:00
John MacFarlane	bb362fd76c	Add partial styles.html in HTML5 template. Avoid duplication in HTML templates by using styles.html partial. Change indentation of styles in template.	2019-09-05 12:39:50 -07:00
John MacFarlane	0e31483d43	asciidoc writer: don't include `+` in code blocks for regular asciidoc. This is asciidoctor-specific. Amends `98ee6ca289`.	2019-09-04 14:57:22 -07:00
John MacFarlane	e4cca4cf67	Roff readers: better parsing of groups. We now allow groups where the closing `\\}` isn't at the beginning of a line. Closes #5410.	2019-09-04 09:24:42 -07:00
John MacFarlane	513058a24e	XML: change toEntities to emit numerical hex character references. Previously decimal references were used. But Polyglot Markup prefers hex. See #5718. This affects the output of pandoc with `--ascii`.	2019-09-03 11:28:20 -07:00
John MacFarlane	6b286a1d74	LaTeX reader: don't try to parse includes if raw_tex is set. When the `raw_tex` extension is set, we just carry through `\usepackage`, `\input`, etc. verbatim as raw LaTeX. Closes #5673.	2019-09-02 21:03:05 -07:00
John MacFarlane	d79242796b	HTML writer: use numeric character references with `--ascii`. Previously we used named character references with html5 output. But these aren't valid XML, and we aim to produce html5 that is also valid XHTML (polyglot markup). (This is also needed for epub3.) Closes #5718.	2019-09-02 20:36:57 -07:00
John MacFarlane	5e708eb8ce	LaTeX reader: properly handle optional arguments for macros. Closes #5682.	2019-09-02 18:48:37 -07:00
John MacFarlane	fba1296fd1	LaTeX reader: fix `\\` in `\parbox` inside a table cell. Closes #5711.	2019-08-27 10:48:02 -07:00
John MacFarlane	167fc4bc87	Markdown reader: Headers: don't parse content over newline boundary. Closes #5714.	2019-08-27 10:15:00 -07:00
Jesse Rosenthal	4a7dad18b1	PowerPoint writer: Start numbering at appopriate numbers. Starting numbers for ordered lists were previously ignored. Now we specify the number if it is something other than 1. Closes: #5709	2019-08-27 01:24:41 -04:00
John MacFarlane	180f534d21	Add test for issue #5708 .	2019-08-26 15:20:22 -07:00
John MacFarlane	1ee6e0e087	Use new doctemplates, doclayout. + Remove Text.Pandoc.Pretty; use doclayout instead. [API change] + Text.Pandoc.Writers.Shared: remove metaToJSON, metaToJSON' [API change]. + Text.Pandoc.Writers.Shared: modify `addVariablesToContext`, `defField`, `setField`, `getField`, `resetField` to work with Context rather than JSON values. [API change] + Text.Pandoc.Writers.Shared: export new function `endsWithPlain` [API change]. + Use new templates and doclayout in writers. + Use Doc-based templates in all writers. + Adjust three tests for minor template rendering differences. + Added indentation to body in docbook4, docbook5 templates. The main impact of this change is better reflowing of content interpolated into templates. Previously, interpolated variables were rendered independently and intepolated as strings, which could lead to overly long lines. Now the templates interpolated as Doc values which may include breaking spaces, and reflowing occurs after template interpolation rather than before.	2019-08-25 14:24:31 -07:00
Owen McGrath	92debe4b9e	Change optMetadataFile type from Maybe to List (#5702 ) Changed optMetadataFile from `Maybe FilePath` to `[FilePath]`. This allows for multiple YAML metadata files to be added. The new default value has been changed from `Nothing` to `[]`. To account for this change in `Text.Pandoc.App`, `metaDataFromFile` now operates on two `mapM` calls (for `readFileLazy` and `yamlToMeta`) and a fold. Added a test (command/5700.md) which tests this functionality and updated MANUAL.txt, as per the contributing guidelines. With the current behavior, using `foldr1 (<>)`, values within files specified first will be used over those in later files. (If the reverse of this behavior would be preferred, it should be fixed by changing foldr1 to foldl1.)	2019-08-24 09:41:25 -07:00
John MacFarlane	9d581428f9	Add test for #5690 .	2019-08-23 10:15:42 -07:00
John MacFarlane	1c71bd1ff5	Ensure proper nesting when we have long ordered list markers. Closes #5705.	2019-08-23 09:16:54 -07:00
Albert Krewinkel	2712d3e869	Lua: traverse nested blocks and inlines in correct order Traversal methods are updated to use the new Walk module such that sequences with nested Inline (or Block) elements are traversed in the order in which they appear in the linearized document. Fixes: #5667	2019-08-16 20:52:15 +02:00
John MacFarlane	79a3449eeb	LaTeX reader: improve withRaw so it can handle cases where... the token string is modified by a parser (e.g. accent when it only takes part of a Word token). Closes #5686. Still not ideal, because we get the whole `\t0BAR` and not just `\t0` as a raw latex inline command. But I'm willing to let this be an edge case, since you can easily work around this by inserting a space, braces, or raw attribute. The important thing is that we no longer drop the rest of the document after a raw latex inline command that gobbles only part of a Word token!	2019-08-14 14:34:44 -07:00
John MacFarlane	eb23527121	Rename test for 5685 -> 5684 (typo in last commit). Closes #5684. (Note that #5685 is NOT closed by previous commit.)	2019-08-14 11:13:18 -07:00
John MacFarlane	0b2fb9b8f9	Add thin space when needed in LaTeX quote ligatures. Closes #5685.	2019-08-14 11:07:02 -07:00

1 2 3 4 5 ...

1182 commits