pandoc

Author	SHA1	Message	Date
Diego Balseiro	eda5540719	DOCX reader: Allow empty dates in comments and tracked changes (#6726 ) For security reasons, some legal firms delete the date from comments and tracked changes. * Make date optional (Maybe) in tracked changes and comments datatypes * Add tests	2020-10-06 21:03:00 -07:00
Christian Despres	cae155b095	Fix hlint suggestions, update hlint.yaml (#6680 ) * Fix hlint suggestions, update hlint.yaml Most suggestions were redundant brackets. Some required LambdaCase. The .hlint.yaml file had a small typo, and didn't ignore camelCase suggestions in certain modules.	2020-09-13 07:48:14 -07:00
Joseph C. Sible	7233a7a932	More cleanup (#6209 ) * Simplify by collapsing a do block into a single <$> * Remove an unnecessary variable: `all` takes any Foldable, so only blocksToInlines needs toList.	2020-03-28 22:48:47 -07:00
Albert Krewinkel	11b5f1e40b	Update copyright year (#6186 ) * Update copyright year * Copyright: add notes for Lua and Jira modules	2020-03-13 09:52:47 -07:00
Joseph C. Sible	12c75701be	Use <$> instead of >>= and return (#6128 )	2020-02-08 09:12:01 -08:00
John MacFarlane	4c3db9273f	Apply linter suggestions. Add fix_spacing to lint target in Makefile.	2020-02-07 09:08:22 -08:00
despresc	90e436d496	Switch to new pandoc-types and use Text instead of String [API change]. PR #5884. + Use pandoc-types 1.20 and texmath 0.12. + Text is now used instead of String, with a few exceptions. + In the MediaBag module, some of the types using Strings were switched to use FilePath instead (not Text). + In the Parsing module, new parsers `manyChar`, `many1Char`, `manyTillChar`, `many1TillChar`, `many1Till`, `manyUntil`, `mantyUntilChar` have been added: these are like their unsuffixed counterparts but pack some or all of their output. + `glob` in Text.Pandoc.Class still takes String since it seems to be intended as an interface to Glob, which uses strings. It seems to be used only once in the package, in the EPUB writer, so that is not hard to change.	2019-11-12 16:03:45 -08:00
John MacFarlane	530bfe5f5a	Docx reader: fix list number resumption for sublists. Closes #4324 . The first list item of a sublist should not resume numbering from the number of the last sublist item of the same level, if that sublist was a sublist of a different list item. That is, we should not get: ``` 1. one 1. sub one 2. sub two 2. two 3. sub one ```	2019-11-03 12:54:42 -08:00
Nikolay Yakimov	c113ca6717	[Docx Reader] Use style names, not ids, for assigning semantic meaning Motivating issues: #5523, #5052, #5074 Style name comparisons are case-insensitive, since those are case-insensitive in Word. w:styleId will be used as style name if w:name is missing (this should only happen for malformed docx and is kept as a fallback to avoid failing altogether on malformed documents) Block quote detection code moved from Docx.Parser to Readers.Docx Code styles, i.e. "Source Code" and "Verbatim Char" now honor style inheritance Docx Reader now honours "Compact" style (used in Pandoc-generated docx). The side-effect is that "Compact" style no longer shows up in docx+styles output. Styles inherited from "Compact" will still show up. Removed obsolete list-item style from divsToKeep. That didn't really do anything for a while now. Add newtypes to differentiate between style names, ids, and different style types (that is, paragraph and character styles) Since docx style names can have spaces in them, and pandoc-markdown classes can't, anywhere when style name is used as a class name, spaces are replaced with ASCII dashes `-`. Get rid of extraneous intermediate types, carrying styleId information. Instead, styleId is saved with other style data. Use RunStyle for inline style definitions only (lacking styleId and styleName); for Character Styles use CharStyle type (which is basicaly RunStyle with styleId and StyleName bolted onto it).	2019-09-21 11:18:15 -07:00
John MacFarlane	b35fae6511	Use doctemplates 0.3, change type of writerTemplate. * Require recent doctemplates. It is more flexible and supports partials. * Changed type of writerTemplate to Maybe Template instead of Maybe String. * Remove code from the LaTeX, Docbook, and JATS writers that looked in the template for strings to determine whether it is a book or an article, or whether csquotes is used. This was always kludgy and unreliable. To use csquotes for LaTeX, set `csquotes` in your variables or metadata. It is no longer sufficient to put `\usepackage{csquotes}` in your template or header includes. To specify a book style, use the `documentclass` variable or `--top-level-division`. * Change template code to use new API for doctemplates.	2019-07-28 19:25:45 -07:00
Jesse Rosenthal	9a1a3fe482	Docx reader: add tests for trimming last inline.	2019-02-18 15:49:00 -05:00
Jesse Rosenthal	332e2ba5b6	Docx reader: Add test for reading sdts in footnotes.	2019-02-12 17:26:37 -05:00
Jesse Rosenthal	1847bdbb83	Docx reader: Tests for alternate document.xml	2019-02-06 21:14:46 -05:00
Albert Krewinkel	37a82b0b11	Add missing copyright notices and remove license boilerplate (#5112 ) Quite a few modules were missing copyright notices. This commit adds copyright notices everywhere via haddock module headers. The old license boilerplate comment is redundant with this and has been removed. Update copyright years to 2019. Closes #4592.	2019-02-04 13:52:31 -08:00
Jesse Rosenthal	0f736d778f	Docx: add test for lists with level overrides.	2018-12-10 19:24:56 -05:00
Jesse Rosenthal	c5d8fab058	Docx reader tests: Test for combining adjacent code blocks.	2018-04-17 09:29:54 -04:00
John MacFarlane	7e389cb3db	Use NoImplicitPrelude and explicitly import Prelude. This seems to be necessary if we are to use our custom Prelude with ghci. Closes #4464.	2018-03-18 10:46:28 -07:00
Jesse Rosenthal	85a65c6a51	Docx reader: add tests for nested smart tags.	2018-03-13 22:16:54 -04:00
Jesse Rosenthal	7d3e7a5a6d	Docx reader: Handle nested sdt tags. Previously we had only unwrapped one level of sdt tags. Now we recurse if we find them. Closes: #4415	2018-02-28 16:32:20 -05:00
Jesse Rosenthal	ffcecfacb1	Docx reader tests: test custom style extension.	2018-02-22 13:05:44 -05:00
danse	e6ff7f7986	Docx reader: Pick table width from the longest row or header This change is intended to preserve as much of the table content as possible Closes #4360	2018-02-15 15:06:01 -05:00
John MacFarlane	b8ffd834cf	hlint code improvements.	2018-01-19 21:25:24 -08:00
Jesse Rosenthal	004f60bf26	Docx reader: Add test for hyperlinks in instrText tag This is difficult to recreate with a modern version of Word, so I'm using the file submitted with the bug report. It would be preferable to find a smaller example with Latin characters, though, so as not to confuse the issue being tested.	2018-01-16 13:22:02 -05:00
Jesse Rosenthal	a5b71a3c7f	Docx reader: Add tests for paragraph insertion/deletion.	2018-01-02 11:32:48 -05:00
Jesse Rosenthal	3f30455b49	Docx reader: tests for overlapping targets (anchor spans).	2017-12-31 09:36:42 -05:00
Jesse Rosenthal	475b0dcb66	Docx reader: tests for removing unused anchors.	2017-12-30 22:43:33 -05:00
Jesse Rosenthal	d71165c8e2	Docx reader: add tests for structured document tags unwrapping.	2017-12-27 10:03:00 -05:00
Jesse Rosenthal	440533643e	Docx writer: Add tests for list continuation.	2017-12-13 15:16:44 -05:00
John MacFarlane	ae60e0196c	Add `empty_paragraphs` extension. * Deprecate `--strip-empty-paragraphs` option. Instead we now use an `empty_paragraphs` extension that can be enabled on the reader or writer. By default, disabled. * Add `Ext_empty_paragraphs` constructor to `Extension`. * Revert "Docx reader: don't strip out empty paragraphs." This reverts commit `d6c58eb836`. * Implement `empty_paragraphs` extension in docx reader and writer, opendocument writer, html reader and writer. * Add tests for `empty_paragraphs` extension.	2017-12-04 14:56:57 -08:00
John MacFarlane	d6c58eb836	Docx reader: don't strip out empty paragraphs. We now have the `--strip-empty-paragraphs` option for that, if you want it. Closes #2252. Updated docx reader tests. We use stripEmptyParagraphs to avoid changing too many tests. We should add new tests for empty paragraphs.	2017-12-02 16:51:31 -08:00
John MacFarlane	ff16db1aa3	Automatic reformating by stylish-haskell.	2017-10-27 20:28:29 -07:00
John MacFarlane	a2a14f9029	Removed old adjacent_links test for docx reader. See #2270 for background -- this test blocked the consistent underline change and was hard to revise, so for now we are removing it.	2017-10-27 16:09:44 -07:00
Jesse Rosenthal	a67a96b932	Docx reader: Add tests for avoiding zero-level header.	2017-08-06 19:36:25 -07:00
John MacFarlane	fa719d0264	Switched Writer types to use Text. * XML.toEntities: changed type to Text -> Text. * Shared.tabFilter -- fixed so it strips out CRs as before. * Modified writers to take Text. * Updated tests, benchmarks, trypandoc. [API change] Closes #3731.	2017-06-11 00:46:31 +02:00
John MacFarlane	94b3dacb4e	Changed all readers to take Text instead of String. Readers: Renamed StringReader -> TextReader. Updated tests. API change.	2017-06-10 18:26:44 +02:00
John MacFarlane	6cb54c3def	Got rid of distracting warning in test output.	2017-03-14 21:06:14 +01:00
John MacFarlane	6ecc5b96a9	Use tasty for tests rather than test-framework.	2017-03-14 17:07:23 +01:00
John MacFarlane	e256c8ce17	Stylish-haskell automatic formatting changes.	2017-03-04 13:03:41 +01:00
John MacFarlane	76c55466d3	Use new warnings throughout the code base.	2017-02-11 00:14:44 +01:00
John MacFarlane	18ab864269	Moved tests/ -> test/.	2017-02-04 12:56:30 +01:00

40 commits