pandoc

Author	SHA1	Message	Date
Nikolay Yakimov	251ce0738d	LaTeX Reader: Test for `^^` character escapes	2015-04-13 03:22:39 +03:00
John MacFarlane	2d2e4c9ab2	Merge branch 'master' of https://github.com/rootzlevel/pandoc into rootzlevel-master Conflicts: src/Text/Pandoc/Readers/Org.hs	2015-03-28 21:09:38 -07:00
John MacFarlane	6a3a04c428	Merge branch 'errortype' of https://github.com/mpickering/pandoc into mpickering-errortype Conflicts: benchmark/benchmark-pandoc.hs src/Text/Pandoc/Readers/Markdown.hs src/Text/Pandoc/Readers/Org.hs src/Text/Pandoc/Readers/RST.hs tests/Tests/Readers/LaTeX.hs	2015-03-28 12:12:48 -07:00
John MacFarlane	619b2e8ca2	Merge pull request #1968 from lierdakil/issue1607 Fixes for multiple docx writer style bugs.	2015-03-16 12:02:40 -07:00
John MacFarlane	0deb7c507d	Merge pull request #1989 from zudov/shortcut_ref_link_pr Support shortcut reference links in markdown writer	2015-03-15 11:58:30 -07:00
Konstantin Zudov	b9f77ed03d	Support shortcut reference links in markdown writer Issue #1977 Most markdown processors support the [shortcut format] for reference links. Pandoc's markdown reader parsed this shortcuts unoptionally. Pandoc's markdown writer (with --reference-links option) never shortcutted links. This commit adds an extension `shortcut_reference_links`. The extension is enabled by default for those markdown flavors that support reading shortcut reference links, namely: - pandoc - strict pandoc - github flavoured - PHPmarkdown If extension is enabled, reader parses the shortcuts in the same way as it preveously did. Otherwise it would parse them as normal text. If extension is enabled, writer outputs shortcut reference links unless doing so would cause problems (see test cases in `tests/Tests/Writers/Markdown.hs`).	2015-03-10 20:32:24 +02:00
Craig S. Bosma	513221f822	Org reader: add support for smart punctuation	2015-03-09 07:11:53 -05:00
Mathias Schenner	12bf0ff3e5	LaTeX reader: allow non-empty colsep in tables The `tabular` environment allows non-empty column separators with the "@{...}" syntax. Previously, pandoc would fail to parse tables if a non-empty colsep was present. With this commit, these separators are still ignored, but the table gets parsed. A test case is included.	2015-03-08 15:47:39 +01:00
Mathias Schenner	1e3ef0e36f	LaTeX reader: allow valign argument in tables The `tabular` environment takes an optional parameter for vertical alignment. Previously, pandoc would fail to parse tables if this parameter was present. With this commit, the parameter is still ignored, but the table gets parsed. A test case is included.	2015-03-08 15:39:18 +01:00
Mathias Schenner	4f9a10619f	LaTeX reader: add some test cases for simple tables	2015-03-08 15:17:09 +01:00
Nikolay Yakimov	59c4d28d8c	Docx Writer: Tables test	2015-03-08 04:42:50 +03:00
Nikolay Yakimov	a82dedf1ff	Lists test	2015-03-08 03:59:48 +03:00
Nikolay Yakimov	ae07d5ed49	Initial tests for writer	2015-03-03 14:37:02 +03:00
Hans-Peter Deifel	5871955169	Org reader: Add test for image links Tests for image links with non-image targets, as introduced in commit `2ca5101`.	2015-02-26 13:11:50 +01:00
Jesse Rosenthal	9654514e8a	Docx reader: add test for verbatim in sub/superscript.	2015-02-21 08:45:38 -05:00
Jesse Rosenthal	2995526772	Docx reader: Add tests for new list style parsing.	2015-02-19 00:24:04 -05:00
Matthew Pickering	1a7a99161a	Update tests	2015-02-18 21:09:07 +00:00
Matthias C. M. Troffaes	dccd408a9c	Allow digit as first character of a citation key. * Update parser to recognize citation keys starting with a digit. * Update documentation accordingly. * Test case added. See https://github.com/jgm/pandoc-citeproc/issues/97	2015-02-18 15:30:17 +00:00
Jesse Rosenthal	616e211f36	Docx reader: test lists in table cells.	2015-02-13 09:08:07 -05:00
Jesse Rosenthal	e88119f2d1	Docx Reader: Add test for VML images. Since images are often visually (not structurally) placed on the page, people might not always get the results they're looking for here.	2015-01-21 13:41:16 -05:00
John MacFarlane	47c360e079	Improved texorpdfstring patch #1148 . * Make LaTeX reader recognize texorpdfstring. * Don't use texorpdfstring unless it's actually needed. * Fix tests.	2014-12-15 10:06:03 -08:00
John MacFarlane	544f3e5b45	Merge branch 'use-texorpdfstring' of https://github.com/wilx/pandoc into wilx-use-texorpdfstring Conflicts: src/Text/Pandoc/Writers/LaTeX.hs tests/Tests/Writers/LaTeX.hs	2014-12-15 10:01:50 -08:00
John MacFarlane	a864e9a348	Merge pull request #1805 from bergey/rst RST Reader - Improved Role Support	2014-12-15 09:06:45 -08:00
John MacFarlane	1d3ca088f2	Merge pull request #1813 from tarleb/file-links Org reader: properly handle links to `file:target`	2014-12-14 13:36:34 -08:00
Albert Krewinkel	4d85b17fc5	Org reader: properly handle links to `file:target` Org links like `[[file:target][title]]` were not handled correctly, parsing the link target verbatim. The org reader is changed such that the leading `file:` is dropped from the link target. This is related to issues #756 and #1812.	2014-12-14 21:30:10 +01:00
John MacFarlane	2b08e32a90	Fixe autolinks with following punctuation. Closes #1811. The price of this is that autolinked bare URIs can no longer contain `>` characters, but this is not a big issue.	2014-12-14 12:20:33 -08:00
Daniel Bergey	689fb112bf	RST Reader: compute Attrs when role is defined Move recursive role lookup from renderRole to addNewRole. The Attr value will be the same for every occurance of this role, so there's no reason to compute it every time. This allows simplifying the stateRstCustomRoles map considerably. We could go even further, and remove the fmt and attr arguments to renderRole, which are null except for custom roles.	2014-12-12 14:45:45 +00:00
Daniel Bergey	4e040160e0	WIP: tests for RST roles	2014-12-12 14:45:45 +00:00
Matthew Pickering	48e2586ec8	Merge pull request #1746 from shelf/dw-ext-images DokuWiki writer: fix external images	2014-12-08 23:55:36 +00:00
Daniel Bergey	74c1b547c2	parse RST class directives The class directive accepts one or more class names, and creates a Div value with those classes. If the directive has an indented body, the body is parsed as the children of the Div. If not, the first block folowing the directive is made a child of the Div. This differs from the behavior of rst2xml, which does not create a Div element. Instead, the specified classes are applied to each child of the directive. However, most Pandoc Block constructors to not take an Attr argument, so we can't duplicate this behavior.	2014-12-01 18:22:03 +00:00
Daniel Bergey	2cdfa5eb20	parse RST quoted literal blocks closes #65 RST quoted literal blocks are the same as indented literal blocks (which pandoc already supports) except that the quote character is preserved in each line. This includes test cases for the quoted literal block, as well as additional tests for line blocks and indented literal blocks, to verify that these are unaffected by the changes.	2014-12-01 18:22:03 +00:00
John MacFarlane	46d343f474	Fixed bug in org with bulleted lists: - a - b * c was being parsed as a list, even though an unindented `*` should make a heading. See <http://orgmode.org/manual/Plain-lists.html#fn-1>.	2014-11-13 23:40:18 -08:00
John MacFarlane	43c1978fae	Merge pull request #1645 from neongreen/issue1636 Fix 'Ext_lists_without_preceding_blankline' bug.	2014-11-12 09:05:29 -08:00
Timothy Humphries	98161afa1a	DokuWiki writer: add external_images test Add test for #1739.	2014-11-09 02:18:58 -05:00
Albert Krewinkel	e6cd8c9077	Org reader: allow empty links for gitit interop While empty links are not allowed in Emacs org-mode, Pandoc org-mode should support them: gitit relies on empty links as they are used to create wiki links. Fixes jgm/gitit#471	2014-11-05 23:15:28 +01:00
Albert Krewinkel	daaf635806	Org reader: absolute, relative paths in links The org reader was to restrictive when parsing links, some relative links and links to files given as absolute paths were not recognized correctly. The org reader's link parsing function was amended to handle such cases properly. This fixes #1741	2014-11-05 22:27:25 +01:00
Alexander Sulfrian	79f25fb9ce	TWiki Reader: add basic syntax test	2014-10-30 20:02:05 +01:00
Jesse Rosenthal	60846471a3	Docx test: Remove Danish header test. Redundant, now that we're testing for a more generalized sort of internationalized blocks.	2014-10-25 16:02:31 -04:00
Jesse Rosenthal	c0ddcb359e	Docx reader: add tests for i18n headers. This tests blockquotes and headers in Russian. Previous tests make sure that this doesn't produce a regression in en-us Header and Blockquotes.	2014-10-25 16:00:27 -04:00
Albert Krewinkel	a5eb02f6a7	Org reader: parse LaTeX-style MathML entities Org supports special symbols which can be included using LaTeX syntax, but are actually MathML entities. Examples for this are `\nbsp` (non-breaking space), `\Aacute` (the letter A with accent acute) or `\copy` (the copyright sign ©). This fixes #1657.	2014-10-20 22:57:36 +02:00
John MacFarlane	84f6b1e41a	Merge pull request #1680 from shelf/master Respect indent when parsing Org bullet lists	2014-10-18 13:20:27 -07:00
John MacFarlane	31713d572a	Merge pull request #1700 from tarleb/org-emphasis-fix Org reader: fix rules for emphasis recognition	2014-10-18 13:19:42 -07:00
Albert Krewinkel	e3c36ed6ce	Org reader: Drop COMMENT document trees Document trees under a header starting with the word `COMMENT` are comment trees and should not be exported. Those trees are dropped silently. This closes #1678.	2014-10-18 22:11:53 +02:00
Albert Krewinkel	d571bec454	Org reader: fix rules for emphasis recognition Things like `/hello,/` or `/hi'/` were falsy recognized as emphasised strings. This is wrong, as `,` and `'` are forbidden border chars and may not occur on the inner border of emphasized text. This patch enables the reader to matches the reference implementation in that it reads the above strings as plain text.	2014-10-18 12:47:59 +02:00
Timothy Humphries	f1f56e8533	Fix indent issue for definition lists Tidy up fix for #1650, #1698 as per comments in #1680. Fix same issue for definition lists with the same method.	2014-10-17 20:06:25 -04:00
Timothy Humphries	4f4b0f031d	Respect indent when parsing Org bullet lists Fixes issue with top-level bullet list parsing. Previously we would use `many1 spaceChars` rather than respecting the list's indent level. We also permitted `*` bullets on unindented lists, which should unambiguously parse as `header 1`. Combined, this meant headers at a different indent level were being unwittingly slurped into preceding bullet lists, as per Issue #1650.	2014-10-12 03:18:36 -04:00
John MacFarlane	fe6d43b3e0	Merge pull request #1601 from jkr/windowsfix Fix path-slashes inside archive for windows	2014-09-27 16:21:17 -07:00
Matthew Pickering	fa2d11c954	Update tests for #1649	2014-09-27 22:40:25 +01:00
Artyom	bc115ffc2d	Fix 'Ext_lists_without_preceding_blankline' bug. * Fixes #1636. * Adds a test.	2014-09-26 13:32:08 +04:00
mpickering	c0b9ad4c5d	EPUB Tests: Seperating image testing from other features	2014-09-25 13:33:25 +01:00
mpickering	cc07d0c6bf	Shared: Make collapseFilePath OS-agnostic	2014-09-25 12:42:53 +01:00
Jesse Rosenthal	8d22bf26ab	LaTeX writer: Test for protecting images in header.	2014-09-09 11:05:47 -04:00
Jesse Rosenthal	f56e0e958a	Docx reader: Add test for polyglot headers. Only Danish at the moment.	2014-09-05 22:07:06 -04:00
Jesse Rosenthal	313355e373	Org reader: Update Tests Test for markup after blank line.	2014-09-04 19:55:53 -04:00
Jesse Rosenthal	08359c44e4	Docx Reader: Add tests for numbered headers.	2014-09-04 19:39:49 -04:00
Jesse Rosenthal	a6eead7f26	Docx reader: Modify mediabag test accordingly.	2014-09-02 14:05:54 -04:00
Jose Luis Duran	9557eb6f8e	LaTeX writer: Use a declaration for tight lists Currently, pandoc has hard-coded the following in order to make tight lists in LaTeX: ```hs text "\\itemsep1pt\\parskip0pt\\parsep0pt" ``` Which is fine, but does not allow customizations. For example, the `memoir` class already has a `\tightlist` declaration for this purpose: ```tex \newcommand{\tightlist}{% \setlength{\itemsep}{0pt}\setlength{\parskip}{0pt}} ``` I'm proposing to use a similar solution: ```diff @@ In Writers/LaTeX.hs: -then text "\\itemsep1pt\\parskip0pt\\parsep0pt" +then text "\\tightlist" @@ In templates/default.latex: +\newcommand{\tightlist}{% + \setlength{\itemsep}{1pt}\setlength{\parskip}{0pt}\setlength{\parsep}{0pt}} ``` This allows us to customize the tightness to our needs. Backward Compatibility If a person is using a custom LaTeX template (not based upon the `memoir` class), the `\tightlist` declaration must be added.	2014-09-01 05:08:24 +00:00
John MacFarlane	3533218d6d	Merge pull request #1594 from jkr/itemFix Item fix	2014-08-31 19:31:38 -07:00
Jesse Rosenthal	dfc8ab5a6a	LaTeX writer: Add tests for header-in-item.	2014-08-31 16:05:20 -04:00
John MacFarlane	598d3ee23b	Markdown reader: better handling of paragraph in div. Previously text that ended a div would be parsed as Plain unless there was a blank line before the closing div tag. Test case: <div class="first"> This is a paragraph. This is another paragraph. </div> Closes #1591.	2014-08-31 12:55:47 -07:00
Jesse Rosenthal	4eb9769a4c	Dokuwiki writer: Add a test for multiblock table cells. We have to add a new file, because the original table tests don't look for this.	2014-08-30 15:34:40 -04:00
mpickering	2cd049a1bf	Txt2Tags reader: Header is now parsed only if standalone flag is set	2014-08-20 18:11:37 +01:00
Jesse Rosenthal	180f5cbe63	Docx reader: Test for character styles.	2014-08-16 14:05:56 -04:00
John MacFarlane	76d14bcc11	Old tests: better path for test program.	2014-08-13 12:20:25 -07:00
John MacFarlane	40e67b8737	Revised tests directory. Renamed some tests, introducing subsidiary directories for fb2, docx, epub. Cleaned up tests in cabal file. Combined dokuwiki-writer and dokuwiki_inline_formatting tests.	2014-08-13 11:16:50 -07:00
John MacFarlane	1d6e1cf9f3	Removed special testHook from Setup. This was just too fragile and dependent on a changing Cabal API (see #1526). Instead of passing the bulid directory to the test program, we now let the test program find itself (using executable-path) and then find the pandoc executable relative to itself.	2014-08-13 08:12:07 -07:00
Matthew Pickering	063ba81622	EPUB Tests: Added wasteland test This epub contains many epub:type elements including footnotes and titlepage.	2014-08-13 00:25:18 +01:00
Jesse Rosenthal	0808449547	Docx: Add dropcap tests.	2014-08-11 23:10:50 -04:00
Matthew Pickering	f33ae631f3	Improved EPUB Tests Rewrote features test to remove all unimplemented features. There are now all three examples of where an image can be included in the test. 1. Cover image 2. As a spine elemnt 3. In the document Tests have also been added to make sure that the mediabag contains all these images after processing.	2014-08-10 14:58:53 +01:00
Matthew Pickering	5b5e53024d	Added tests for collapseFilePath	2014-08-08 22:31:02 +01:00
John MacFarlane	10b662c120	EPUB test renaming. Renamed epub test files so they're identified more clearly as epub: features.{epub,native} -> epub.features.{epub,native}, and similarly with formatting.{epub,native}. Added epub test files to cabal file, so they'll be included in the tarball.	2014-08-07 22:25:06 -07:00
Jesse Rosenthal	98d14b2b2a	Docx reader: Test inline image code.	2014-08-07 15:34:49 -04:00
John MacFarlane	4630cff2a6	Merge branch 'epubend' of https://github.com/mpickering/pandoc into mpickering-epubend Conflicts: pandoc.cabal	2014-08-04 07:36:18 -07:00
Artyom Kazak	ec88d47f23	Correctly implement capitalisation. Using `map toUpper` to capitalise text is wrong, as e.g. “Straße” should be converted to “STRASSE”, which is 1 character longer. This commit adds a `capitalize` function and replaces 2 identical implementations in different modules (`toCaps` and `capitalize`) with it.	2014-08-03 17:37:37 +04:00
Matthew Pickering	0ae2c1f146	EPUB Reader: Added tests	2014-07-31 21:39:50 +01:00
Jesse Rosenthal	ed71e9b31d	Docx tests: rewrite mediabag tests. This will allow us to test the whole mediabag (making sure, for example, that images are added with the correct keys) instead of just individual extracted images. We compare each entry in the media bag to an image extracted on the fly from the docx. As a result, we only need one file to test with. The image in the current tests was also replaced with a smaller one.	2014-07-31 15:47:45 -04:00
John MacFarlane	6dd2418476	New module, Text.Pandoc.MediaBag. Moved `MediaBag` definition and functions from Shared: `lookupMedia`, `mediaDirectory`, `insertMedia`, `extractMediaBag`. Removed `emptyMediaBag`; use `mempty` instead, since `MediaBag` is a Monoid.	2014-07-31 12:00:21 -07:00
John MacFarlane	00662faefb	Made MediaBag a newtype, and added mime type information to media. Shared now exports functions for interacting with a MediaBag: - `emptyMediaBag` - `lookuMedia` - `insertMedia` - `mediaDirectory` - `extractMediaBag`	2014-07-31 11:05:35 -07:00
Jesse Rosenthal	4d1d8a4b6f	Docx test: Test image from media bag.	2014-07-30 22:32:55 -04:00
Jesse Rosenthal	16f88edb3b	Docx tests: Added media test comparison function. Also tell pandoc.cabal that we'll be needing base64, since we want to compare strings here.	2014-07-30 22:31:38 -04:00
Jesse Rosenthal	941df1b0de	Docx reader: change tests to make use of media bag.	2014-07-30 12:46:53 -04:00
John MacFarlane	8c2ed54e2e	LaTeX writer: use $..$ instead of $..$ for inline math. Closes #1464.	2014-07-29 20:45:49 -07:00
Jesse Rosenthal	54708da371	Add and update docx tests in pandoc.cabal.	2014-07-29 13:05:19 -04:00
Jesse Rosenthal	840108a9c1	Docx reader: Make metavalues out of styled paragraphs. This will make paragraphs styled with `Author`, `Title`, `Subtitle`, `Date`, and `Abstract` into pandoc metavalues, rather than text. The implementation only takes those elements from the beginning of the document (ignoring empty paragraphs). Multiple paragraphs in the `Author` style will be made into a metaList, one paragraph per item. Hard linebreaks (shift-return) in the paragraph will be maintained, and can be used for institution, email, etc.	2014-07-29 13:03:01 -04:00
Matthew Pickering	e340a7da02	Txt2Tags Reader: Added tests	2014-07-27 00:12:57 +01:00
John MacFarlane	18f4490482	Fixed runtime error with compactify'DL on certain lists. Closes #1452. Added test.	2014-07-25 10:53:04 -07:00
John MacFarlane	4af8eed764	Markdown reader: revised definition list syntax (closes #1429 ). * This change brings pandoc's definition list syntax into alignment with that used in PHP markdown extra and multimarkdown (with the exception that pandoc is more flexible about the definition markers, allowing tildes as well as colons). * Lazily wrapped definitions are now allowed; blank space is required between list items; and the space before definition is used to determine whether it is a paragraph or a "plain" element. * For backwards compatibility, a new extension, `compact_definition_lists`, has been added that restores the behavior of pandoc 1.12.x, allowing tight definition lists with no blank space between items, and disallowing lazy wrapping.	2014-07-20 16:33:59 -07:00
John MacFarlane	87096c64f8	Org reader: text adjacent to a list yields a Plain, not Para. This gives better results for tight lists. Closes #1437. An alternative solution would be to use Para everywhere, and never Plain. I am not sufficiently familiar with org to know which is best. Thoughts, @tarleb?	2014-07-20 12:56:01 -07:00
John MacFarlane	0f01421f81	AsciiDoc writer: Double markers in intraword emphasis. Closes #1441.	2014-07-20 12:24:53 -07:00
Craig S. Bosma	1bb4f0c497	Org reader: Respect :exports header arguments on code blocks Adds support to the org reader for conditionally exporting either the code block, results block immediately following, both, or neither, depending on the value of the `:exports` header argument. If no such argument is supplied, the default org behavior (for most languages) of exporting code is used.	2014-07-17 10:23:22 -05:00
Jesse Rosenthal	643435f1de	Docx reader: Add test Test auto ident header anchors with pandoc-generated pandoc.	2014-07-15 18:32:19 +01:00
John MacFarlane	ff86702a95	Added failing test for issue #1121 .	2014-07-10 14:23:20 -07:00
John MacFarlane	d1ac594d4a	Added test for issue #1330 .	2014-07-07 22:27:28 -06:00
John MacFarlane	f96a2b91f5	Reorganized some markdown tests.	2014-07-07 22:21:04 -06:00
John MacFarlane	616cf6c539	Merge branch 'dokuwiki' of https://github.com/claremacrae/pandoc into claremacrae-dokuwiki	2014-07-07 16:15:35 -06:00
John MacFarlane	e4263d306e	Revamped raw HTML block parsing in markdown. - We no longer include trailing spaces and newlines in the raw blocks. - We look for closing tags for elements (but without backtracking). - Each block-level tag is its own RawBlock; we no longer try to consolidate them (though `--normalize` will do so). Closes #1330.	2014-07-07 15:53:59 -06:00
Clare Macrae	7647d87657	DokuWiki writer: Add new test showing that span swallows content.	2014-07-02 22:26:11 +01:00
Clare Macrae	3cb76d9560	Merge branch 'master' of git://github.com/jgm/pandoc into dokuwiki	2014-07-01 22:10:08 +01:00
John MacFarlane	3fbbafd391	Rewrote normalize for efficiency. (Closes #1385.) * Added normalizeInlines, normalizeBlocks. * Type signature is now more narrow, `Pandoc -> Pandoc` instead of `Data a :: a -> a`. Some users may need to change their uses of `normalize` to the newly exported `normalizeInlines` or `normalizeBlocks`.	2014-06-29 23:05:08 -07:00
Jesse Rosenthal	1405e7b709	Docx reader: Add tests for hanging indent handline. We want to treat it as a plain paragraph if the hanging amount is greater to or equal to the left indent---i.e., if the first line has zero indentation. But we still want it to be a block quote if it starts to the right of the margin. Someone might format verse with wrapping lines with a hanging indent, for example.	2014-06-29 23:37:00 -04:00
Clare Macrae	717e16660d	Merge remote-tracking branch 'jgm/master' into dokuwiki	2014-06-29 19:22:31 +01:00
Jesse Rosenthal	afdc0af779	Track changes tests.	2014-06-25 16:13:59 -04:00
Jesse Rosenthal	a2b6ab847c	Docx reader: Add tests for basic track changes This is what seems like the sensible default: read in insertions, and ignore deletions. In the future, it would be good if options were available for either taking in deletions or keeping both in some scriptable format.	2014-06-25 11:09:28 -04:00
Jesse Rosenthal	2621482d69	Docx Reader: add failing defintion list tests.	2014-06-24 12:11:57 -04:00
Jesse Rosenthal	21295c5ab5	Docx reader: add failing tests for inline code and code blocks.	2014-06-24 10:33:49 -04:00
John MacFarlane	ac6756009f	Merge pull request #1366 from jkr/reducible3 Docx rewrite and cleanup (in terms of Reducible typeclass)	2014-06-23 14:33:38 -07:00
Jesse Rosenthal	9b954fa855	Add test for correctly trimming spaces in formatting. This used to be fixed in the tree-walking. We need to make sure we're doing it right now.	2014-06-23 17:08:26 -04:00
John MacFarlane	87ab01637e	LaTeX writer: Use `\textquotesingle` for `'` in inline code. Otherwise we get curly quotes in the PDF output. Closes #1364.	2014-06-23 12:51:10 -07:00
Jesse Rosenthal	ed43513087	Docx reader tests: add tests for normalization deep in blocks.	2014-06-22 01:58:41 -04:00
Jesse Rosenthal	ca4add679c	Add normalization test. Add torture-test for new normalization functions. One problem that this test demonstrates is that word has a tendency to turn off formatting at a space, and then turn it back on after. I'm not sure yet whether this is something we should fix.	2014-06-22 00:46:19 -04:00
Jesse Rosenthal	a4508d7fcf	Docx reader tests: Introduce NoNormPandoc type. This is just a wrapper around Pandoc that doesn't normalize with `toString`. We want to make sure that our own normalization process works. If, in the future, we are able to hook into the builder's normalization, this will be removed.	2014-06-20 18:37:52 -04:00
John MacFarlane	12efffa85a	LaTeX writer: Fixed strikeout + highlighted code. Closes #1294 . Previously strikeout highlighted code caused an error.	2014-06-20 10:24:30 -07:00
Jesse Rosenthal	da0d1d27ac	Add tabs tests.	2014-06-19 19:33:22 -04:00
Jesse Rosenthal	ceb742b124	Add ReaderOptions to the docx tests This will allow for testing different media embedding (in addition to any other applicable options.)	2014-06-19 12:16:53 -04:00
John MacFarlane	cf15b929f8	Added haddock writer tests.	2014-06-18 17:55:21 -07:00
John MacFarlane	bbe99003f8	Naming: Use Docx instead of DocX. For consistency with the existing writer.	2014-06-16 22:44:40 -07:00
John MacFarlane	bec9f3c641	Merge branch 'docx' of https://github.com/jkr/pandoc into jkr-docx	2014-06-16 22:16:45 -07:00
John MacFarlane	78ee2416d1	Org reader: make tildes create inline code. Closes #1345. Also relabeled 'code' and 'verbatim' parsers to accord with the org-mode manual. I'm not sure what the distinction between code and verbatim is supposed to be, but I'm pretty sure both should be represented as Code inlines in pandoc. The previous behavior resulted in the text not appearing in any output format.	2014-06-16 22:03:26 -07:00
Jesse Rosenthal	f928e4c8dc	Add DocX automated tests. Note this makes use of input and output files in the tests/ dir.	2014-06-16 07:18:40 -04:00
Albert Krewinkel	3238a2f919	Org reader: support for inline LaTeX Inline LaTeX is now accepted and parsed by the org-mode reader. Both, math symbols (like \tau) and LaTeX commands (like \cite{Coffee}), can be used without any further escaping.	2014-05-20 22:29:21 +02:00
Albert Krewinkel	ceeb701c25	Org reader: support Pandocs citation extension Citations are defined via the "normal citation" syntax used in markdown, with the sole difference that newlines are not allowed between "[...]". This is for consistency, as org-mode generally disallows newlines between square brackets. The extension is turned on by default and can be turned off via the default syntax-extension mechanism, i.e. by specifying "org-citation" as the input format. Move `citeKey` from Readers.Markdown into Parsing The function can be used by other readers, so it is made accessible for all parsers.	2014-05-14 15:00:26 +02:00
Albert Krewinkel	c5fd631b55	Org reader: Fix block parameter reader, relax constraints The reader produced wrong results for block containing non-letter chars in their parameter arguments. This patch relaxes constraints in that it allows block header arguments to contain any non-space character (except for ']' for inline blocks). Thanks to Xiao Hanyu for noticing this.	2014-05-10 11:35:54 +02:00
Albert Krewinkel	07694b3018	Org reader: Fix parsing of blank lines within blocks Blank lines were parsed as two newlines instead of just one. Thanks to Xiao Hanyu (@xiaohanyu) for pointing this out.	2014-05-09 18:23:23 +02:00
Albert Krewinkel	757c4f68f3	Org reader: Support arguments for code blocks The general form of source block headers (`#+BEGIN_SRC <language> <switches> <header arguments>`) was not recognized by the reader. This patch adds support for the above form, adds header arguments to the block's key-value pairs and marks the block as a rundoc block if header arguments are present. This closes #1286.	2014-05-09 18:08:30 +02:00
Albert Krewinkel	71bd4fb2b3	Org reader: Read inline code blocks Org's inline code blocks take forms like `src_haskell(print "hi")` and are frequently used to include results from computations called from within the document. The blocks are read as inline code and marked with the special class `rundoc-block`. Proper handling and execution of these blocks is the subject of a separate library, rundoc, which is work in progress. This closes #1278.	2014-05-06 13:21:26 +02:00
John MacFarlane	1e50424892	Added test for #1154 .	2014-05-04 08:19:48 -07:00
John MacFarlane	6b532c2131	Added Tests.Writer.AsciiDoc to repository.	2014-05-03 22:33:36 -07:00
Neil Mayhew	ccbf4fc9c2	Distinguish tight and loose lists in Docbook output Determined by the first block of the first item being Plain.	2014-05-03 18:37:02 -07:00
John MacFarlane	b306405caa	Merge pull request #1272 from tarleb/link-types Org reader: add support for custom link types	2014-05-01 08:44:05 -07:00
Albert Krewinkel	8726eebcd3	Org reader: Add support for custom link types Org allows users to define their own custom link types. E.g., in a document with a lot of links to Wikipedia articles, one can define a custom wikipedia link-type via #+LINK: wp https://en.wikipedia.org/wiki/ This allows to write [[wp:Org_mode][Org-mode]] instead of the equivallent [[https://en.wikipedia.org/wiki/Org_mode][Org-mode]].	2014-05-01 11:50:32 +02:00
John MacFarlane	f8a34f1694	Added Cite to Arbitrary instance. See #1269. This reveals some test failures.	2014-04-29 18:32:42 -07:00
John MacFarlane	b6ae5d5e99	ADded SmallCaps to Arbitrary instance.	2014-04-29 18:14:39 -07:00
Albert Krewinkel	2eec20d92f	Org reader: Enable internal links Internal links in Org are possible by using an anchor-name as the target of a link: [[some-anchor][This]] is an internal link. It links <<some-anchor>> here.	2014-04-25 15:29:28 +02:00
Albert Krewinkel	c128daba9d	Org reader: Recognize plain and angle links This adds support for plain links (like http://zeitlens.com) and angle links (like <http://moltkeplatz.de>).	2014-04-24 17:55:24 +02:00
Albert Krewinkel	8276449520	Org reader: Allow for compact definition lists Use `Text.Pandoc.Shared.compactify'DL` to allow for compact definition lists.	2014-04-19 15:13:16 +02:00
Albert Krewinkel	8e91d362a3	Org reader: Fix parsing of footnotes Footnotes can consist of multiple blocks and end only at a header or at the beginning of another footnote. This fixes the previous behavior, which restricted notes to a single paragraph.	2014-04-19 14:40:46 +02:00
Albert Krewinkel	6ded3d41d9	Org reader: Apply captions to code blocks and tables The `Table` blocktype already takes the caption as an argument, while code blocks are wrapped in a `Div` block together with a labelling `Span`.	2014-04-19 10:41:45 +02:00
Albert Krewinkel	09441b65a8	Org reader: Add support for plain LaTeX fragments This adds support for LaTeX fragments like the following: ``` \begin{equation} \int fg \mathrm{d}x \end{equation} ```	2014-04-18 10:22:54 +02:00
Albert Krewinkel	f19d7233d8	Org reader: Fix parsing of loose lists Loose lists (i.e. lists with blankline separated items), were parsed as multiple lists, each containing a single item. This patch fixes this issue.	2014-04-18 08:34:06 +02:00
Albert Krewinkel	6d6724cf2c	Org reader: Support more types of '#+BEGIN_<type>' blocks Support for standard org-blocks is improved. The parser now handles "HTML", "LATEX", "ASCII", "EXAMPLE", "QUOTE" and "VERSE" blocks in a sensible fashion.	2014-04-17 18:33:39 +02:00
Albert Krewinkel	0672f58a44	Org reader: Support footnotes	2014-04-17 13:23:14 +02:00
John MacFarlane	857fcff7d6	Merge pull request #1240 from neilmayhew/master Docbook output of Line Blocks	2014-04-13 14:37:28 -07:00
John MacFarlane	86b4da9dec	Merge pull request #1239 from tarleb/org-linebreak Org linebreaks	2014-04-13 14:04:48 -07:00
Neil Mayhew	f22ce4ff28	Add some unit tests for Writers.Docbook These are primarily aimed at testing the new treatment of line breaks, but hopefully other tests can be added more easily now as features and changes are implemented in the writer. Adapted from Tests.Writers.HTML.tests.	2014-04-12 09:18:09 -06:00
Albert Krewinkel	82d4160bdc	Org reader: Read linebreaks Linebreaks are marked by the string `\\` at the end of a line.	2014-04-12 11:07:38 +02:00
Albert Krewinkel	ae4280fba5	Org reader: Add support for figures Support for figures (images with name and caption) is added.	2014-04-12 10:31:45 +02:00
Albert Krewinkel	6f19be7d40	Org reader: Fix parsing of sub-/superscript expressions This fixes the org-reader's handling of sub- and superscript expressions. Simple expressions (like `2^+10`), expressions in parentheses (`a_(n+1)`) and nested sexp (like `a_(nested()parens)`) are now read correctly.	2014-04-11 11:05:42 +02:00
Albert Krewinkel	1715d7cee0	Org reader: Support more inline/display math variants Support all of the following variants as valid ways to define inline or display math inlines: - `\[..\]` (display) - `$$..$$` (display) - `$..$` (inline) - `$..$` (inline) This closes #1223. Again.	2014-04-10 15:32:02 +02:00
Albert Krewinkel	030020236c	Org reader: Precise rules for the recognition of markup The inline parsers have been rewritten using the org source code as a reference. This fixes a couple of bugs related to erroneous markup recognition.	2014-04-09 15:26:06 +02:00
Albert Krewinkel	c47bd8404f	Org reader: Support inline math (like $E=mc^2$) Closes #1223.	2014-04-07 11:47:36 +02:00
Albert Krewinkel	480b33b710	Org reader: Add support for definition lists	2014-04-06 20:39:10 +02:00
Albert Krewinkel	652c781e37	Org reader: Support inline images	2014-04-05 16:15:53 +02:00
Albert Krewinkel	fd98532784	Org reader: Fix parsing of nested inlines Text such as /this/ was not correctly parsed as a strong, emphasised word. This was due to the end-of-word recognition being to strict as it did not accept markup chars as part of a word. The fix involves an additional parser state field, listing the markup chars which might be parsed as part of a word.	2014-04-05 16:14:40 +02:00
John MacFarlane	ae86e24ff6	Merge branch 'master' of https://github.com/mb21/pandoc into mb21-master	2014-03-04 10:15:43 -08:00
Albert Krewinkel	24b2ac43b0	Add a simple Emacs Org-mode reader The basic structure of org-mode documents is recognized; however, org-mode features like todo markers, tags etc. are not supported yet.	2014-03-04 10:40:40 +01:00
mb21	80511f1b34	InDesign ICML Writer	2014-02-28 13:35:35 +01:00
Vaclav Zeman	08d80809c2	Update tests suite to expect \texorpdfstring.	2014-02-13 00:03:54 +01:00
Henry de Valence	f6d151889c	HLint: redundant parens Remove parens enclosing a single element.	2013-12-19 20:43:25 -05:00
Henry de Valence	c35f5ba42d	HLint: Remove lambdas.	2013-12-19 20:28:53 -05:00
John MacFarlane	7f09c1834d	Markdown writer: Fix rendering of tight sublists. E.g. - foo - bar - baz Previously a spurious blank line was included before the last item. Closes #1050.	2013-11-30 17:59:28 -08:00
John MacFarlane	cf149fcf38	Fixed bug with intraword emphasis. Closes #1066.	2013-11-22 19:41:08 -08:00
John MacFarlane	e63aafd620	Fix definition lists with internal links in terms (closes #1032 ). This fix puts braces around a term that contains an internal link, to avoid problems with square brackets.	2013-10-21 17:33:42 -07:00
John MacFarlane	9d6bca06ee	Pass the buildDir as first argument to test suite. Allows test suite to work with cabal sandboxes. Previously we hard-coded the build directory.	2013-10-20 12:36:26 -07:00
John MacFarlane	9b0b9b6e03	Markdown reader: Don't autolink a bare URI that is followed by `</a>`. Closes #937.	2013-09-01 15:18:56 -07:00
John MacFarlane	9282f63278	Use registerHeader in RST and LaTeX readers. This will give automatic unique identifiers, unless `-auto_identifiers` is specified.	2013-09-01 09:13:31 -07:00
John MacFarlane	4e4c948b41	Added markdown citation parsing test.	2013-08-26 22:30:27 -07:00
John MacFarlane	e7a4bcc6fe	Merge pull request #961 from nougad/add_latex_listings_label Write id for code block to label attr in latex when listing is used	2013-08-25 20:48:38 -07:00
John MacFarlane	deb59b6235	Removed dependency on citeproc-hs. Going forward we'll use pandoc-citeproc, as an external filter. The `--bibliography`, `--csl`, and `--citation-abbreviation` fields have been removed. Instead one must include `bibliography`, `csl`, or `csl-abbrevs` fields in the document's YAML metadata. The filter can then be used as follows: pandoc --filter pandoc-citeproc The `Text.Pandoc.Biblio` module has been removed. Henceforth, `Text.CSL.Pandoc` from pandoc-citations can be used by library users. The Markdown and LaTeX readers now longer format bibliographies and citations. That must be done using `processCites` or `processCites'` from Text.CSL.Pandoc. All bibliography-related fields have been removed from `ReaderOptions` and `WriterOptions`: `writerBiblioFiles`, `readerReferences`, `readerCitationStyle`. API change.	2013-08-24 22:33:01 -07:00
Florian Eitel	5f09cf7ff0	Write id for code block to label attr in latex when listing is used The code: ~~~{#test} asdf ~~~ gets compiled to html: <pre id="test"> asdf </pre> So it is possible to link to the identifier `test` But this doesn't happen on latex When using the listings package (`--listings`) it is possible to set the identifier using the `label=test` property: \begin{lstlisting}[label=id] hi \end{lstlisting} And this is exactly what this patch is doing. Modified LaTeX Reader/Writer and added tests for this.	2013-08-22 20:15:36 +02:00
John MacFarlane	7048c130ec	Create Cite element even if no matching reference in the biblio. * Add ??? as fallback text for non-resolved citations. * Biblio: Put references (including a header at the end of the document, if one exists) inside a Div with class "references". This gives some control over styling of references, and allows scripts to manipulate them. * Markdown writer: Print markdown citation codes, and disable printing of references, if `citations` extension is enabled. NOTE: It would be good to improve what citeproc-hs does for a nonexistent key.	2013-08-20 20:47:06 -07:00
claremacrae	2ae2fcde2f	Add extra pair of test files for dokuwiki writer (#386 ) I've found some incorrect behaviours with the dokuwiki output, for which extra test cases will be needed - that aren't covered by the standard pandoc test input files.	2013-08-17 18:53:01 +01:00
John MacFarlane	441a7aebf8	LaTeX writer: Avoid problem with footnotes in unnumbered headers. Closes #940. Added test case.	2013-08-16 13:03:38 -07:00
John MacFarlane	6f736dfa75	Added Tests.Walk. This verifies that walk and query match the generic traversals.	2013-08-10 19:04:15 -07:00
John MacFarlane	cbfa932106	Adjustments for new Format newtype.	2013-08-10 17:24:54 -07:00
John MacFarlane	800c5490ec	LaTeX reader: Don't add spurious ", " to citation suffixes. This is added when needed in Text.Pandoc.Biblio anyway.	2013-07-21 11:44:49 -07:00
Clare Macrae	7eded47bcd	Initial work to create dokuwiki writer (#386 ) In this first version, all dokuwiki files are straight copies of the media wiki counterparts.	2013-07-14 13:40:27 +01:00
John MacFarlane	35e2caa058	Updated a test whose output changed due to last commit.	2013-07-13 13:47:09 -07:00
John MacFarlane	79a4ea03e2	Stop escaping `\|` in LaTeX math. This caused problems with array environments. Closes #891.	2013-06-26 20:54:31 -07:00
John MacFarlane	f869f7e08d	Use new flexible metadata type. * Depend on pandoc 1.12. * Added yaml dependency. * `Text.Pandoc.XML`: Removed `stripTags`. (API change.) * `Text.Pandoc.Shared`: Added `metaToJSON`. This will be used in writers to create a JSON object for use in the templates from the pandoc metadata. * Revised readers and writers to use the new Meta type. * `Text.Pandoc.Options`: Added `Ext_yaml_title_block`. * Markdown reader: Added support for YAML metadata block. Note that it must come at the beginning of the document. * `Text.Pandoc.Parsing.ParserState`: Replace `stateTitle`, `stateAuthors`, `stateDate` with `stateMeta`. * RST reader: Improved metadata. Treat initial field list as metadata when standalone specified. Previously ALL fields "title", "author", "date" in field lists were treated as metadata, even if not at the beginning. Use `subtitle` metadata field for subtitle. * `Text.Pandoc.Templates`: Export `renderTemplate'` that takes a string instead of a compiled template.. * OPML template: Use 'for' loop for authors. * Org template: '#+TITLE:' is inserted before the title. Previously the writer did this.	2013-06-24 20:29:41 -07:00
John MacFarlane	a2b98ba218	Added test for #882 .	2013-06-19 09:27:11 -07:00
John MacFarlane	0b85ad7546	Added stubs for haddock reader tests. Modify tests/haddock-reader.haddock and tests/haddock-reader.native.	2013-03-28 15:58:09 -07:00
John MacFarlane	5b4d239b85	Added OPML template, tests. Minor fixes to OPML writer. Improved OPML reader tests.	2013-03-20 10:17:59 -07:00
John MacFarlane	74d53f4347	Added Text.Pandoc.Readers.OPML, exporting readOPML. The _note attribute is supported. This is unofficial, but used e.g. in OmniOutliner and supported by multimarkdown. We treat the contents as markdown blocks under a section header. Added to documentation and tests.	2013-03-19 20:22:14 -07:00
John MacFarlane	835deee58b	Markdown writer: New approach for citations. * Reverts 1.11 change that caused citations to be rendered as markdown citations, even if `--biblio` was specified, unless `citation` extension is disabled. Now, formatted citations are always printed if `--biblio` was specified. If you want to reformat markdown keeping pandoc markdown citations intact, just don't specify `--biblio`. * Reverted now unnecessary changes to Text.Pandoc.Biblio adding the raw block to mark the bibliography, and to Text.Pandoc.Writers.Markdown to remove the bibliography if `citations` not specified. * If the content of a `Cite` inline is a `RawInline "latex"`, which means that a LaTeX citation command was parsed and `--biblio` wasn't specified, then render it as a pandoc markdown citation. This means that `pandoc -f latex -t markdown`, without `--biblio`, will convert LaTeX citation commands to pandoc markdown citations.	2013-03-17 10:33:54 -07:00
John MacFarlane	cae52ecc31	Revert "LaTeX reader: citation handling changes." This reverts commit `f7229b1473`.	2013-03-17 08:48:29 -07:00
John MacFarlane	f7229b1473	LaTeX reader: citation handling changes. Previously, a LaTeX citation would always be parsed as a Citation element, with the raw LaTeX in the [Inline] part. Now, the LaTeX citation is parsed as a Citation element only if `--biblio` was specified (i.e. only if there is a nonempty set of references in readerReferences). Otherwise it is parsed as raw LaTeX. This will make it possible to simplify some things in the markdown writer. It also makes the LaTeX reader behave more like the Markdown reader.	2013-03-09 10:33:25 -08:00
John MacFarlane	af7e97b9f5	Markdown writer: Render citations as pandoc-markdown citations. Previously citations were rendered as citeproc-formatted citations by default. Now we render them as pandoc citations, e.g. `[@item1]`, unless the `citations` extension is disabled. If you still want formatted citations in your markdown output, use `pandoc -t markdown-citations`.	2013-03-07 16:38:19 -08:00
John MacFarlane	3b63cb0903	Hide Text.Pandoc.Highlighting. * Moved code for translating listings language names to highlighting-kate names and back from LaTeX reader to Highlighting. * Text.Pandoc.Highlighting no longer exposed (API change) * Text.Pandoc.Highlighting exports toListingsLang, fromListingsLang	2013-03-05 22:09:42 -08:00
John MacFarlane	daeb52d4e0	Eliminated use of TH in test suite.	2013-01-23 19:26:39 -08:00
John MacFarlane	2fbe611a96	Get rid of compiler warnings in Tests.Helpers.	2013-01-19 10:41:12 -08:00
John MacFarlane	bf3a911a1c	Changed Ext_autolink_urls -> Ext_autolink_bare_uris. Added tests.	2013-01-15 12:44:50 -08:00
John MacFarlane	e9b3d5aa7a	Added lots of tests for bare URIs.	2013-01-15 12:28:31 -08:00
John MacFarlane	5c067bb457	RST reader: Line block improvements. * Use nonbreaking spaces for initial indent (otherwise lost in HTML and LaTeX). * Allow multiple paragraphs in a single line block.	2013-01-13 11:15:31 -08:00
John MacFarlane	70e308f2f9	Escape `\|` as `\vert` in LaTeX math. This avoids a clash with highlighting-kate's macros, which redefine \| as a short verbatim delimiter. Thanks to Björn Peemöller for raising this issue.	2013-01-12 10:21:19 -08:00
John MacFarlane	f2aa5fd661	Fixed/simplified diff output for tests.	2013-01-12 10:21:07 -08:00
John MacFarlane	d599c4cdab	Added Attr field to Header. Previously header ids were autogenerated by the writers. Now they are generated (unless supplied explicitly) in the markdown parser, if the `header_identifiers` extension is selected. In addition, the textile reader now supports id attributes on headers.	2013-01-09 09:30:05 -08:00
John MacFarlane	8ff81dc9ca	Updated tests for tight/loose lists. Taking into account new context/latex output, and fixing some bugs in the test suite Tests.Helpers and Tests.Writers.ConTeXt. (We had the wrong order of expected/actual in the diff output.)	2013-01-07 20:58:49 -08:00
John MacFarlane	56ff5e1845	Updated test runner for changes in pandoc.	2013-01-03 11:20:10 -08:00
John MacFarlane	0675346e76	Fixed test suite to use Diff 0.2 API.	2013-01-02 11:41:22 -08:00
John MacFarlane	06300e59d5	Removed citationSuppressParens. Makefile: Use citeproc-0.3.6 release.	2012-10-28 09:36:15 -07:00

... 2 3 4 5 6 ...

402 commits