pandoc

Author	SHA1	Message	Date
Jesse Rosenthal	e88119f2d1	Docx Reader: Add test for VML images. Since images are often visually (not structurally) placed on the page, people might not always get the results they're looking for here.	2015-01-21 13:41:16 -05:00
John MacFarlane	a864e9a348	Merge pull request #1805 from bergey/rst RST Reader - Improved Role Support	2014-12-15 09:06:45 -08:00
John MacFarlane	1d3ca088f2	Merge pull request #1813 from tarleb/file-links Org reader: properly handle links to `file:target`	2014-12-14 13:36:34 -08:00
Albert Krewinkel	4d85b17fc5	Org reader: properly handle links to `file:target` Org links like `[[file:target][title]]` were not handled correctly, parsing the link target verbatim. The org reader is changed such that the leading `file:` is dropped from the link target. This is related to issues #756 and #1812.	2014-12-14 21:30:10 +01:00
John MacFarlane	2b08e32a90	Fixe autolinks with following punctuation. Closes #1811. The price of this is that autolinked bare URIs can no longer contain `>` characters, but this is not a big issue.	2014-12-14 12:20:33 -08:00
Daniel Bergey	689fb112bf	RST Reader: compute Attrs when role is defined Move recursive role lookup from renderRole to addNewRole. The Attr value will be the same for every occurance of this role, so there's no reason to compute it every time. This allows simplifying the stateRstCustomRoles map considerably. We could go even further, and remove the fmt and attr arguments to renderRole, which are null except for custom roles.	2014-12-12 14:45:45 +00:00
Daniel Bergey	4e040160e0	WIP: tests for RST roles	2014-12-12 14:45:45 +00:00
Daniel Bergey	74c1b547c2	parse RST class directives The class directive accepts one or more class names, and creates a Div value with those classes. If the directive has an indented body, the body is parsed as the children of the Div. If not, the first block folowing the directive is made a child of the Div. This differs from the behavior of rst2xml, which does not create a Div element. Instead, the specified classes are applied to each child of the directive. However, most Pandoc Block constructors to not take an Attr argument, so we can't duplicate this behavior.	2014-12-01 18:22:03 +00:00
Daniel Bergey	2cdfa5eb20	parse RST quoted literal blocks closes #65 RST quoted literal blocks are the same as indented literal blocks (which pandoc already supports) except that the quote character is preserved in each line. This includes test cases for the quoted literal block, as well as additional tests for line blocks and indented literal blocks, to verify that these are unaffected by the changes.	2014-12-01 18:22:03 +00:00
John MacFarlane	46d343f474	Fixed bug in org with bulleted lists: - a - b * c was being parsed as a list, even though an unindented `*` should make a heading. See <http://orgmode.org/manual/Plain-lists.html#fn-1>.	2014-11-13 23:40:18 -08:00
John MacFarlane	43c1978fae	Merge pull request #1645 from neongreen/issue1636 Fix 'Ext_lists_without_preceding_blankline' bug.	2014-11-12 09:05:29 -08:00
Albert Krewinkel	e6cd8c9077	Org reader: allow empty links for gitit interop While empty links are not allowed in Emacs org-mode, Pandoc org-mode should support them: gitit relies on empty links as they are used to create wiki links. Fixes jgm/gitit#471	2014-11-05 23:15:28 +01:00
Albert Krewinkel	daaf635806	Org reader: absolute, relative paths in links The org reader was to restrictive when parsing links, some relative links and links to files given as absolute paths were not recognized correctly. The org reader's link parsing function was amended to handle such cases properly. This fixes #1741	2014-11-05 22:27:25 +01:00
Jesse Rosenthal	60846471a3	Docx test: Remove Danish header test. Redundant, now that we're testing for a more generalized sort of internationalized blocks.	2014-10-25 16:02:31 -04:00
Jesse Rosenthal	c0ddcb359e	Docx reader: add tests for i18n headers. This tests blockquotes and headers in Russian. Previous tests make sure that this doesn't produce a regression in en-us Header and Blockquotes.	2014-10-25 16:00:27 -04:00
Albert Krewinkel	a5eb02f6a7	Org reader: parse LaTeX-style MathML entities Org supports special symbols which can be included using LaTeX syntax, but are actually MathML entities. Examples for this are `\nbsp` (non-breaking space), `\Aacute` (the letter A with accent acute) or `\copy` (the copyright sign ©). This fixes #1657.	2014-10-20 22:57:36 +02:00
John MacFarlane	84f6b1e41a	Merge pull request #1680 from shelf/master Respect indent when parsing Org bullet lists	2014-10-18 13:20:27 -07:00
John MacFarlane	31713d572a	Merge pull request #1700 from tarleb/org-emphasis-fix Org reader: fix rules for emphasis recognition	2014-10-18 13:19:42 -07:00
Albert Krewinkel	e3c36ed6ce	Org reader: Drop COMMENT document trees Document trees under a header starting with the word `COMMENT` are comment trees and should not be exported. Those trees are dropped silently. This closes #1678.	2014-10-18 22:11:53 +02:00
Albert Krewinkel	d571bec454	Org reader: fix rules for emphasis recognition Things like `/hello,/` or `/hi'/` were falsy recognized as emphasised strings. This is wrong, as `,` and `'` are forbidden border chars and may not occur on the inner border of emphasized text. This patch enables the reader to matches the reference implementation in that it reads the above strings as plain text.	2014-10-18 12:47:59 +02:00
Timothy Humphries	f1f56e8533	Fix indent issue for definition lists Tidy up fix for #1650, #1698 as per comments in #1680. Fix same issue for definition lists with the same method.	2014-10-17 20:06:25 -04:00
Timothy Humphries	4f4b0f031d	Respect indent when parsing Org bullet lists Fixes issue with top-level bullet list parsing. Previously we would use `many1 spaceChars` rather than respecting the list's indent level. We also permitted `*` bullets on unindented lists, which should unambiguously parse as `header 1`. Combined, this meant headers at a different indent level were being unwittingly slurped into preceding bullet lists, as per Issue #1650.	2014-10-12 03:18:36 -04:00
John MacFarlane	fe6d43b3e0	Merge pull request #1601 from jkr/windowsfix Fix path-slashes inside archive for windows	2014-09-27 16:21:17 -07:00
Matthew Pickering	fa2d11c954	Update tests for #1649	2014-09-27 22:40:25 +01:00
Artyom	bc115ffc2d	Fix 'Ext_lists_without_preceding_blankline' bug. * Fixes #1636. * Adds a test.	2014-09-26 13:32:08 +04:00
mpickering	c0b9ad4c5d	EPUB Tests: Seperating image testing from other features	2014-09-25 13:33:25 +01:00
Jesse Rosenthal	f56e0e958a	Docx reader: Add test for polyglot headers. Only Danish at the moment.	2014-09-05 22:07:06 -04:00
Jesse Rosenthal	313355e373	Org reader: Update Tests Test for markup after blank line.	2014-09-04 19:55:53 -04:00
Jesse Rosenthal	08359c44e4	Docx Reader: Add tests for numbered headers.	2014-09-04 19:39:49 -04:00
Jesse Rosenthal	a6eead7f26	Docx reader: Modify mediabag test accordingly.	2014-09-02 14:05:54 -04:00
John MacFarlane	598d3ee23b	Markdown reader: better handling of paragraph in div. Previously text that ended a div would be parsed as Plain unless there was a blank line before the closing div tag. Test case: <div class="first"> This is a paragraph. This is another paragraph. </div> Closes #1591.	2014-08-31 12:55:47 -07:00
mpickering	2cd049a1bf	Txt2Tags reader: Header is now parsed only if standalone flag is set	2014-08-20 18:11:37 +01:00
Jesse Rosenthal	180f5cbe63	Docx reader: Test for character styles.	2014-08-16 14:05:56 -04:00
John MacFarlane	40e67b8737	Revised tests directory. Renamed some tests, introducing subsidiary directories for fb2, docx, epub. Cleaned up tests in cabal file. Combined dokuwiki-writer and dokuwiki_inline_formatting tests.	2014-08-13 11:16:50 -07:00
Jesse Rosenthal	0808449547	Docx: Add dropcap tests.	2014-08-11 23:10:50 -04:00
Matthew Pickering	f33ae631f3	Improved EPUB Tests Rewrote features test to remove all unimplemented features. There are now all three examples of where an image can be included in the test. 1. Cover image 2. As a spine elemnt 3. In the document Tests have also been added to make sure that the mediabag contains all these images after processing.	2014-08-10 14:58:53 +01:00
Jesse Rosenthal	98d14b2b2a	Docx reader: Test inline image code.	2014-08-07 15:34:49 -04:00
Jesse Rosenthal	ed71e9b31d	Docx tests: rewrite mediabag tests. This will allow us to test the whole mediabag (making sure, for example, that images are added with the correct keys) instead of just individual extracted images. We compare each entry in the media bag to an image extracted on the fly from the docx. As a result, we only need one file to test with. The image in the current tests was also replaced with a smaller one.	2014-07-31 15:47:45 -04:00
John MacFarlane	6dd2418476	New module, Text.Pandoc.MediaBag. Moved `MediaBag` definition and functions from Shared: `lookupMedia`, `mediaDirectory`, `insertMedia`, `extractMediaBag`. Removed `emptyMediaBag`; use `mempty` instead, since `MediaBag` is a Monoid.	2014-07-31 12:00:21 -07:00
John MacFarlane	00662faefb	Made MediaBag a newtype, and added mime type information to media. Shared now exports functions for interacting with a MediaBag: - `emptyMediaBag` - `lookuMedia` - `insertMedia` - `mediaDirectory` - `extractMediaBag`	2014-07-31 11:05:35 -07:00
Jesse Rosenthal	4d1d8a4b6f	Docx test: Test image from media bag.	2014-07-30 22:32:55 -04:00
Jesse Rosenthal	16f88edb3b	Docx tests: Added media test comparison function. Also tell pandoc.cabal that we'll be needing base64, since we want to compare strings here.	2014-07-30 22:31:38 -04:00
Jesse Rosenthal	941df1b0de	Docx reader: change tests to make use of media bag.	2014-07-30 12:46:53 -04:00
Jesse Rosenthal	54708da371	Add and update docx tests in pandoc.cabal.	2014-07-29 13:05:19 -04:00
Jesse Rosenthal	840108a9c1	Docx reader: Make metavalues out of styled paragraphs. This will make paragraphs styled with `Author`, `Title`, `Subtitle`, `Date`, and `Abstract` into pandoc metavalues, rather than text. The implementation only takes those elements from the beginning of the document (ignoring empty paragraphs). Multiple paragraphs in the `Author` style will be made into a metaList, one paragraph per item. Hard linebreaks (shift-return) in the paragraph will be maintained, and can be used for institution, email, etc.	2014-07-29 13:03:01 -04:00
Matthew Pickering	e340a7da02	Txt2Tags Reader: Added tests	2014-07-27 00:12:57 +01:00
John MacFarlane	4af8eed764	Markdown reader: revised definition list syntax (closes #1429 ). * This change brings pandoc's definition list syntax into alignment with that used in PHP markdown extra and multimarkdown (with the exception that pandoc is more flexible about the definition markers, allowing tildes as well as colons). * Lazily wrapped definitions are now allowed; blank space is required between list items; and the space before definition is used to determine whether it is a paragraph or a "plain" element. * For backwards compatibility, a new extension, `compact_definition_lists`, has been added that restores the behavior of pandoc 1.12.x, allowing tight definition lists with no blank space between items, and disallowing lazy wrapping.	2014-07-20 16:33:59 -07:00
John MacFarlane	87096c64f8	Org reader: text adjacent to a list yields a Plain, not Para. This gives better results for tight lists. Closes #1437. An alternative solution would be to use Para everywhere, and never Plain. I am not sufficiently familiar with org to know which is best. Thoughts, @tarleb?	2014-07-20 12:56:01 -07:00
Craig S. Bosma	1bb4f0c497	Org reader: Respect :exports header arguments on code blocks Adds support to the org reader for conditionally exporting either the code block, results block immediately following, both, or neither, depending on the value of the `:exports` header argument. If no such argument is supplied, the default org behavior (for most languages) of exporting code is used.	2014-07-17 10:23:22 -05:00
Jesse Rosenthal	643435f1de	Docx reader: Add test Test auto ident header anchors with pandoc-generated pandoc.	2014-07-15 18:32:19 +01:00

1 2 3 4 5

237 commits