Matthew Pickering
8460ea417f
EPUB Reader: Integrated into program
2014-07-31 21:39:50 +01:00
Matthew Pickering
51051e9953
HTML Reader: Added ability to read MathML formatted <math> blocks
2014-07-31 21:39:50 +01:00
Matthew Pickering
7c1f867397
HTML Reader: Added support for anchors on links and list items
2014-07-31 21:39:50 +01:00
Matthew Pickering
266e1977e0
HTML Reader: Extended HTML Reader to recognise EPUB specific elements
2014-07-31 21:39:49 +01:00
Matthew Pickering
002ae95d7a
Options: Added option to turn on epub html extensions
2014-07-31 21:39:49 +01:00
Matthew Pickering
b57e554b59
Except Compat: Updated to export more module functions
2014-07-31 21:39:49 +01:00
Matthew Pickering
089745af61
EPUB Reader: Added EPUB reader
2014-07-31 21:39:49 +01:00
John MacFarlane
6dd2418476
New module, Text.Pandoc.MediaBag.
...
Moved `MediaBag` definition and functions from Shared:
`lookupMedia`, `mediaDirectory`, `insertMedia`, `extractMediaBag`.
Removed `emptyMediaBag`; use `mempty` instead, since `MediaBag`
is a Monoid.
2014-07-31 12:00:21 -07:00
John MacFarlane
00662faefb
Made MediaBag a newtype, and added mime type information to media.
...
Shared now exports functions for interacting with a MediaBag:
- `emptyMediaBag`
- `lookuMedia`
- `insertMedia`
- `mediaDirectory`
- `extractMediaBag`
2014-07-31 11:05:35 -07:00
Matthew Pickering
31e7c1b67c
Shared: Added function insertMedia which is an alias for M.insert
2014-07-31 15:47:49 +01:00
John MacFarlane
f1885ae799
Removed deprecated and no longer used readerStrict in ReaderOptions.
...
This is handled by readerExtensions now.
2014-07-30 18:32:06 -07:00
John MacFarlane
71e76175be
getT2TMeta: Take list of source files instead of single.
...
Get latest modification time.
2014-07-30 17:25:00 -07:00
John MacFarlane
e4913d6dba
Allow --self-contained to get content from MediaBag.
...
Added a parameter to makeSelfContained (API change).
2014-07-30 15:26:40 -07:00
John MacFarlane
23d806644f
RTF writer: Improved image embedding.
...
Use calculated sizes.
2014-07-30 14:49:57 -07:00
John MacFarlane
e365af9c23
RTF writer: refactored image embedding, using fetchItem'.
2014-07-30 14:27:51 -07:00
John MacFarlane
234652a4b8
PDF, Docx, EPUB, and ODT writers now automatically use MediaBag.
...
The MediaBag is thread through from the reader, with no need
to extract to files.
2014-07-30 14:07:31 -07:00
John MacFarlane
28321a18bf
Shared: Added fetchItem', which searches a media bag too.
2014-07-30 13:47:07 -07:00
John MacFarlane
67c5c7a575
Moved MediaBag back from Shared to Options, to avoid module cycle.
2014-07-30 13:46:48 -07:00
John MacFarlane
08e2498e73
Added writerMediaBag to WriterOptions.
2014-07-30 13:09:55 -07:00
John MacFarlane
555f9b746d
Moved MediaBag from Shared to Options.
...
This will allow us to put a MediaBag in WriterOptions.
2014-07-30 13:00:54 -07:00
John MacFarlane
d3cf53a956
Moved withTempDir from PDF to Shared, export from Shared.
...
API change.
2014-07-30 12:29:04 -07:00
Jesse Rosenthal
a79ea18503
Pandoc.hs: change BSReader to output MediaBag as well as pandoc.
2014-07-30 12:47:49 -04:00
Jesse Rosenthal
f78d2f6219
Shared: Make MediaBag available through Shared.
2014-07-30 12:47:26 -04:00
Jesse Rosenthal
9ce2295700
Docx reader: Make docx reader put image data in MediaBag.
...
Image data will not be put in a media bag map, which will be output
along with the pandoc output.
2014-07-30 12:46:03 -04:00
John MacFarlane
02c79ea4f6
Mediawiki writer: don't escape inside <source>
.
...
Closes #1445 . Escapes can still be used with `<code>`
and `<pre>`.
2014-07-29 21:32:07 -07:00
John MacFarlane
33a051d00d
Docx writer: Print subtitle from metadata if present.
...
Use Subtitle style.
See #1451 .
2014-07-29 20:46:55 -07:00
John MacFarlane
8c2ed54e2e
LaTeX writer: use \(..\) instead of $..$ for inline math.
...
Closes #1464 .
2014-07-29 20:45:49 -07:00
John MacFarlane
8d4eebaff4
Merge pull request #1463 from jkr/metadata
...
Make metadata out of styled pars
2014-07-29 11:15:34 -07:00
Jesse Rosenthal
840108a9c1
Docx reader: Make metavalues out of styled paragraphs.
...
This will make paragraphs styled with `Author`, `Title`, `Subtitle`,
`Date`, and `Abstract` into pandoc metavalues, rather than text. The
implementation only takes those elements from the beginning of the
document (ignoring empty paragraphs).
Multiple paragraphs in the `Author` style will be made into a metaList,
one paragraph per item. Hard linebreaks (shift-return) in the paragraph
will be maintained, and can be used for institution, email, etc.
2014-07-29 13:03:01 -04:00
Matthew Pickering
95fb0755c1
Parsing: Added isbn and pmid schemes
2014-07-27 19:59:57 +01:00
John MacFarlane
19109331df
Markdown writer: Separate adjacent lists of the same kind with comment.
...
Closes #1458 .
2014-07-27 08:26:17 -07:00
John MacFarlane
c302ab3133
Markdown writer: More improvements to 'plain' output, updated tests.
...
Math now appears in unicode if possible, without the distracting
italics around identifiers.
Blank lines around headers are more consistent.
Footnotes appear in regular [n] style.
2014-07-27 07:57:23 -07:00
John MacFarlane
cc24a1f3e5
Text.Pandoc.Pretty: added blanklines.
...
This ensures a certain number of blanklines (and no more) in output.
2014-07-27 07:56:55 -07:00
John MacFarlane
3eff3782c1
Markdown writer: Better 'plain' output.
...
We now largely follow the style of Project Gutenberg.
Emphasis is rendered with `_underscores_`, strong with ALL CAPS.
The appearance of horizontal rules has changed (even in regular
markdown) to a line across the whole page.
Headings are rendered differently, using space to set them off.
2014-07-27 00:17:15 -07:00
John MacFarlane
18d72a0768
Markdown writer: Update definition lists.
...
They now behave like the new reader does. The old behavior
can be activated with the `compact_definition_lists` extension.
2014-07-27 00:15:18 -07:00
John MacFarlane
b104db4fb4
Docx writer: Added missing case from last commit.
2014-07-26 23:32:13 -07:00
John MacFarlane
2610de0159
Docx writer: include abstract with Abstract style.
...
Addresses docx part of #1451 .
2014-07-26 22:55:45 -07:00
John MacFarlane
d9751f91c4
Merge pull request #1457 from mpickering/generalstate
...
Generalised more in Parsing.hs to enable the use of custom state
2014-07-26 19:01:26 -07:00
Matthew Pickering
9e4604fa0b
Added compatability layer to support directory-1.1
2014-07-27 00:36:23 +01:00
Matthew Pickering
f0a32197c8
Txt2Tags Reader: Added copyright information
2014-07-27 00:12:57 +01:00
Matthew Pickering
43304d6bd6
Txt2Tags Reader: Added recognition of macros
2014-07-27 00:12:56 +01:00
Matthew Pickering
ab3589ff0b
Txt2Tags Reader: Integrated into pandoc
2014-07-27 00:12:56 +01:00
Matthew Pickering
7d04d383a6
Added txt2tags reader
...
http://txt2tags.org/
There are two points which currently do not match the official
implementation.
1. In the official implementation lists can not be nested like the
following but the reader would interpret this as a bullet list with the
first item being a numbered list.
```
- + This is not a list
```
2. The specification describes how URIs automatically becomes links.
Unfortunately as is often the case, their definitiong of URI is not
clear. I tried three solutions but was unsure about which to adopt.
* Using isURI from Network.URI, this matches far too many strings and is
therefore unsuitable
* Using uri from Text.Pandoc.Shared, this doesn't match all strings that
the reference implementation matches
* Try to simulate the regex which is used in the native code
I went with the third approach but it is not perfect, for example
trailing punctuation is captured in Urls.
2014-07-27 00:12:56 +01:00
Matthew Pickering
5e2d22a27e
Generalised more in Parsing.hs to enable the use of custom state
2014-07-26 21:23:49 +01:00
John MacFarlane
18f4490482
Fixed runtime error with compactify'DL on certain lists.
...
Closes #1452 . Added test.
2014-07-25 10:53:04 -07:00
John MacFarlane
9c3f7688ee
DocBook reader: Better handle elements inside code environments.
...
Of course, we can't include structure in the code block, but
this way we at least preserve the text. Closes #1449 .
2014-07-23 10:06:36 -07:00
Matthew Pickering
83028c5982
Exported runParserT and Stream
2014-07-22 16:37:41 +01:00
Matthew Pickering
e045b1d5f2
Generalised readWith to readWithM
2014-07-22 15:13:54 +01:00
John MacFarlane
5debb492ef
Revert "Shared.hierarchicalize: Don't number subsections of unnumbered sections."
...
This reverts commit 2a46042661
.
2014-07-21 20:47:18 -07:00
John MacFarlane
2a46042661
Shared.hierarchicalize: Don't number subsections of unnumbered sections.
...
They were previously numbered, starting from the previous numbered
section, which was wrong.
2014-07-21 12:35:11 -07:00