John MacFarlane
842c705097
SelfContained: Fixed determining of source URL from within CSS files.
...
(This fixes a bug introduced a couple commits back.)
2014-08-02 16:33:22 -07:00
John MacFarlane
85ff3c5771
fetchItem: improved mime type guessing.
...
Strip a fragment like `?#iefix` from the extension before doing
the mime lookup.
2014-08-02 16:32:11 -07:00
John MacFarlane
1d137fbed6
Shared: fetchItem improvements.
...
* More consistent logic: absolute URIs are fetched from the net;
other things are treated as relative URIs if sourceURL is a Just,
otherwise as file paths.
* We escape characters that are not allowed in URIs before trying
to parse them (e.g. '|', which often occurs in the wild).
* When treating relative paths as local file paths, we drop
any fragment or query. This is useful e.g. when you've downloaded
web fonts locally, but your source still contains the original
relative URLs.
Together with the previous commit, this should close #1477 .
2014-08-02 16:12:05 -07:00
John MacFarlane
ce8922437d
Text.Pandoc.SelfContained changes.
...
* mkSelfContained now takes just two arguments, WriterOptions and
the string.
* It no longer looks in data files. This only made sense when we
had copies of slidy and S5 code there.
* Shared.fetchItem' is used instead of the nearly duplicate getItem.
2014-08-02 16:07:19 -07:00
John MacFarlane
cbaaa17d49
Bump aeson upper bound.
2014-08-02 15:14:15 -07:00
John MacFarlane
36c9537b90
benchmark: use only test suite for benchmark.
...
This way the benchmarks don't take so long to run.
2014-08-01 13:40:26 -07:00
John MacFarlane
836d1f06bb
Use nfIO in benchmark instead of getLength trick.
2014-08-01 13:20:08 -07:00
John MacFarlane
a1bb71a02c
Merge pull request #1475 from Aelve/fix-benchmark
...
Make benchmarks compile again.
2014-08-01 08:40:36 -07:00
Jesse Rosenthal
3a411c3307
Docx Parser: Produce endnotes.
...
The parser had been changing footnotes and endnotes into footnotes. This
isn't a problem, because pandoc collapses them, but the parser should
maintain as much of the docx structure as is collapsed, and let the
toplevel reader worry about how to translate it into Pandoc. (This would
be an issue when, as is planned, the docx parser spins off into its
own module.)
The output is the same, so no test change is required.
2014-08-01 09:16:42 -04:00
Artyom Kazak
857c63f77e
Make benchmarks compile again.
...
Additionally, fix the problem which caused one failing benchmark
to stop other benchmarks from running.
2014-08-01 15:40:25 +04:00
John MacFarlane
1b9371cfdf
Merge branch 'underline-option' of https://github.com/jkr/pandoc
2014-07-31 16:40:20 -07:00
John MacFarlane
f075b0e5d9
pandoc.hs: More code reorganization.
2014-07-31 16:27:06 -07:00
Jesse Rosenthal
43ca784d1c
Update docx test to interpret single underline as emph.
2014-07-31 18:46:43 -04:00
Jesse Rosenthal
57939a9282
Docx Reader: Single underlines are "emph"
...
All other underlines are ignored.
2014-07-31 18:44:41 -04:00
John MacFarlane
2b7e7acc4b
Merge pull request #1473 from jkr/mediabag-test-2
...
Docx tests: rewrite mediabag tests.
2014-07-31 15:28:20 -07:00
Matthew Pickering
cd9a5d90cb
EPUB Reader: Now uses the new MediaBag for images
2014-07-31 22:48:08 +01:00
Beni Cherniavsky-Paskin
4919b1b568
README: make all extension headers H4
2014-07-31 23:56:40 +03:00
Matthew Pickering
d6717c7aba
README: Updated with EPUB information
2014-07-31 21:39:50 +01:00
Matthew Pickering
0ae2c1f146
EPUB Reader: Added tests
2014-07-31 21:39:50 +01:00
Matthew Pickering
8460ea417f
EPUB Reader: Integrated into program
2014-07-31 21:39:50 +01:00
Matthew Pickering
51051e9953
HTML Reader: Added ability to read MathML formatted <math> blocks
2014-07-31 21:39:50 +01:00
Matthew Pickering
7c1f867397
HTML Reader: Added support for anchors on links and list items
2014-07-31 21:39:50 +01:00
Matthew Pickering
266e1977e0
HTML Reader: Extended HTML Reader to recognise EPUB specific elements
2014-07-31 21:39:49 +01:00
Matthew Pickering
002ae95d7a
Options: Added option to turn on epub html extensions
2014-07-31 21:39:49 +01:00
Matthew Pickering
b57e554b59
Except Compat: Updated to export more module functions
2014-07-31 21:39:49 +01:00
Matthew Pickering
089745af61
EPUB Reader: Added EPUB reader
2014-07-31 21:39:49 +01:00
Beni Cherniavsky-Paskin
dc799507e1
README: blank line after headers
2014-07-31 23:11:56 +03:00
John MacFarlane
0565a81676
pandoc.hs: Rewrote some of the logic for clarity.
2014-07-31 12:51:01 -07:00
Jesse Rosenthal
96ad37536e
Remove now unneeded JSON test file.
2014-07-31 15:47:45 -04:00
Jesse Rosenthal
ed71e9b31d
Docx tests: rewrite mediabag tests.
...
This will allow us to test the whole mediabag (making sure, for example,
that images are added with the correct keys) instead of just individual
extracted images. We compare each entry in the media bag to an image
extracted on the fly from the docx. As a result, we only need one file
to test with.
The image in the current tests was also replaced with a smaller one.
2014-07-31 15:47:45 -04:00
John MacFarlane
6dd2418476
New module, Text.Pandoc.MediaBag.
...
Moved `MediaBag` definition and functions from Shared:
`lookupMedia`, `mediaDirectory`, `insertMedia`, `extractMediaBag`.
Removed `emptyMediaBag`; use `mempty` instead, since `MediaBag`
is a Monoid.
2014-07-31 12:00:21 -07:00
John MacFarlane
00662faefb
Made MediaBag a newtype, and added mime type information to media.
...
Shared now exports functions for interacting with a MediaBag:
- `emptyMediaBag`
- `lookuMedia`
- `insertMedia`
- `mediaDirectory`
- `extractMediaBag`
2014-07-31 11:05:35 -07:00
John MacFarlane
6e96f8b478
Merge branch 'master' of github.com:jgm/pandoc
2014-07-31 09:43:08 -07:00
John MacFarlane
76031f01c1
Merge pull request #1472 from mpickering/master
...
Shared: Added function insertMedia which is an alias for M.insert
2014-07-31 09:42:53 -07:00
John MacFarlane
fcafcaa8e6
Added note in README about pipe tables not wrapping in LaTeX/PDF.
...
See
http://stackoverflow.com/questions/25037357/pandoc-long-tablerows-in-markdown-pdf-documents-do-not-get-linewrap/25050474#25050474
2014-07-31 07:48:14 -07:00
Matthew Pickering
31e7c1b67c
Shared: Added function insertMedia which is an alias for M.insert
2014-07-31 15:47:49 +01:00
John MacFarlane
143d5b7328
Merge pull request #1470 from jkr/metadata-test
...
Media test
2014-07-30 23:52:36 -07:00
Jesse Rosenthal
4d1d8a4b6f
Docx test: Test image from media bag.
2014-07-30 22:32:55 -04:00
Jesse Rosenthal
b24b328906
Docx tests: Add test image.
...
This is the cow image extracted from `docx.image.docx`.
2014-07-30 22:32:27 -04:00
Jesse Rosenthal
16f88edb3b
Docx tests: Added media test comparison function.
...
Also tell pandoc.cabal that we'll be needing base64, since we want to
compare strings here.
2014-07-30 22:31:38 -04:00
John MacFarlane
f1885ae799
Removed deprecated and no longer used readerStrict in ReaderOptions.
...
This is handled by readerExtensions now.
2014-07-30 18:32:06 -07:00
John MacFarlane
71e76175be
getT2TMeta: Take list of source files instead of single.
...
Get latest modification time.
2014-07-30 17:25:00 -07:00
John MacFarlane
3e26fb517d
Updated RTF writer tests.
2014-07-30 15:30:51 -07:00
John MacFarlane
e4913d6dba
Allow --self-contained to get content from MediaBag.
...
Added a parameter to makeSelfContained (API change).
2014-07-30 15:26:40 -07:00
John MacFarlane
23d806644f
RTF writer: Improved image embedding.
...
Use calculated sizes.
2014-07-30 14:49:57 -07:00
John MacFarlane
e365af9c23
RTF writer: refactored image embedding, using fetchItem'.
2014-07-30 14:27:51 -07:00
John MacFarlane
234652a4b8
PDF, Docx, EPUB, and ODT writers now automatically use MediaBag.
...
The MediaBag is thread through from the reader, with no need
to extract to files.
2014-07-30 14:07:31 -07:00
John MacFarlane
28321a18bf
Shared: Added fetchItem', which searches a media bag too.
2014-07-30 13:47:07 -07:00
John MacFarlane
67c5c7a575
Moved MediaBag back from Shared to Options, to avoid module cycle.
2014-07-30 13:46:48 -07:00
John MacFarlane
77aa72ec24
pandoc: Thread media bag into WriterOptions.
...
This will make it available to docx and epub readers, etc.,
so we don't have to extract media to a directory when going
from docx -> epub.
2014-07-30 13:20:44 -07:00