1683 commits

Author SHA1 Message Date
John MacFarlane
998fd098d0 Moved parsing functions from Text.Pandoc.Shared to new module.
+ Text.Pandoc.Parsing
2010-07-05 00:06:27 -07:00
John MacFarlane
f1114733a6 Added an EPUB writer.
+ New writer module Text.Pandoc.Writers.EPUB
+ Stylesheet in epub.css
+ --epub-stylesheet command-line option.
+ New utility module Text.Pandoc.UUID to generate
  random UUIDs for EPUBs.
2010-07-05 00:06:27 -07:00
John MacFarlane
ff4d94e054 Made a proper ODT writer.
+ Transformed the old Text.Pandoc.ODT module into a proper
  writer module, Text.Pandoc.Writers.ODT.
+ Instead of saveOpenDocumentAsODT, we now have writeODT, which
  takes a Pandoc document and produces a bytestring.
  saveOpenDocumentAsODT has been removed.
+ To extract the images and insert them into the ODT, we now use
  processPandocM on the Pandoc document rather than a custom XML parser.
+ Handle the case where the image is remote (or not found) by
  converting the Image element into an Emph with the label.
+ Plumbing in pandoc.hs changed slightly to accommodate this, and to
  allow other writers that live in the IO monad.
2010-07-05 00:06:26 -07:00
John MacFarlane
ee51124412 Use unicode for smart punctuation in plain, markdown, rst formats. 2010-06-30 18:53:35 -07:00
John MacFarlane
163b0c0f8a Let markdown2pdf recognize citeproc options. 2010-06-30 18:34:14 -07:00
John MacFarlane
d0be7ca8cb Add bird tracks to lhs in HTML when compiled without -fhighlighting.
Resolves Issue #242.  Previously the bird tracks would be
stripped off when pandoc was not compiled with highlighting support,
even if -t html+lhs was specified.

Thanks to Nicholas Wu for pointing out the problem.
2010-06-28 19:49:07 -07:00
John MacFarlane
b5bda7569e Made KeyTable a map instead of an association list.
* This affects the RST and Markdown readers.
* The type for stateKeys in ParserState has also changed.
* Pandoc, Meta, Inline, and Block have been given Ord instances.
* Reference keys now have a type of their own (Key), with its
  own Ord instance for case-insensitive comparison.
2010-05-08 10:29:40 -07:00
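The idea in b5bda7569e of a key type with its own case-insensitive Ord instance, used as the key of a map, can be sketched roughly as follows. This is a simplified illustration and not pandoc's actual code: here Key wraps a plain String, and Target, KeyTable as defined below, and lookupRef are hypothetical names chosen for the example.

```haskell
import Data.Char (toLower)
import qualified Data.Map as M

-- Hypothetical simplified key: a plain String label stands in for
-- whatever pandoc's real Key wraps.
newtype Key = Key String deriving (Show)

-- Lower-case the label so that "Foo" and "foo" compare as the same key.
normalize :: Key -> String
normalize (Key s) = map toLower s

-- Eq and Ord are both defined on the normalized label, so they agree.
instance Eq Key where
  a == b = normalize a == normalize b

instance Ord Key where
  compare a b = compare (normalize a) (normalize b)

-- Hypothetical target type: a URL and a title.
type Target = (String, String)

-- A map from keys to targets, replacing an association list.
type KeyTable = M.Map Key Target

-- Look up a reference label; the comparison ignores case.
lookupRef :: String -> KeyTable -> Maybe Target
lookupRef label = M.lookup (Key label)

main :: IO ()
main = do
  let table = M.fromList [(Key "Pandoc", ("http://example.com/", "Home"))]
  print (lookupRef "pandoc" table)
  print (lookupRef "missing" table)
```

With a map, lookups are logarithmic in the number of keys rather than linear, and a repeated label differing only in case lands on the same entry.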
John MacFarlane
91f52e2229 Use explicit imports from Data.Generics.
Otherwise we have a conflict with the 'empty' symbol,
introduced in syb >= 0.2.  Thanks to ddssff for pointing
out the problem. Resolves Issue #237.
2010-05-07 11:28:38 -07:00
John MacFarlane
7d20ffe3e0 UTF8: Modified readFile and getContents to strip BOM if present. 2010-05-06 23:04:44 -07:00
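A minimal sketch of the BOM stripping mentioned in 7d20ffe3e0, assuming the input has already been decoded to a String. The name stripBOM is hypothetical, and the real readFile and getContents in the UTF8 module may be structured differently.

```haskell
-- Drop a leading byte order mark (U+FEFF) from decoded text, if present.
-- Any other input is returned unchanged.
stripBOM :: String -> String
stripBOM ('\xFEFF' : rest) = rest
stripBOM s = s

main :: IO ()
main = do
  putStrLn (stripBOM ('\xFEFF' : "hello"))
  putStrLn (stripBOM "hello")
```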
John MacFarlane
fb201a5b46 Updated man page and README.
Pandoc no longer respects locale, even when compiled by GHC 6.12.
2010-05-06 22:32:06 -07:00
John MacFarlane
d6dd5fff7b No longer need to set LANG in RunTests. 2010-05-06 22:30:32 -07:00
John MacFarlane
1cc11e6086 Use new UTF8 module in Shared, ODT, and the executables. 2010-05-06 22:28:28 -07:00
John MacFarlane
7be82b4536 Added Text.Pandoc.UTF8 for portable UTF8 string IO. 2010-05-06 20:27:10 -07:00
John MacFarlane
f20c4ae928 Convert command-line arguments to UTF8.
Resolves Issue #234.
2010-05-04 23:36:03 -07:00
John MacFarlane
d253955a7e Changed rawLaTeXInline to accept '\section', '\begin', etc.
Use new rawLaTeXInline' in LaTeX reader, and export rawLaTeXInline
for use in markdown reader.

Fixes bug wherein '\section{foo}' was not recognized as raw TeX
in markdown document.
2010-04-26 23:17:34 -07:00
John MacFarlane
4aaa1991fd Man writer: convert math to unicode when possible.
(As in other writers.)
2010-04-26 23:04:29 -07:00
John MacFarlane
c243e5b67b Use texmath's parser in TexMath module.
* This replaces a lot of custom parser code, and expands
  the tex -> unicode conversion.
* The behavior has also changed: if the whole formula can't
  be converted, the whole formula is left in raw TeX.
  Previously, pandoc converted parts of the formula to unicode
  and left other parts in raw TeX.
* Added (but not yet exported) readTeXMath', which returns a Maybe.
* Updated tests
2010-04-25 20:30:27 -07:00
John MacFarlane
b3669e139c RunTests: use UTF8 no matter what the locale. 2010-04-25 20:19:39 -07:00
John MacFarlane
2afa967ade Removed parsec < 3 restriction. 2010-04-23 22:43:47 -07:00
John MacFarlane
0d5065c1e7 Added 'threaded' flag.
+ GHC lacks a threaded runtime on some architectures.
  Provide a 'threaded' flag that can be disabled for compiling
  on these architectures.
+ Removed '-threaded' from pandoc's ghc-options.  '-threaded'
  is only needed for markdown2pdf, due to its use of readProcess.
2010-04-20 22:59:38 -07:00
John MacFarlane
5d9d7f32ca In parsing smart quotes, leave unicode curly quotes alone.
Resolves Issue #143.
2010-04-10 12:05:26 -07:00
John MacFarlane
54316a0159 Removed obsolete markdown2pdf script. 2010-04-10 11:20:52 -07:00
John MacFarlane
7d8c8c7880 Removed 'library' Cabal flag.
Reason: starting with Cabal 1.8, installing pandoc with '-library
+executable' did not work, since the build-depends in the Library
stanza were ignored. The problem could be solved by repeating the
build-depends in the Executable stanza, but this seems non-ideal
(and might lead to errors later on).  The '-library' option isn't
so useful anyway, since to compile pandoc in the first place, you
need a large number of Haskell libraries installed, and in this
case, why balk at another? It was chiefly intended for packagers,
but packagers will need to use a chroot environment anyway, and
they can then simply copy the executable and not the library.

Thanks to Jim Pryor for calling the problem to my attention
in connection with an Arch Linux package.
2010-04-10 11:16:48 -07:00
Max Bolingbroke
e365c5950b Give LaTeX images a maximum width using the prelude only 2010-04-10 12:01:35 +08:00
John MacFarlane
15c3297ba9 Updated HCAR entry. 2010-04-09 15:40:19 -07:00
John MacFarlane
5a60bfd645 Removed markdown2pdf from list of data-files. 2010-04-06 23:00:42 -07:00
John MacFarlane
be1e542e3b Updated version in windows setup file. 2010-03-29 07:31:42 -07:00
John MacFarlane
41bc23709c Bumped version to 1.5.1.1, updated changelog. 2010-03-29 07:15:28 -07:00
John MacFarlane
c5c4e19c99 Shared: Fixed uniqueIdent so it behaves as described in README.
Previously some characters that are illegal in HTML identifiers,
such as '<', were being allowed in header identifiers.  The logic
has now been fixed. Thanks to Xyne for reporting.
2010-03-28 22:29:31 -07:00
John MacFarlane
be832b3676 Updated RELEASE-CHECKLIST. 2010-03-23 17:22:24 -07:00
John MacFarlane
3ed1018c94 Updated changelog. 2010-03-23 16:04:04 -07:00
John MacFarlane
14e7cb362a Markdown, RST writers: unescape URI in links, to make them readable.
(Since they aren't required to be escaped in these formats.)
2010-03-23 15:35:19 -07:00
John MacFarlane
8504055360 Shared: Export unescapeURI. 2010-03-23 15:34:53 -07:00
John MacFarlane
c87d52223a Properly escape URIs in all readers. 2010-03-23 15:07:48 -07:00
John MacFarlane
c3384bb3f1 Added multilingual URL tests for markdown reader. 2010-03-23 15:07:17 -07:00
John MacFarlane
85ca50d623 Shared: Rewrote uri and emailAddress to return original text + escaped URI. 2010-03-23 15:06:18 -07:00
John MacFarlane
921d0f3081 Shared: Renamed stringToURI -> escapeURI. 2010-03-23 15:05:33 -07:00
John MacFarlane
09ca37bd62 Don't escape URIs in HTML writer.
This should be done in the readers instead.
2010-03-23 14:02:37 -07:00
John MacFarlane
9e65867300 Better definition of stringToURI.
Now it escapes all characters that aren't allowed in a URI.
%, ?, /, and other characters that are allowed in a URI are
left alone.  Unicode high characters are UTF-8 encoded.
2010-03-23 13:51:52 -07:00
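The escaping behavior described in 9e65867300 can be sketched as follows, using only base. This is an illustration and not the definition in Shared: the function names are hypothetical, and the set of characters left alone is an assumption based on the reserved and unreserved sets of RFC 3986 plus '%'.

```haskell
import Data.Bits (shiftR, (.&.), (.|.))
import Data.Char (intToDigit, isAlphaNum, isAscii, ord, toUpper)

-- Characters left alone (assumed set: RFC 3986 reserved and unreserved
-- characters, plus '%').
allowed :: Char -> Bool
allowed c = isAscii c && (isAlphaNum c || c `elem` "-._~:/?#[]@!$&'()*+,;=%")

-- UTF-8 encode a single character into its bytes.
utf8 :: Char -> [Int]
utf8 c
  | n < 0x80    = [n]
  | n < 0x800   = [0xC0 .|. (n `shiftR` 6), cont n]
  | n < 0x10000 = [0xE0 .|. (n `shiftR` 12), cont (n `shiftR` 6), cont n]
  | otherwise   = [ 0xF0 .|. (n `shiftR` 18), cont (n `shiftR` 12)
                  , cont (n `shiftR` 6), cont n ]
  where
    n = ord c
    -- Continuation byte: 10xxxxxx carrying the low six bits.
    cont x = 0x80 .|. (x .&. 0x3F)

-- Render one byte as a percent-encoded triple such as "%E6".
pctByte :: Int -> String
pctByte b = ['%', hex (b `shiftR` 4), hex (b .&. 0xF)]
  where
    hex = toUpper . intToDigit

-- Escape every character that is not allowed; leave the rest alone.
escapeURISketch :: String -> String
escapeURISketch = concatMap go
  where
    go c
      | allowed c = [c]
      | otherwise = concatMap pctByte (utf8 c)

main :: IO ()
main = do
  putStrLn (escapeURISketch "http://gitit.net/测试")
  putStrLn (escapeURISketch "a b?x=1%20")
```

Because '%' is in the allowed set, input that is already percent-encoded passes through without being escaped a second time.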
John MacFarlane
4fbacd5d5b Removed refs to hsmarkdown, html2markdown from pandoc man page. 2010-03-23 13:38:41 -07:00
John MacFarlane
10459627e8 Updated COPYRIGHT file. 2010-03-23 13:36:36 -07:00
John MacFarlane
1aeb7d23ad Updated copyright notices. 2010-03-23 13:31:09 -07:00
John MacFarlane
c1fe6c820a Updated changelog. 2010-03-23 00:51:16 -07:00
John MacFarlane
666a51b7fc Version bump to 1.5.1. 2010-03-23 00:46:54 -07:00
John MacFarlane
71eac37ac5 Fixed treatment of unicode characters in URIs.
* Added stringToURI to Shared.  This is used in the HTML
  writer for all URIs.  It properly URI-encodes high
  characters (> 127), leaving everything else (including
  symbols and spaces) the same.

* Modified unsanitaryURI to allow UTF8 characters in a URI.
  (First, we convert the URI to URI-encoded octets, then we
  pass through parseURIReference.)
  This resolves gitit Issue #99. Previously
  '[abc](http://gitit.net/测试)' would not be rendered as
  a link when --sanitize was selected.
2010-03-23 00:33:50 -07:00
John MacFarlane
7689cacb5d Setup.hs: Don't assume that the build directory is "dist".
Instead, get it from localBuildInfo.
2010-03-22 18:43:38 -07:00
John MacFarlane
1b1ba25432 OpenDocument writer: Use a Map for stTextStyleAttr.
This avoids duplicates (and invalid xml).
2010-03-22 09:46:55 -07:00
John MacFarlane
f7d896b07f Bump version in windows setup file. 2010-03-21 23:24:26 -07:00
John MacFarlane
17b4478d4e Whitespace fix in changelog. 2010-03-21 23:13:23 -07:00
John MacFarlane
0e6bb176f9 Updated changelog and bumped version to 1.5.0.1. 2010-03-21 22:51:04 -07:00