No description
Find a file
John MacFarlane 0feb7504b1 Rewrote LaTeX reader with proper tokenization.
This rewrite is primarily motivated by the need to
get macros working properly.  A side benefit is that the
reader is significantly faster (27s -> 19s in one
benchmark, and there is a lot of room for further
optimization).

We now tokenize the input text, then parse the token stream.

Macros modify the token stream, so they should now be effective
in any context, including math. Thus, we no longer need the clunky
macro processing capacities of texmath.

A custom state LaTeXState is used instead of ParserState.
This, plus the tokenization, will require some rewriting
of the exported functions rawLaTeXInline, inlineCommand,
rawLaTeXBlock.

* Added Text.Pandoc.Readers.LaTeX.Types (new exported module).
  Exports Macro, Tok, TokType, Line, Column.  [API change]
* Text.Pandoc.Parsing: adjusted type of `insertIncludedFile`
  so it can be used with token parser.
* Removed old texmath macro stuff from Parsing.
  Use Macro from Text.Pandoc.Readers.LaTeX.Types instead.
* Removed texmath macro material from Markdown reader.
* Changed types for Text.Pandoc.Readers.LaTeX's
  rawLaTeXInline and rawLaTeXBlock.  (Both now return a String,
  and they are polymorphic in state.)
* Added orgMacros field to OrgState.  [API change]
* Removed readerApplyMacros from ReaderOptions.
  Now we just check the `latex_macros` reader extension.
* Allow `\newcommand\foo{blah}` without braces.

Fixes #1390.
Fixes #2118.
Fixes #3236.
Fixes #3779.
Fixes #934.
Fixes #982.
2017-07-07 12:36:00 +02:00
.github ISSUE_TEMPLATE: add URL for pandoc-discuss. 2017-03-13 14:38:07 +01:00
benchmark Fixed name shadowing in benchmark. 2017-06-19 23:42:27 +02:00
data data/pandoc.lua: regularize constructors. 2017-06-29 17:08:59 +02:00
doc Added link-table example to doc/lua-filters.md. 2017-06-28 15:31:42 +02:00
lib/fonts lib: Added symbol.txt and file to generate codepoint to unicode mapping 2014-08-09 22:37:12 -04:00
linux Added mention of vimwiki raeder more places. 2017-06-20 16:57:39 +02:00
macos Fixed MacOS packaging script. 2017-06-04 21:02:11 +02:00
man Updated man page. 2017-06-04 20:40:09 +02:00
prelude Remove unnecessary CPP in custom Prelude. 2016-09-03 15:23:32 -04:00
src/Text Rewrote LaTeX reader with proper tokenization. 2017-07-07 12:36:00 +02:00
test Rewrote LaTeX reader with proper tokenization. 2017-07-07 12:36:00 +02:00
tools Makefile: Separate refactor and reformat targets. 2017-03-04 13:13:12 +01:00
trypandoc minor updates to vimwiki reader. (#3759) 2017-06-26 08:41:51 +02:00
windows Windows packaging fixes to use new stack.pkg.yaml. 2017-02-12 22:04:53 +01:00
.editorconfig Fix editorconfig for test files 2014-04-12 12:22:09 +02:00
.gitignore Added deb/.vagrant to gitignore 2017-02-01 12:36:56 +01:00
.stylish-haskell.yaml Added 'make refactor' using hlint, stylish-haskell. 2017-03-04 12:49:14 +01:00
.travis.yml .travis.yml - removed hsb2hs stuff. 2017-06-04 15:59:05 +02:00
appveyor.yml Revert "appveyor.yml: don't use matrix." 2017-05-24 22:54:11 +02:00
BUGS BUGS: Added reference to CONTRIBUTING.md. 2013-04-14 22:14:44 -07:00
changelog Updated changelog. 2017-01-29 21:09:21 +01:00
CONTRIBUTING.md Fixed typos in CONTRIBUTING.md (#3479) 2017-03-01 15:00:53 +01:00
COPYING.md Download markdown version of the license from GNU and rename to COPYING.md 2016-10-19 04:11:36 -07:00
COPYRIGHT COPYRIGHT: list new files not written by John 2017-05-13 23:50:39 +02:00
INSTALL.md INSTALL: Improved instructions for tests with patterns. 2017-06-23 13:09:47 +02:00
Makefile Makefile: split 'make haddock' from 'make full'. 2017-06-25 10:06:19 +02:00
MANUAL.txt MANUAL: document ibooks specific epub metadata. 2017-06-30 23:56:01 +02:00
pandoc.cabal Rewrote LaTeX reader with proper tokenization. 2017-07-07 12:36:00 +02:00
pandoc.hs hlint suggestions. 2017-06-02 15:25:39 +02:00
README.md Added mention of vimwiki raeder more places. 2017-06-20 16:57:39 +02:00
RELEASE-CHECKLIST Update RELEASE_CHECKLIST. 2017-02-27 10:19:22 +01:00
RELEASE-CHECKLIST.md Updated RELEASE-CHECKLIST and markdownified. 2017-01-25 17:07:41 +01:00
Setup.hs Removed unused imports from Setup.hs. 2017-04-03 09:50:44 +02:00
stack.full.yaml Remove https flag. 2017-05-07 12:49:25 +02:00
stack.pkg.yaml Updated stack.pkg.yaml. 2017-06-30 23:26:17 +02:00
stack.yaml Use latest texmath. 2017-06-30 20:39:53 +02:00

Pandoc

github release hackage release homebrew stackage LTS package travis build status appveyor build status license pandoc-discuss on google groups

The universal markup converter

Pandoc is a Haskell library for converting from one markup format to another, and a command-line tool that uses this library. It can read Markdown, CommonMark, PHP Markdown Extra, GitHub-Flavored Markdown, MultiMarkdown, and (subsets of) Textile, reStructuredText, HTML, LaTeX, MediaWiki markup, TWiki markup, Haddock markup, OPML, Emacs Org mode, DocBook, Muse, txt2tags, Vimwiki, EPUB, ODT, and Word docx; and it can write plain text, Markdown, CommonMark, PHP Markdown Extra, GitHub-Flavored Markdown, MultiMarkdown, reStructuredText, XHTML, HTML5, LaTeX (including beamer slide shows), ConTeXt, RTF, OPML, DocBook, OpenDocument, ODT, Word docx, GNU Texinfo, MediaWiki markup, DokuWiki markup, ZimWiki markup, Haddock markup, EPUB v2 or v3, FictionBook2, Textile, groff man, [groff ms], Emacs Org mode, AsciiDoc, InDesign ICML, TEI Simple, Muse and Slidy, Slideous, DZSlides, reveal.js or S5 HTML slide shows. It can also produce PDF output on systems where LaTeX, ConTeXt, pdfroff, or wkhtmltopdf is installed.

Pandoc's enhanced version of Markdown includes syntax for footnotes, tables, flexible ordered lists, definition lists, fenced code blocks, superscripts and subscripts, strikeout, metadata blocks, automatic tables of contents, embedded LaTeX math, citations, and [Markdown inside HTML block elements][Extension: markdown_in_html_blocks]. (These enhancements, described further under Pandoc's Markdown, can be disabled using the markdown_strict input or output format.)

In contrast to most existing tools for converting Markdown to HTML, which use regex substitutions, pandoc has a modular design: it consists of a set of readers, which parse text in a given format and produce a native representation of the document, and a set of writers, which convert this native representation into a target format. Thus, adding an input or output format requires only adding a reader or writer.

Because pandoc's intermediate representation of a document is less expressive than many of the formats it converts between, one should not expect perfect conversions between every format and every other. Pandoc attempts to preserve the structural elements of a document, but not formatting details such as margin size. And some document elements, such as complex tables, may not fit into pandoc's simple document model. While conversions from pandoc's Markdown to all formats aspire to be perfect, conversions from formats more expressive than pandoc's Markdown can be expected to be lossy.

Installing

Here's how to install pandoc.

Documentation

Pandoc's website contains a full User's Guide. It is also available here as pandoc-flavored Markdown. The website also contains some examples of the use of pandoc and a limited online demo.

Contributing

Pull requests, bug reports, and feature requests are welcome. Please make sure to read the contributor guidelines before opening a new issue.

License

© 2006-2017 John MacFarlane (jgm@berkeley.edu). Released under the GPL, version 2 or greater. This software carries no warranty of any kind. (See COPYRIGHT for full copyright and warranty notices.)