John MacFarlane 6e45607f99 Change reader types, allowing better tracking of source positions.	2021-05-09 19:11:34 -06:00

Previously, when multiple file arguments were provided, pandoc simply concatenated them and passed the contents to the readers, which took a Text argument. As a result, the readers had no way of knowing which file was the source of any particular bit of text. This meant that we couldn't report accurate source positions on errors or include accurate source positions as attributes in the AST. More seriously, it meant that we couldn't resolve resource paths relative to the files containing them (see e.g. #5501, #6632, #6384, #3752).

Add Text.Pandoc.Sources (exported module), with a `Sources` type and a `ToSources` class. A `Sources` wraps a list of `(SourcePos, Text)` pairs. [API change] A parsec `Stream` instance is provided for `Sources`. The module also exports versions of parsec's `satisfy` and other Char parsers that track source positions accurately from a `Sources` stream (or any instance of the new `UpdateSourcePos` class).

Text.Pandoc.Parsing now exports these modified Char parsers instead of the ones parsec provides. Modified parsers to use a `Sources` as stream [API change].

The readers that previously took a `Text` argument have been modified to take any instance of `ToSources`. So, they may still be used with a `Text`, but they can also be used with a `Sources` object.

In Text.Pandoc.Error, modified the constructor PandocParsecError to take a `Sources` rather than a `Text` as first argument, so parse error locations can be accurately reported.

T.P.Error: showPos, do not print "-" as source name.

.circleci	MacOS release build: remove -split-sections.	2021-03-02 10:06:30 -08:00
.github	Fix nightly build.	2021-04-19 12:59:11 -07:00
app	Avoid unnecessary use of NoImplicitPrelude pragma (#7089 )	2021-02-07 10:02:35 -08:00
benchmark	Benchmark improvements.	2021-03-17 13:34:17 -07:00
citeproc/biblatex-localization	Add built-in citation support using new citeproc library.	2020-09-21 10:15:50 -07:00
data	Support toc-title in revealjs (#7171 )	2021-04-29 18:49:17 -07:00
doc	Add instructions for installing pandoc-types before compiling filter.	2021-04-30 08:35:52 -07:00
linux	Revert the -j related changes in make_artifacts.sh.	2021-03-18 14:59:50 -07:00
macos	Add built-in citation support using new citeproc library.	2020-09-21 10:15:50 -07:00
man	Update man page and manual date.	2021-03-20 00:40:04 -07:00
prelude	Use implicit Prelude (#6187 )	2020-03-15 09:45:44 -07:00
src/Text	Change reader types, allowing better tracking of source positions.	2021-05-09 19:11:34 -06:00
test	Change reader types, allowing better tracking of source positions.	2021-05-09 19:11:34 -06:00
tools	build-arm.sh : remove strict checking on first ssh access	2021-03-18 10:22:49 -07:00
trypandoc	trypandoc: remove unneeded imports.	2021-03-02 10:03:52 -08:00
windows	Fix/update URLs and use HTTPS where possible (#7122 )	2021-02-26 17:56:04 -08:00
.editorconfig
.gitattributes	Added .gitattributes.	2019-09-15 10:40:59 -07:00
.gitignore	Improve .gitignore.	2021-01-15 22:41:39 -08:00
.hlint.yaml	Change reader types, allowing better tracking of source positions.	2021-05-09 19:11:34 -06:00
.mailmap	Add .mailmap	2019-01-07 08:44:40 +03:00
.stylish-haskell.yaml	More spellcheck	2018-07-02 19:07:28 +03:00
AUTHORS.md	Update changelog, AUTHORS.	2021-03-20 18:40:18 -07:00
BUGS
cabal.project	Fixed another typo in cabal.project.	2021-04-25 23:11:53 -07:00
changelog.md	Update changelog, AUTHORS.	2021-03-20 18:40:18 -07:00
CONTRIBUTING.md	Benchmark improvements.	2021-03-17 13:34:17 -07:00
COPYING.md	Fix/update URLs and use HTTPS where possible (#7122 )	2021-02-26 17:56:04 -08:00
COPYRIGHT	Fix/update URLs and use HTTPS where possible (#7122 )	2021-02-26 17:56:04 -08:00
default.nix	Use simple default.nix.	2021-01-13 09:52:09 -08:00
INSTALL.md	INSTALL: add note that parallel installations should be avoided. (#7147 )	2021-04-29 18:54:36 -07:00
Makefile	Change -A256m to -A8m in GHCOPTS	2021-03-18 13:07:54 -07:00
MANUAL.txt	MANUAL: Add information about `lang` and bibliography sorting.	2021-04-18 09:09:53 -07:00
pandoc.cabal	Change reader types, allowing better tracking of source positions.	2021-05-09 19:11:34 -06:00
README.md	Update README.md.	2021-03-04 08:58:28 -08:00
README.template	Fix/update URLs and use HTTPS where possible (#7122 )	2021-02-26 17:56:04 -08:00
RELEASE-CHECKLIST	Update RELEASE-CHECKLIST	2021-03-08 08:17:18 -08:00
release.nix	Use project.nix instead of default.nix for generated file.	2021-01-12 22:32:26 -08:00
Setup.hs	Removed custom Setup.hs, use build-type: simple.	2019-01-02 17:02:02 -08:00
shell.nix	shell.nix - install zlib	2021-02-02 14:47:14 -08:00
stack.yaml	Use dev texmath, citeproc.	2021-04-25 21:56:54 -07:00

README.md

Pandoc

The universal markup converter

Pandoc is a Haskell library for converting from one markup format to another, and a command-line tool that uses this library. It can convert from

bibtex (BibTeX bibliography)
biblatex (BibLaTeX bibliography)
commonmark (CommonMark Markdown)
commonmark_x (CommonMark Markdown with extensions)
creole (Creole 1.0)
csljson (CSL JSON bibliography)
csv (CSV table)
docbook (DocBook)
docx (Word docx)
dokuwiki (DokuWiki markup)
epub (EPUB)
fb2 (FictionBook2 e-book)
gfm (GitHub-Flavored Markdown), or the deprecated and less accurate markdown_github; use markdown_github only if you need extensions not supported in gfm.
haddock (Haddock markup)
html (HTML)
ipynb (Jupyter notebook)
jats (JATS XML)
jira (Jira/Confluence wiki markup)
json (JSON version of native AST)
latex (LaTeX)
markdown (Pandoc’s Markdown)
markdown_mmd (MultiMarkdown)
markdown_phpextra (PHP Markdown Extra)
markdown_strict (original unextended Markdown)
mediawiki (MediaWiki markup)
man (roff man)
muse (Muse)
native (native Haskell)
odt (ODT)
opml (OPML)
org (Emacs Org mode)
rst (reStructuredText)
t2t (txt2tags)
textile (Textile)
tikiwiki (TikiWiki markup)
twiki (TWiki markup)
vimwiki (Vimwiki)

It can convert to

asciidoc (AsciiDoc) or asciidoctor (AsciiDoctor)
beamer (LaTeX beamer slide show)
bibtex (BibTeX bibliography)
biblatex (BibLaTeX bibliography)
commonmark (CommonMark Markdown)
commonmark_x (CommonMark Markdown with extensions)
context (ConTeXt)
csljson (CSL JSON bibliography)
docbook or docbook4 (DocBook 4)
docbook5 (DocBook 5)
docx (Word docx)
dokuwiki (DokuWiki markup)
epub or epub3 (EPUB v3 book)
epub2 (EPUB v2)
fb2 (FictionBook2 e-book)
gfm (GitHub-Flavored Markdown), or the deprecated and less accurate markdown_github; use markdown_github only if you need extensions not supported in gfm.
haddock (Haddock markup)
html or html5 (HTML, i.e. HTML5/XHTML polyglot markup)
html4 (XHTML 1.0 Transitional)
icml (InDesign ICML)
ipynb (Jupyter notebook)
jats_archiving (JATS XML, Archiving and Interchange Tag Set)
jats_articleauthoring (JATS XML, Article Authoring Tag Set)
jats_publishing (JATS XML, Journal Publishing Tag Set)
jats (alias for jats_archiving)
jira (Jira/Confluence wiki markup)
json (JSON version of native AST)
latex (LaTeX)
man (roff man)
markdown (Pandoc’s Markdown)
markdown_mmd (MultiMarkdown)
markdown_phpextra (PHP Markdown Extra)
markdown_strict (original unextended Markdown)
mediawiki (MediaWiki markup)
ms (roff ms)
muse (Muse)
native (native Haskell)
odt (OpenOffice text document)
opml (OPML)
opendocument (OpenDocument)
org (Emacs Org mode)
pdf (PDF)
plain (plain text)
pptx (PowerPoint slide show)
rst (reStructuredText)
rtf (Rich Text Format)
texinfo (GNU Texinfo)
textile (Textile)
slideous (Slideous HTML and JavaScript slide show)
slidy (Slidy HTML and JavaScript slide show)
dzslides (DZSlides HTML5 + JavaScript slide show)
revealjs (reveal.js HTML5 + JavaScript slide show)
s5 (S5 HTML and JavaScript slide show)
tei (TEI Simple)
xwiki (XWiki markup)
zimwiki (ZimWiki markup)
the path of a custom Lua writer; see Custom writers below

Pandoc can also produce PDF output via LaTeX, Groff ms, or HTML.

Pandoc’s enhanced version of Markdown includes syntax for tables, definition lists, metadata blocks, footnotes, citations, math, and much more. See the User’s Manual below under Pandoc’s Markdown.

Pandoc has a modular design: it consists of a set of readers, which parse text in a given format and produce a native representation of the document (an abstract syntax tree or AST), and a set of writers, which convert this native representation into a target format. Thus, adding an input or output format requires only adding a reader or writer. Users can also run custom pandoc filters to modify the intermediate AST (see the documentation for filters and Lua filters).
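
The reader/AST/writer split described above can be sketched with a toy model. This is not pandoc's actual API: the `Inline` and `Doc` types, the `Reader`, `Writer`, and `Filter` aliases, and every function below are hypothetical, heavily simplified stand-ins (pandoc's real AST is far richer). The sketch only aims to show why a new format needs just one reader or one writer, and where a filter sits in the pipeline.

```haskell
module Main where

import Data.Char (toUpper)
import Data.List (intercalate)

-- Hypothetical, heavily simplified AST; NOT pandoc's real document model.
data Inline = Str String | Emph [Inline]
  deriving (Show, Eq)

-- A toy document is a list of paragraphs, each a list of inlines.
newtype Doc = Doc [[Inline]]
  deriving (Show, Eq)

type Reader = String -> Doc   -- source text  -> AST
type Writer = Doc -> String   -- AST          -> target text
type Filter = Doc -> Doc      -- AST          -> modified AST

-- Toy reader: every non-empty line is a paragraph; *word* is emphasis.
readToy :: Reader
readToy = Doc . map (map toInline . words) . filter (not . null) . lines
  where
    toInline ('*' : rest@(_ : _ : _))
      | last rest == '*' = Emph [Str (init rest)]
    toInline w = Str w

-- Toy HTML writer.
writeHtml :: Writer
writeHtml (Doc ps) = intercalate "\n" ["<p>" ++ inlines p ++ "</p>" | p <- ps]
  where
    inlines = unwords . map inl
    inl (Str s)   = s
    inl (Emph xs) = "<em>" ++ inlines xs ++ "</em>"

-- Toy LaTeX writer; it reuses the same AST, so no reader has to change.
writeLatex :: Writer
writeLatex (Doc ps) = intercalate "\n\n" (map inlines ps)
  where
    inlines = unwords . map inl
    inl (Str s)   = s
    inl (Emph xs) = "\\emph{" ++ inlines xs ++ "}"

-- Toy filter: upper-case all text while leaving the structure untouched.
upcase :: Filter
upcase (Doc ps) = Doc (map (map go) ps)
  where
    go (Str s)   = Str (map toUpper s)
    go (Emph xs) = Emph (map go xs)

-- Any reader composes with any list of filters and any writer.
convert :: Reader -> [Filter] -> Writer -> String -> String
convert reader filters writer src = writer (foldl (\d f -> f d) (reader src) filters)

main :: IO ()
main = do
  putStrLn (convert readToy [] writeHtml "hello *world*")
  putStrLn (convert readToy [upcase] writeLatex "hello *world*")
```

In this sketch, supporting a further output format means writing one more `Writer`; the existing reader and filter are reused unchanged, which is the property the paragraph above attributes to pandoc's design.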

Because pandoc’s intermediate representation of a document is less expressive than many of the formats it converts between, one should not expect perfect conversions between every format and every other. Pandoc attempts to preserve the structural elements of a document, but not formatting details such as margin size. And some document elements, such as complex tables, may not fit into pandoc’s simple document model. While conversions from pandoc’s Markdown to all formats aspire to be perfect, conversions from formats more expressive than pandoc’s Markdown can be expected to be lossy.

Installing

Here’s how to install pandoc.

Documentation

Pandoc’s website contains a full User’s Guide. It is also available here as pandoc-flavored Markdown. The website also contains some examples of the use of pandoc and a limited online demo.

Contributing

Pull requests, bug reports, and feature requests are welcome. Please make sure to read the contributor guidelines before opening a new issue.
