54526525bf
Previously we had tested certain properties of the output PowerPoint slides. Corruption, though, comes as the result of a numebr of interrelated issues in the output pptx archive. This is a new approach, which compares the output of the Powerpoint writer with files that we know to (a) not be corrupt, and (b) to show the desired output behavior (details below). This commit introduces three tests using the new framework. More will follow. The test procedure: given a native file and a pptx file, we generate a pptx archive from the native file, and then test: 1. Whether the same files are in the two archives 2. Whether each of the contained xml files is the same. (We skip time entries in `docProps/core.xml`, since these are derived from IO. We just check to make sure that they're there in the same way in both files.) 3. Whether each of the media files is the same. Note that steps 2 and 3, though they compare multiple files, are one test each, since the number of files depends on the input file (if there is a failure, it will only report the first failed file comparison in the test failure). |
||
---|---|---|
.github | ||
benchmark | ||
data | ||
doc | ||
lib/fonts | ||
linux | ||
macos | ||
man | ||
src/Text | ||
test | ||
tools | ||
trypandoc | ||
windows | ||
.editorconfig | ||
.gitignore | ||
.hlint.yaml | ||
.stylish-haskell.yaml | ||
.travis.yml | ||
appveyor.yml | ||
AUTHORS.md | ||
BUGS | ||
changelog | ||
CONTRIBUTING.md | ||
COPYING.md | ||
COPYRIGHT | ||
default.nix | ||
INSTALL.md | ||
Makefile | ||
MANUAL.txt | ||
pandoc.cabal | ||
pandoc.hs | ||
README.md | ||
README.template | ||
RELEASE-CHECKLIST | ||
RELEASE-CHECKLIST.md | ||
Setup.hs | ||
stack.lts9.yaml | ||
stack.yaml |
Pandoc
The universal markup converter
Pandoc is a Haskell library for converting from one markup format to another, and a command-line tool that uses this library.
Pandoc can read Markdown, CommonMark, PHP Markdown Extra, GitHub-Flavored Markdown, MultiMarkdown, and (subsets of) Textile, reStructuredText, HTML, LaTeX, MediaWiki markup, TWiki markup, TikiWiki markup, Creole 1.0, Haddock markup, OPML, Emacs Org mode, DocBook, JATS, Muse, txt2tags, Vimwiki, EPUB, ODT, and Word docx.
Pandoc can write plain text,
Markdown,
CommonMark, PHP Markdown
Extra,
GitHub-Flavored
Markdown,
MultiMarkdown,
reStructuredText,
XHTML,
HTML5, LaTeX
(including beamer
slide shows),
ConTeXt,
RTF,
OPML, DocBook,
JATS,
OpenDocument,
ODT, Word
docx, GNU
Texinfo, MediaWiki
markup, DokuWiki
markup, ZimWiki
markup, Haddock
markup,
EPUB (v2 or v3),
FictionBook2,
Textile, groff
man, groff
ms, Emacs Org
mode,
AsciiDoc, InDesign
ICML,
TEI Simple,
Muse,
PowerPoint slide
shows and Slidy,
Slideous,
DZSlides,
reveal.js or
S5 HTML slide shows. It can also
produce PDF output on systems where LaTeX,
ConTeXt, pdfroff
, wkhtmltopdf
, prince
, or weasyprint
is
installed.
Pandoc’s enhanced version of Markdown includes syntax for tables,
definition lists, metadata blocks, Div
blocks, footnotes and
citations, embedded LaTeX (including math), Markdown inside HTML block
elements, and much more. These enhancements, described further under
Pandoc’s Markdown, can be disabled using the markdown_strict
format.
Pandoc has a modular design: it consists of a set of readers, which parse text in a given format and produce a native representation of the document (like an abstract syntax tree or AST), and a set of writers, which convert this native representation into a target format. Thus, adding an input or output format requires only adding a reader or writer. Users can also run custom pandoc filters to modify the intermediate AST.
Because pandoc’s intermediate representation of a document is less expressive than many of the formats it converts between, one should not expect perfect conversions between every format and every other. Pandoc attempts to preserve the structural elements of a document, but not formatting details such as margin size. And some document elements, such as complex tables, may not fit into pandoc’s simple document model. While conversions from pandoc’s Markdown to all formats aspire to be perfect, conversions from formats more expressive than pandoc’s Markdown can be expected to be lossy.
Installing
Here’s how to install pandoc.
Documentation
Pandoc’s website contains a full User’s Guide. It is also available here as pandoc-flavored Markdown. The website also contains some examples of the use of pandoc and a limited online demo.
Contributing
Pull requests, bug reports, and feature requests are welcome. Please make sure to read the contributor guidelines before opening a new issue.
License
© 2006-2018 John MacFarlane (jgm@berkeley.edu). Released under the GPL, version 2 or greater. This software carries no warranty of any kind. (See COPYRIGHT for full copyright and warranty notices.)