pandoc/server/pandoc-server.md

306 lines
7.5 KiB
Markdown
Raw Normal View History

---
title: pandoc-server
section: 1
date: August 15, 2022
---
# SYNOPSIS
`pandoc-server` [*options*]
# DESCRIPTION
`pandoc-server` is a web server that can perform pandoc
conversions. It can be used either as a running server
or as a CGI program. To use `pandoc-server` as a CGI
program, rename it (or symlink it) as `pandoc-server.cgi`.
(Note: if you symlink it, you may need to adjust your
webserver's configuration in order to allow it to follow
symlinks for the CGI script.)
All pandoc functions are run in the PandocPure monad, which
ensures that they can do no I/O operations on the server.
This should provide a high degree of security. It does,
however, impose certain limitations:
- PDFs cannot be produced.
- Filters are not supported.
- Resources cannot be fetched via HTTP.
- Any images, include files, or other resources needed for
the document conversion must be explicitly included in
the request, via the `files` field (see below under API).
# OPTIONS
`--port NUM`
: HTTP port on which to run the server. Default: 3030.
`--timeout SECONDS`
: Timeout in seconds, after which a conversion is killed. Default: 2.
`--help`
2022-08-15 09:57:08 -07:00
: Print this help.
`--version`
: Print version.
# API
## Root endpoint
The root (`/`) endpoint accepts only POST requests.
It returns a converted document in one of the following
formats, depending on Accept headers:
- `text/plain`
- `application/json`
- `application/octet-stream`
If the result is a binary format (e.g., `epub` or `docx`)
and the content is returned as plain text or JSON, the
binary will be base64 encoded.
The body of the POST request should be a JSON object,
with the following fields. Only the `text` field is
required; all of the others can be omitted for default
values. When there are several string alternatives,
the first one given is the default.
`text` (string)
: the document to be converted. Note:
if the `from` format is binary (e.g., `epub` or `docx`), then
`text` should be a base64 encoding of the document.
`from` (string, default `"markdown"`)
: the input format, possibly with extensions, just as it is
specified on the pandoc command line.
`to` (string, default `"html"`)
: the output format, possibly with extensions, just as it is
specified on the pandoc command line.
`wrapText` (`"auto"|"preserve"|"none"`)
: text wrapping option: either `"auto"` (automatic
hard-wrapping to fit within a column width), `"preserve"`
(insert newlines where they are present in the source),
or `"none"` (don't insert any unnecessary newlines at all).
`columns` (integer, default 72)
: column width (affects text wrapping and calculation of
table column widths in plain text formats)
`standalone` (boolean, default false)
: if true, causes a standalone document to be produced, using
the default template or the custom template specified using
`template`. If false, a fragment will be produced.
`template` (string)
: string contents of a document template (see Templates in
`pandoc(1)` for the format).
`tabStop` (integer, default 4)
: tab stop (spaces per tab).
`indentedCodeClasses` (array of strings)
: list of classes to be applied to indented Markdown code blocks.
`abbreviations` (array of strings)
: list of strings to be regarded as abbreviations when
parsing Markdown. See `--abbreviations` in `pandoc(1)` for
details.
`defaultImageExtension` (string)
: extension to be applied to image sources that lack extensions
(e.g. `".jpg"`).
`trackChanges` (`"accept"|"reject"|"all"`)
: specifies what to do with insertions, deletions, and
comments produced by the MS Word "Track Changes" feature. Only
affects docx input.
`stripComments` (boolean, default false)
: causes HTML comments to be stripped in Markdown or Textile
source, instead of being passed through to the output format.
`citeproc` (boolean, default false)
: causes citations to be processed using citeproc. See
Citations in `pandoc(1)` for details.
`citeMethod` (`"citeproc"|"natbib"|"biblatex"`)
: determines how citations are formatted in LaTeX output.
`tableOfContents` (boolean, default false)
: include a table of contents (in supported formats).
`tocDepth` (integer, default 3)
: depth of sections to include in the table of contents.
`numberSections` (boolean, default false)
: automatically number sections (in supported formats).
`numberOffset` (array of integers)
: offsets to be added to each component of the section number.
For example, `[1]` will cause the first section to be
numbered "2" and the first subsection "2.1"; `[0,1]` will
cause the first section to be numbered "1" and the first
subsection "1.2."
`identifierPrefix` (string)
: prefix to be added to all automatically-generated identifiers.
`sectionDivs` (boolean, default false)
: arrange the document into a hierarchy of nested sections
based on the headings.
`htmlQTags` (boolean, default false)
: use `<q>` elements in HTML instead of literal quotation marks.
`listings` (boolean, default false)
: use the `listings` package to format code in LaTeX output.
`referenceLinks` (boolean, default false)
: create reference links rather than inline links in Markdown output.
`setextHeaders` (boolean, default false)
: use Setext (underlined) headings instead of ATX (`#`-prefixed)
in Markdown output.
`preferAscii` (boolean, default false)
: use entities and escapes when possible to avoid non-ASCII
characters in the output.
`referenceLocation` (`"document"|"block"|"section"`)
:
`topLevelDivision` (`"default"|"part"|"chapter"|"section"`)
:
`emailObfuscation` (`"none"|"references"|"javascript"`)
:
`htmlMathMethod` (`"plain"|"webtex"|"gladtex"|"mathml"|"mathjax"|"katex"`)
:
`variables` (JSON mapping)
:
`dpi` (integer, default 96)
:
`incremental` (boolean, default false)
:
`slideLevel` (integer)
:
`highlightStyle` (string, default `"pygments"`)
: pygments (the default), kate, monochrome, breezeDark,
espresso, zenburn, haddock, and tango or .theme file
`epubMetadata` (string)
:
`epubChapterLevel` (integer, default 1)
:
`epubSubdirectory` (string, default "EPUB")
:
`epubFonts` (array of file paths)
:
`referenceDoc` (file path)
:
`files` (JSON mapping of file paths to base64-encoded strings)
:
## `/batch` endpoint
The `/batch` endpoint behaves like the root endpoint,
except for these two points:
- It accepts a JSON array, each element of which is a JSON
object like the one expected by the root endpoint.
- It returns a JSON array of results. (It will not return
plain text or octet-stream, like the root endpoint.)
This endpoint can be used to convert a sequence of small
snippets in one request.
## `/version` endpoint
The `/version` endpoint accepts a GET request and returns
the pandoc version as a plain or JSON-encoded string,
depending on Accept headers.
## `/babelmark` endpoint
The `/babelmark` endpoint accepts a GET request with
the following query parameters:
- `text` (required string)
- `from` (optional string, default is `"markdown"`)
- `to` (optional string, default is `"html"`)
- `standalone` (optional boolean, default is `false`)
It returns a JSON object with fields `html` and `version`.
This endpoint is designed to support the
[Babelmark]()https://babelmark.github.io website.
# AUTHORS
Copyright 2022 John MacFarlane (jgm@berkeley.edu). Released
under the [GPL], version 2 or greater. This software carries no
warranty of any kind. (See COPYRIGHT for full copyright and
warranty notices.)
[GPL]: https://www.gnu.org/copyleft/gpl.html "GNU General Public License"