+ <code> tag is no longer needed. <pre> suffices.
+ all HTML tags in the code block (e.g. for syntax highlighting)
are skipped, because they are not portable to other output formats.
git-svn-id: https://pandoc.googlecode.com/svn/trunk@1022 788f1e2b-df1e-0410-8736-df70ead52e1b
+ <code>...</code> not surrounded by <pre> should count as
inline HTML, not code block.
+ parser for minimized attributes should not swallow trailing spaces
git-svn-id: https://pandoc.googlecode.com/svn/trunk@1015 788f1e2b-df1e-0410-8736-df70ead52e1b
readers:
+ The Reference data structure from Text.Pandoc.Shared is no longer
needed, since the referenceKey and noteBlock parsers return strings
(as many blank lines as are occupied by the key or note) and update
state themselves.
+ getPosition and setPosition are now used to ensure that error messages
will give the correct line number.
+ This yields cleaner (and slightly faster) code, with more accurate
parsing error messages.
git-svn-id: https://pandoc.googlecode.com/svn/trunk@1012 788f1e2b-df1e-0410-8736-df70ead52e1b
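The blank-line technique described in the entry above can be illustrated with a minimal sketch. It is written in Python for brevity and is not the Haskell code from the readers; the function name, the simplified key pattern, and the example URL are assumptions made for illustration. Each reference key is recorded and replaced by an empty line, so later lines keep their original line numbers.

```python
import re

# Hypothetical, simplified pattern for a one-line reference key such as
# "[label]: http://example.com" (real keys can span lines and carry titles).
_KEY = re.compile(r"^\[([^\]]+)\]:\s*(\S+)\s*$")

def extract_reference_keys(source):
    """Return (text, keys): text has every key line blanked out,
    keys maps each label to its target."""
    keys = {}
    out = []
    for line in source.split("\n"):
        m = _KEY.match(line)
        if m:
            keys[m.group(1)] = m.group(2)
            out.append("")  # blank line keeps the line count unchanged
        else:
            out.append(line)
    return "\n".join(out), keys
```

Because the output has exactly as many lines as the input, an error reported on a given line of the second pass points at the same line in the original file.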
definition lists to see if the next line begins
appropriately; if not, don't waste any more time
parsing...
git-svn-id: https://pandoc.googlecode.com/svn/trunk@976 788f1e2b-df1e-0410-8736-df70ead52e1b
by an alphanumeric character. This is to help prevent interpretation
of e.g. [LC_TYPE]: my_type as '[LC<em>TYPE]:my</em>type'.
git-svn-id: https://pandoc.googlecode.com/svn/trunk@974 788f1e2b-df1e-0410-8736-df70ead52e1b
roman numeral after "enum" in "setcounter" would consist entirely
of "i"s. enumiv is legitimate.
git-svn-id: https://pandoc.googlecode.com/svn/trunk@961 788f1e2b-df1e-0410-8736-df70ead52e1b
Rearranged order of parsers in inline for slight speed improvement.
Added ` to special characters and 'unescapedChar'.
git-svn-id: https://pandoc.googlecode.com/svn/trunk@960 788f1e2b-df1e-0410-8736-df70ead52e1b
Now it works as before.
+ Improved Markdown.pl-compatibility in referenceLink: the two
parts of a reference-style link may be separated by one space,
but not more: [a] [link] is a link, but [not]  [a link] is not.
git-svn-id: https://pandoc.googlecode.com/svn/trunk@955 788f1e2b-df1e-0410-8736-df70ead52e1b
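A rough sketch of the spacing rule for reference-style links described above, in Python rather than the reader's Haskell. The pattern and the function name are hypothetical and cover only the single-line case.

```python
import re

# Two bracketed parts separated by at most one space (assumed simplification).
REFERENCE_LINK = re.compile(r"\[([^\]]+)\] ?\[([^\]]*)\]")

def parse_reference_link(text):
    """Return (label, reference) if text starts with a reference-style
    link, otherwise None."""
    match = REFERENCE_LINK.match(text)
    return match.groups() if match else None
```

With this pattern, "[a] [link]" and "[a][link]" are accepted, while a two-space gap between the bracketed parts is rejected.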
Markdown.pl: the marker for the end of the code section is
a clump of the same number of `'s with which the section began,
followed by a non-` character. So, for example,
` h ``` i ` -> <code>h ``` i</code>.
git-svn-id: https://pandoc.googlecode.com/svn/trunk@954 788f1e2b-df1e-0410-8736-df70ead52e1b
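The end-marker rule for code sections stated above can be sketched as follows. This is a Python illustration under simplifying assumptions, not the reader's actual parser; the function name is hypothetical, and trimming the surrounding spaces is inferred from the example in the entry.

```python
def parse_code_span(text):
    """Parse an inline code span at the start of text.
    Returns (code, rest), or None if there is no complete span."""
    # Count the backticks that open the span.
    n = len(text) - len(text.lstrip("`"))
    if n == 0:
        return None
    i = n
    while i < len(text):
        if text[i] != "`":
            i += 1
            continue
        # Measure the whole clump of backticks starting at i.
        j = i
        while j < len(text) and text[j] == "`":
            j += 1
        if j - i == n:
            # Same length as the opener: this clump closes the span.
            return text[n:i].strip(), text[j:]
        i = j  # longer or shorter clump: part of the code, keep scanning
    return None
```

Measuring each clump in full is what lets a run of three backticks sit inside a span opened by one.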
'try' is not automatically applied to the 'end' parser.
Added 'try' in calls to 'enclosed' where needed. Slight speed
increase.
git-svn-id: https://pandoc.googlecode.com/svn/trunk@926 788f1e2b-df1e-0410-8736-df70ead52e1b
+ Rearranged parsers in definition of 'inline' so that the most
frequently used would (by and large) be tried first.
+ Removed some unneeded 'try's.
+ Removed tabchar parser, as whitespace handles tabs anyway.
+ All in all, these changes, together with the last two commits,
cut almost in half the time it takes pandoc to parse a large test file.
git-svn-id: https://pandoc.googlecode.com/svn/trunk@924 788f1e2b-df1e-0410-8736-df70ead52e1b
p. 27
at the beginning of a line as an ordered list start, since
it's most likely a page number.
git-svn-id: https://pandoc.googlecode.com/svn/trunk@900 788f1e2b-df1e-0410-8736-df70ead52e1b
is a capital letter followed by a period (including a single-letter
capital roman numeral), then it must be followed by at least two spaces.
The point of this is to avoid accidentally treating people's initials as
list markers: a paragraph may begin:
B. Russell was an English philosopher.
and this shouldn't be treated as a list.
Modified Markdown reader and README documentation.
Added a test case.
git-svn-id: https://pandoc.googlecode.com/svn/trunk@880 788f1e2b-df1e-0410-8736-df70ead52e1b
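The two-space rule for capital-letter markers described above might look like this in outline. It is a simplified Python sketch, not the Markdown reader's code; the function name is hypothetical, and only single letters and decimal numbers followed by a period are handled (no parentheses or multi-letter roman numerals).

```python
import re

# A marker (one letter or a decimal number), a period, then at least one space.
_MARKER = re.compile(r"([A-Za-z]|[0-9]+)\.( +)")

def is_ordered_list_start(line):
    """Return True if line begins with an ordered list marker."""
    m = _MARKER.match(line)
    if m is None:
        return False
    marker, spaces = m.groups()
    if len(marker) == 1 and marker.isupper():
        # A capital letter may be someone's initial: require two spaces.
        return len(spaces) >= 2
    return True
```

Under these assumptions, "B. Russell was an English philosopher." is left as a paragraph, while "B.  Second item" (two spaces) still starts a list item.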
Markdown test suite under --strict:
+ Removed check for a following setext header in endline.
A full test is too inefficient (doubles benchmark time), and
the substitute we had before is not 100% accurate.
+ Don't use Code elements for autolinks if --strict specified.
git-svn-id: https://pandoc.googlecode.com/svn/trunk@876 788f1e2b-df1e-0410-8736-df70ead52e1b
+ LaTeX reader: skip anything after \end{document}
+ HTML reader: fixed bug skipping material after </html> -- previously,
stuff at the end was skipped even if no </html> was present, which
meant only part of the file would be parsed and no error issued
+ HTML reader: added new constant eitherBlockOrInline with elements that
may count either as block-level or inline
+ Modified isInline and isBlock to take this into account
+ Modified rawHtmlBlock to accept any tag (even an inline tag);
this is innocuous, because rawHtmlBlock is tried only if a regular
inline element can't be parsed.
git-svn-id: https://pandoc.googlecode.com/svn/trunk@862 788f1e2b-df1e-0410-8736-df70ead52e1b
that anyOrderedListStart did not check for a space following the
ordered list marker. So, 'A.B. 2007' would be parsed as a list item,
then fail because of the lack of space after 'A.' (required by
orderedListStart). Resolves Issue #22.
+ Fixed a similar problem in RST reader.
+ Added regression test.
git-svn-id: https://pandoc.googlecode.com/svn/trunk@861 788f1e2b-df1e-0410-8736-df70ead52e1b