pandoc

Author	SHA1	Message	Date
fiddlosopher	65047e354a	Remove just one leading and one trailing newline from contents of <pre>...</pre> in codeBlock parser. git-svn-id: https://pandoc.googlecode.com/svn/trunk@1023 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-17 03:08:14 +00:00
fiddlosopher	6f16d52c11	Changed parsing of code blocks in HTML reader: + <code> tag is no longer needed. <pre> suffices. + all HTML tags in the code block (e.g. for syntax highlighting) are skipped, because they are not portable to other output formats. git-svn-id: https://pandoc.googlecode.com/svn/trunk@1022 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-17 02:49:28 +00:00
fiddlosopher	d5b7257d7f	Simplified HTML attribute parsing (HTML reader). git-svn-id: https://pandoc.googlecode.com/svn/trunk@1016 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-15 00:44:32 +00:00
fiddlosopher	28c2ee396c	Fixed two bugs in HTML reader: + <code>...</code> not surrounded by <pre> should count as inline HTML, not code block. + parser for minimized attributes should not swallow trailing spaces git-svn-id: https://pandoc.googlecode.com/svn/trunk@1015 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-14 22:40:28 +00:00
fiddlosopher	4b4060b8ef	Simplified parsing of reference keys and notes in markdown and RST readers: + The Reference data structure from Text.Pandoc.Shared is no longer needed, since + referenceKey and noteBlock parses return strings (as many blank lines as are occuried by the key or note) and update state themselves. + getPosition and setPosition are now used to ensure that error messages will give the correct line number. + This yields cleaner (and slightly faster) code, with more accurate parsing error messages. git-svn-id: https://pandoc.googlecode.com/svn/trunk@1012 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-05 23:55:38 +00:00
fiddlosopher	0982a67585	LaTeX command and environment names can't contain numbers. LaTeX reader updated accordingly. git-svn-id: https://pandoc.googlecode.com/svn/trunk@987 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-02 00:36:44 +00:00
fiddlosopher	25bbe134cb	Skip notes parsing if running in strict mode. (This yields a nice speed improvement in strict mode.) git-svn-id: https://pandoc.googlecode.com/svn/trunk@983 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-01 20:26:40 +00:00
fiddlosopher	76d462c1cd	Simplify autolink parsing code, using Network.URI to test for URIs. Added dependency on network library to debian/control and pandoc.cabal. git-svn-id: https://pandoc.googlecode.com/svn/trunk@982 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-01 17:22:22 +00:00
fiddlosopher	f8f9fa49d6	More perspicuous definition of nonindentSpaces. git-svn-id: https://pandoc.googlecode.com/svn/trunk@981 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-01 16:08:47 +00:00
fiddlosopher	5c1632be5d	Removed unneeded 'try' in 'rawLine'. git-svn-id: https://pandoc.googlecode.com/svn/trunk@979 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-01 02:26:39 +00:00
fiddlosopher	85d49ee936	Combined linebreak and whitespace into a new whitespace parser, to avoid unnecessary reparsing of space characters. git-svn-id: https://pandoc.googlecode.com/svn/trunk@978 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-01 02:18:25 +00:00
fiddlosopher	8d3bec3e4d	Removed unnecessary 'try' in 'codeBlock'. git-svn-id: https://pandoc.googlecode.com/svn/trunk@977 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-01 02:10:30 +00:00
fiddlosopher	bdf78fe33f	Use lookAhead in parsers for setext headers and definition lists to see if the next line begins appropriately; if not, don't waste any more time parsing... git-svn-id: https://pandoc.googlecode.com/svn/trunk@976 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-01 02:01:12 +00:00
fiddlosopher	f55d62c04a	Don't require blank lines after code block. (It's sufficient to end code block with a nonindented line.) git-svn-id: https://pandoc.googlecode.com/svn/trunk@975 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-31 22:21:41 +00:00
fiddlosopher	c206332558	Changed definition of 'emph': italics with '_' must not be followed by an alphanumeric character. This is to help prevent interpretation of e.g. [LC_TYPE]: my_type as '[LC<em>TYPE]:my</em>type'. git-svn-id: https://pandoc.googlecode.com/svn/trunk@974 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-31 22:15:35 +00:00
fiddlosopher	3b790b80f3	Fixed bug in LaTeX reader, which wrongly assumed that the roman numeral after "enum" in "setcounter" would consist entirely of "i"s. enumiv is legitimate. git-svn-id: https://pandoc.googlecode.com/svn/trunk@961 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-30 19:12:50 +00:00
fiddlosopher	14dc520669	Cleaned up LaTeX reader. Rearranged order of parsers in inline for slight speed improvement. Added ` to special characters and 'unescapedChar'. git-svn-id: https://pandoc.googlecode.com/svn/trunk@960 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 21:03:26 +00:00
fiddlosopher	451b426fd6	Removed unneeded try's in RST reader; also minor code cleanup. git-svn-id: https://pandoc.googlecode.com/svn/trunk@959 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 20:39:31 +00:00
fiddlosopher	2a37d8d30a	Efficiency improvements to RST reader (more than doubled speed): + removed tabchar + rearranged parsers in inline git-svn-id: https://pandoc.googlecode.com/svn/trunk@958 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 20:22:24 +00:00
fiddlosopher	015644b60e	Purely stylistic change. git-svn-id: https://pandoc.googlecode.com/svn/trunk@957 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 20:03:28 +00:00
fiddlosopher	7f8ec9577e	Removed unneeded 'try' in 'ellipses'. git-svn-id: https://pandoc.googlecode.com/svn/trunk@956 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 19:59:08 +00:00
fiddlosopher	60f700997c	+ Fixed bug introduced into referenceTitle by previous changes. Now it works as before. + Improved Markdown.pl-compatibility in referenceLink: the two parts of a reference-style link may be separated by one space, but not more... [a] [link], [not] [a link]. git-svn-id: https://pandoc.googlecode.com/svn/trunk@955 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 19:57:01 +00:00
fiddlosopher	845a658aff	Fixed markdown inline code parsing so it better accords with Markdown.pl: the marker for the end of the code section is a clump of the same number of `'s with which the section began, followed by a non-` character. So, for example, ` h ``` i ` -> <code>h ``` i</code>. git-svn-id: https://pandoc.googlecode.com/svn/trunk@954 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 16:38:41 +00:00
fiddlosopher	77f63605f5	Small change to referenceTitle: should end with line-end, not ')'. git-svn-id: https://pandoc.googlecode.com/svn/trunk@953 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 01:53:50 +00:00
fiddlosopher	21a2acaac9	Split 'title' into 'linkTitle' and 'referenceTitle', since the rules are slightly different. git-svn-id: https://pandoc.googlecode.com/svn/trunk@952 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 01:51:03 +00:00
fiddlosopher	64023d8ba8	Removed unneeded 'try' from noteMarker. git-svn-id: https://pandoc.googlecode.com/svn/trunk@950 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 00:16:50 +00:00
fiddlosopher	40b870375d	Minor reformatting. git-svn-id: https://pandoc.googlecode.com/svn/trunk@949 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 00:11:37 +00:00
fiddlosopher	3edf5834e8	Rewrote 'para' for greater efficiency. git-svn-id: https://pandoc.googlecode.com/svn/trunk@948 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 00:08:18 +00:00
fiddlosopher	fcb91e8e51	Rewrote link parsers for greater efficiency. git-svn-id: https://pandoc.googlecode.com/svn/trunk@945 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 22:41:05 +00:00
fiddlosopher	cda7e7ac21	Removed redundant 'referenceLink' in definition of inline (it's already in 'link'). git-svn-id: https://pandoc.googlecode.com/svn/trunk@940 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 07:31:18 +00:00
fiddlosopher	6906584f47	Refactored escapeChar so it doesn't need 'try'. git-svn-id: https://pandoc.googlecode.com/svn/trunk@939 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 07:24:56 +00:00
fiddlosopher	06a5a0e235	Removed unneeded 'try' in multilineRow. git-svn-id: https://pandoc.googlecode.com/svn/trunk@938 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 07:22:21 +00:00
fiddlosopher	ce800f7121	Removed unneeded 'try' in dashedLine. git-svn-id: https://pandoc.googlecode.com/svn/trunk@937 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 07:20:47 +00:00
fiddlosopher	3e337eb9c8	Removed unneeded try in rawHtmlBlocks (Markdown parser). git-svn-id: https://pandoc.googlecode.com/svn/trunk@936 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 07:19:07 +00:00
fiddlosopher	74dc62e730	Refactored hrule for performance in Markdown reader. git-svn-id: https://pandoc.googlecode.com/svn/trunk@935 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 07:04:47 +00:00
fiddlosopher	8cf4971821	Minor reformatting. git-svn-id: https://pandoc.googlecode.com/svn/trunk@934 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 06:59:57 +00:00
fiddlosopher	0c475bfd47	Refactored setext header parsing in Markdown reader for greater speed. git-svn-id: https://pandoc.googlecode.com/svn/trunk@933 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 06:58:20 +00:00
fiddlosopher	18b379c1ca	More rearranging in definition of inline. git-svn-id: https://pandoc.googlecode.com/svn/trunk@932 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 06:44:47 +00:00
fiddlosopher	b6ebe75656	More intelligent rearranging of 'inline' for speed boosts in Text.Pandoc.Readers.Markdown. git-svn-id: https://pandoc.googlecode.com/svn/trunk@931 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 06:38:38 +00:00
fiddlosopher	f04df62db1	Changed definition of 'enclosed' in Text.Pandoc.Shared so that 'try' is not automatically applied to the 'end' parser. Added 'try' in calls to 'enclosed' where needed. Slight speed increase. git-svn-id: https://pandoc.googlecode.com/svn/trunk@926 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 05:58:21 +00:00
fiddlosopher	1d8fe2653a	Performance improvements: + Rearranged parsers in definition of 'inline' so that the most frequently used would (by and large) be tried first. + Removed some unneeded 'try's. + Removed tabchar parser, as whitespace handles tabs anyway. + All in all, these changes, together with the last two commits, cut almost in half the time it takes pandoc to parse a large test file. git-svn-id: https://pandoc.googlecode.com/svn/trunk@924 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 02:33:53 +00:00
fiddlosopher	1163e622ab	Don't count p. 27 at the beginning of a line as an ordered list start, since it's most likely a page number. git-svn-id: https://pandoc.googlecode.com/svn/trunk@900 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-26 03:17:40 +00:00
fiddlosopher	f11360f50e	Added new rule for enhanced markdown ordered lists: if the list marker is a capital letter followed by a period (including a single-letter capital roman numeral), then it must be followed by at least two spaces. The point of this is to avoid accidentally treating people's initials as list markers: a paragraph may begin: B. Russell was an English philosopher. and this shouldn't be treated as a list. Modified Markdown reader and README documentation. Added a test case. git-svn-id: https://pandoc.googlecode.com/svn/trunk@880 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-23 04:25:09 +00:00
fiddlosopher	4a0e02dab7	Added a needed 'try' to listItem in Markdown reader. git-svn-id: https://pandoc.googlecode.com/svn/trunk@878 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-22 20:19:37 +00:00
fiddlosopher	cc0460f952	Code cleanup in Markdown reader. git-svn-id: https://pandoc.googlecode.com/svn/trunk@877 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-20 19:21:18 +00:00
fiddlosopher	84aa875bdb	Changes to Markdown reader for better conformity to the Markdown test suite under --strict: + Removed check for a following setext header in endline. A full test is too inefficient (doubles benchmark time), and the substitute we had before is not 100% accurate. + Don't use Code elements for autolinks if --strict specified. git-svn-id: https://pandoc.googlecode.com/svn/trunk@876 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-20 18:52:49 +00:00
fiddlosopher	81eba062f2	Refactor RST and Markdown readers using parseFromString. git-svn-id: https://pandoc.googlecode.com/svn/trunk@864 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-19 00:18:46 +00:00
fiddlosopher	4e149f898a	Added a necessary "try" in definition of "para" (HTML reader). git-svn-id: https://pandoc.googlecode.com/svn/trunk@863 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-19 00:06:03 +00:00
fiddlosopher	4399db4fd2	Bug fixes in readers: + LaTeX reader: skip anything after \end{document} + HTML reader: fixed bug skipping material after </html> -- previously, stuff at the end was skipped even if no </html> was present, which meant only part of the file would be parsed and no error issued + HTML reader: added new constant eitherBlockOrInline with elements that may count either as block-level or inline + Modified isInline and isBlock to take this into account + modified rawHtmlBlock to accept any tag (even an inline tag); this is innocuous, because rawHtmlBlock is tried only if a regular inline element can't be parsed. git-svn-id: https://pandoc.googlecode.com/svn/trunk@862 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-18 23:44:26 +00:00
fiddlosopher	e48f046aa0	+ Fixed bug in markdown ordered list parsing. The problem was that anyOrderedListStart did not check for a space following the ordered list marker. So, 'A.B. 2007' would be parsed as a list item, then fail because of the lack of space after 'A.' (required by orderedListStart). Resolves Issue #22. + Fixed a similar problem in RST reader. + Added regression test. git-svn-id: https://pandoc.googlecode.com/svn/trunk@861 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-18 15:26:29 +00:00

... 3 4 5 6 7 ...

355 commits