pandoc

Author	SHA1	Message	Date
fiddlosopher	f7b705b44c	Implemented implicit reference-style links to section headers in markdown. For example, if you have a header '# Supported architectures', you can link to it with '[Supported architectures]'. If there are multiple headers with this label, the link will point to the first of them. Implicit references are always overridden by explicitly specified references. Addresses Issue #20. + Moved isPunctuation, uniqueIdentifiers, and inlineListToIdentifier from Text.Pandoc.Writers.HTML to Text.Pandoc.Shared. + Added stHeaders to ParserState. This holds a list of header texts used in the document, and is used to construct implicit header references. + In Text.Pandoc.Readers.Markdown, added call to headerReference parser in initial parsing pass, constructing a list of section header labels. This is then passed to uniqueIdentifiers to produce identifiers, and a list of implicit references is constructed. This is added to the end of the explicitly specified references, so it will be overridden by explicitly specified references. All of this processing is skipped if --strict was specified. + Modified documentation in README. git-svn-id: https://pandoc.googlecode.com/svn/trunk@1086 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-11-22 17:14:21 +00:00
fiddlosopher	87d6d0d069	Fixed logic in markdown smart quote parsing: + Added some needed 'try' statements. + Unicode right single-quote can double as apostrophe, so treat it as a quote-end only when not followed by an alphanumeric character. git-svn-id: https://pandoc.googlecode.com/svn/trunk@1077 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-11-15 17:29:24 +00:00
fiddlosopher	ccb5fbb209	Fixed smart quote parsing in markdown reader so that unicode characters 8216 and 8217 are recognized as single quotes, and 8220 and 8221 as double quotes. git-svn-id: https://pandoc.googlecode.com/svn/trunk@1075 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-11-15 03:55:58 +00:00
fiddlosopher	bd7f5f3f7c	Fixed bug in LaTeX reader (pointed out by Mark Eli Kalderon): needed a "try" before "string" in parser for \[ math blocks. git-svn-id: https://pandoc.googlecode.com/svn/trunk@1068 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-11-10 21:44:05 +00:00
fiddlosopher	fe684764e6	Reverted back to state as of r1062. The template haskell changes are more trouble than they're worth. git-svn-id: https://pandoc.googlecode.com/svn/trunk@1064 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-11-03 23:27:58 +00:00
fiddlosopher	4a841bfc54	Use template haskell to avoid the need for templates: + Added library Text.Pandoc.Include, with a template haskell function $(includeStrFrom fname) to include a file as a string constant at compile time. + This removes the need for the 'templates' directory or Makefile target. These have been removed. + The base source directory has been changed from src to . + A new 'data' directory has been added, containing the ASCIIMathML.js script, writer headers, and S5 files. + The src/wrappers directory has been moved to 'wrappers'. + The Text.Pandoc.ASCIIMathML library is no longer needed, since Text.Pandoc.Writers.HTML can use includeStrFrom to include the ASCIIMathML.js code directly. It has been removed. git-svn-id: https://pandoc.googlecode.com/svn/trunk@1063 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-11-03 22:14:03 +00:00
fiddlosopher	d5adbcb774	Fixed bug in parsing files that begin with blank lines. + In Text.Pandoc.Shared: rewrote lineClump to parse EITHER a string of blank lines OR a string of nonblanks. Removed code for parsing eof. + In Markdown and RST readers, use 'manyTill (... <\|> lineClump) eof' instead of many, since lineClump no longer parses eof. git-svn-id: https://pandoc.googlecode.com/svn/trunk@1057 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-10-29 21:00:48 +00:00
fiddlosopher	63dfc3abf2	Modified specialChar in LaTeX reader so that '"' characters are parsed and do not cause an error. git-svn-id: https://pandoc.googlecode.com/svn/trunk@1056 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-10-28 23:30:29 +00:00
fiddlosopher	8144c54f82	Improvements to RST reader: + Allow field lists to be indented. + Parse the contents of field lists instead of treating them as raw strings. + Represent field lists as definition lists rather than blockquotes. + Fixed bug in which metadata would be overridden if the document contained more than one field list. + Parse fields associated with ..image: blocks, and use the 'alt' field, if present, for image alt text and title. git-svn-id: https://pandoc.googlecode.com/svn/trunk@1050 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-10-13 21:39:17 +00:00
fiddlosopher	ad9603231f	Fixed bug in RST reader: previously, code blocks had to be indented a full tabstop, but RST allows any amount of indentation. Resolves Issue #27. + removed 'variable' parameter from indentedBlock function in RST reader, as it is no longer needed + updated test suite + updated changelog git-svn-id: https://pandoc.googlecode.com/svn/trunk@1046 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-10-11 05:50:28 +00:00
fiddlosopher	fbb048238e	Markdown reader: require space before title in links and references. This fixes a bug in parsing URLs like http://silly/url(withparen). git-svn-id: https://pandoc.googlecode.com/svn/trunk@1025 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-18 01:34:36 +00:00
fiddlosopher	65047e354a	Remove just one leading and one trailing newline from contents of <pre>...</pre> in codeBlock parser. git-svn-id: https://pandoc.googlecode.com/svn/trunk@1023 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-17 03:08:14 +00:00
fiddlosopher	6f16d52c11	Changed parsing of code blocks in HTML reader: + <code> tag is no longer needed. <pre> suffices. + all HTML tags in the code block (e.g. for syntax highlighting) are skipped, because they are not portable to other output formats. git-svn-id: https://pandoc.googlecode.com/svn/trunk@1022 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-17 02:49:28 +00:00
fiddlosopher	d5b7257d7f	Simplified HTML attribute parsing (HTML reader). git-svn-id: https://pandoc.googlecode.com/svn/trunk@1016 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-15 00:44:32 +00:00
fiddlosopher	28c2ee396c	Fixed two bugs in HTML reader: + <code>...</code> not surrounded by <pre> should count as inline HTML, not code block. + parser for minimized attributes should not swallow trailing spaces git-svn-id: https://pandoc.googlecode.com/svn/trunk@1015 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-14 22:40:28 +00:00
fiddlosopher	4b4060b8ef	Simplified parsing of reference keys and notes in markdown and RST readers: + The Reference data structure from Text.Pandoc.Shared is no longer needed, since + referenceKey and noteBlock parses return strings (as many blank lines as are occuried by the key or note) and update state themselves. + getPosition and setPosition are now used to ensure that error messages will give the correct line number. + This yields cleaner (and slightly faster) code, with more accurate parsing error messages. git-svn-id: https://pandoc.googlecode.com/svn/trunk@1012 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-05 23:55:38 +00:00
fiddlosopher	0982a67585	LaTeX command and environment names can't contain numbers. LaTeX reader updated accordingly. git-svn-id: https://pandoc.googlecode.com/svn/trunk@987 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-02 00:36:44 +00:00
fiddlosopher	25bbe134cb	Skip notes parsing if running in strict mode. (This yields a nice speed improvement in strict mode.) git-svn-id: https://pandoc.googlecode.com/svn/trunk@983 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-01 20:26:40 +00:00
fiddlosopher	76d462c1cd	Simplify autolink parsing code, using Network.URI to test for URIs. Added dependency on network library to debian/control and pandoc.cabal. git-svn-id: https://pandoc.googlecode.com/svn/trunk@982 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-01 17:22:22 +00:00
fiddlosopher	f8f9fa49d6	More perspicuous definition of nonindentSpaces. git-svn-id: https://pandoc.googlecode.com/svn/trunk@981 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-01 16:08:47 +00:00
fiddlosopher	5c1632be5d	Removed unneeded 'try' in 'rawLine'. git-svn-id: https://pandoc.googlecode.com/svn/trunk@979 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-01 02:26:39 +00:00
fiddlosopher	85d49ee936	Combined linebreak and whitespace into a new whitespace parser, to avoid unnecessary reparsing of space characters. git-svn-id: https://pandoc.googlecode.com/svn/trunk@978 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-01 02:18:25 +00:00
fiddlosopher	8d3bec3e4d	Removed unnecessary 'try' in 'codeBlock'. git-svn-id: https://pandoc.googlecode.com/svn/trunk@977 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-01 02:10:30 +00:00
fiddlosopher	bdf78fe33f	Use lookAhead in parsers for setext headers and definition lists to see if the next line begins appropriately; if not, don't waste any more time parsing... git-svn-id: https://pandoc.googlecode.com/svn/trunk@976 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-09-01 02:01:12 +00:00
fiddlosopher	f55d62c04a	Don't require blank lines after code block. (It's sufficient to end code block with a nonindented line.) git-svn-id: https://pandoc.googlecode.com/svn/trunk@975 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-31 22:21:41 +00:00
fiddlosopher	c206332558	Changed definition of 'emph': italics with '_' must not be followed by an alphanumeric character. This is to help prevent interpretation of e.g. [LC_TYPE]: my_type as '[LC<em>TYPE]:my</em>type'. git-svn-id: https://pandoc.googlecode.com/svn/trunk@974 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-31 22:15:35 +00:00
fiddlosopher	3b790b80f3	Fixed bug in LaTeX reader, which wrongly assumed that the roman numeral after "enum" in "setcounter" would consist entirely of "i"s. enumiv is legitimate. git-svn-id: https://pandoc.googlecode.com/svn/trunk@961 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-30 19:12:50 +00:00
fiddlosopher	14dc520669	Cleaned up LaTeX reader. Rearranged order of parsers in inline for slight speed improvement. Added ` to special characters and 'unescapedChar'. git-svn-id: https://pandoc.googlecode.com/svn/trunk@960 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 21:03:26 +00:00
fiddlosopher	451b426fd6	Removed unneeded try's in RST reader; also minor code cleanup. git-svn-id: https://pandoc.googlecode.com/svn/trunk@959 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 20:39:31 +00:00
fiddlosopher	2a37d8d30a	Efficiency improvements to RST reader (more than doubled speed): + removed tabchar + rearranged parsers in inline git-svn-id: https://pandoc.googlecode.com/svn/trunk@958 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 20:22:24 +00:00
fiddlosopher	015644b60e	Purely stylistic change. git-svn-id: https://pandoc.googlecode.com/svn/trunk@957 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 20:03:28 +00:00
fiddlosopher	7f8ec9577e	Removed unneeded 'try' in 'ellipses'. git-svn-id: https://pandoc.googlecode.com/svn/trunk@956 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 19:59:08 +00:00
fiddlosopher	60f700997c	+ Fixed bug introduced into referenceTitle by previous changes. Now it works as before. + Improved Markdown.pl-compatibility in referenceLink: the two parts of a reference-style link may be separated by one space, but not more... [a] [link], [not] [a link]. git-svn-id: https://pandoc.googlecode.com/svn/trunk@955 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 19:57:01 +00:00
fiddlosopher	845a658aff	Fixed markdown inline code parsing so it better accords with Markdown.pl: the marker for the end of the code section is a clump of the same number of `'s with which the section began, followed by a non-` character. So, for example, ` h ``` i ` -> <code>h ``` i</code>. git-svn-id: https://pandoc.googlecode.com/svn/trunk@954 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 16:38:41 +00:00
fiddlosopher	77f63605f5	Small change to referenceTitle: should end with line-end, not ')'. git-svn-id: https://pandoc.googlecode.com/svn/trunk@953 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 01:53:50 +00:00
fiddlosopher	21a2acaac9	Split 'title' into 'linkTitle' and 'referenceTitle', since the rules are slightly different. git-svn-id: https://pandoc.googlecode.com/svn/trunk@952 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 01:51:03 +00:00
fiddlosopher	64023d8ba8	Removed unneeded 'try' from noteMarker. git-svn-id: https://pandoc.googlecode.com/svn/trunk@950 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 00:16:50 +00:00
fiddlosopher	40b870375d	Minor reformatting. git-svn-id: https://pandoc.googlecode.com/svn/trunk@949 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 00:11:37 +00:00
fiddlosopher	3edf5834e8	Rewrote 'para' for greater efficiency. git-svn-id: https://pandoc.googlecode.com/svn/trunk@948 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-29 00:08:18 +00:00
fiddlosopher	fcb91e8e51	Rewrote link parsers for greater efficiency. git-svn-id: https://pandoc.googlecode.com/svn/trunk@945 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 22:41:05 +00:00
fiddlosopher	cda7e7ac21	Removed redundant 'referenceLink' in definition of inline (it's already in 'link'). git-svn-id: https://pandoc.googlecode.com/svn/trunk@940 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 07:31:18 +00:00
fiddlosopher	6906584f47	Refactored escapeChar so it doesn't need 'try'. git-svn-id: https://pandoc.googlecode.com/svn/trunk@939 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 07:24:56 +00:00
fiddlosopher	06a5a0e235	Removed unneeded 'try' in multilineRow. git-svn-id: https://pandoc.googlecode.com/svn/trunk@938 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 07:22:21 +00:00
fiddlosopher	ce800f7121	Removed unneeded 'try' in dashedLine. git-svn-id: https://pandoc.googlecode.com/svn/trunk@937 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 07:20:47 +00:00
fiddlosopher	3e337eb9c8	Removed unneeded try in rawHtmlBlocks (Markdown parser). git-svn-id: https://pandoc.googlecode.com/svn/trunk@936 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 07:19:07 +00:00
fiddlosopher	74dc62e730	Refactored hrule for performance in Markdown reader. git-svn-id: https://pandoc.googlecode.com/svn/trunk@935 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 07:04:47 +00:00
fiddlosopher	8cf4971821	Minor reformatting. git-svn-id: https://pandoc.googlecode.com/svn/trunk@934 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 06:59:57 +00:00
fiddlosopher	0c475bfd47	Refactored setext header parsing in Markdown reader for greater speed. git-svn-id: https://pandoc.googlecode.com/svn/trunk@933 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 06:58:20 +00:00
fiddlosopher	18b379c1ca	More rearranging in definition of inline. git-svn-id: https://pandoc.googlecode.com/svn/trunk@932 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 06:44:47 +00:00
fiddlosopher	b6ebe75656	More intelligent rearranging of 'inline' for speed boosts in Text.Pandoc.Readers.Markdown. git-svn-id: https://pandoc.googlecode.com/svn/trunk@931 788f1e2b-df1e-0410-8736-df70ead52e1b	2007-08-28 06:38:38 +00:00

1 2 3 4

166 commits