diff --git a/debian/changelog b/debian/changelog
index a83cc7370..a875ebf66 100644
--- a/debian/changelog
+++ b/debian/changelog
@@ -2,10 +2,121 @@ pandoc (0.4) UNRELEASED; urgency=low
 
   [ John MacFarlane ]
 
-  * Added support for simple and multiline tables to markdown reader,
-    LaTeX writer, DocBook writer, and HTML writer. Added tests and
-    documentation in README.
-
+  * Added support for Markdown tables. Two kinds of tables are supported
+    (a simple table with one-line rows, and a more complex variety with
+    multiline rows). Currently only the Markdown reader and the LaTeX,
+    Docbook, and HTML writers support tables. The syntax is documented in
+    README.
+
+  * Refactored to avoid reliance on Haskell's Text.Regex library, which
+    (a) is slow, and (b) does not properly handle unicode. This fixed
+    some strange bugs, e.g. in parsing S-cedilla, and improved performance.
+
+    + Replaced 'gsub' with a general list function 'substitute'
+      that does not rely on Text.Regex.
+    + Rewrote extractTagType in HTML reader so that it doesn't use
+      regexes.
+    + In Markdown reader, replaced email regex test with a custom email
+      autolink parser (autoLinkEmail). Also replaced selfClosingTag regex
+      with a custom function isSelfClosingTag.
+    + Modified Docbook writer so that it doesn't rely on Text.Regex for
+      detecting 'mailto' links.
+    + Removed escapePreservingRegex and revamped entity-handling
+      functions in Text/Pandoc/Shared.hs and Text/Pandoc/Entities.hs to
+      avoid reliance on Text.Regex (see below on Entity handling changes).
+
+  * Changed handling of SGML entities. Entities are now parsed (and unicode
+    characters returned) in the Markdown and HTML readers, rather than being
+    handled in the writers. In HTML and Docbook writers, UTF-8 is now used
+    instead of entities for characters above 128. This makes the HTML and
+    Docbook output much more readable and more easily editable.
+
+    + Removed sgmlHexEntity, sgmlDecimalEntity, sgmlNamedEntity, and
+      sgmlCharacterEntity regexes from Shared.hs.
+    + Added parsers characterEntity, namedEntity, decimalEntity, hexEntity
+      to Entities.hs; these parse a string and return a unicode character.
+    + Added new 'entity' parser to Markdown reader, and added '&' as a
+      special character.
+    + Changed 'entity' parser in HTML reader to use characterEntity.
+    + Rewrote decodeEntities to use the new parsers instead of Text.Regex.
+    + Modified HTML and Markdown readers to call decodeEntities on all raw
+      strings (e.g. authors, dates, link titles), to ensure that no
+      unprocessed entities are included in the native representation of
+      the document. (In the HTML reader, most of this work is done by a
+      change in extractAttributeName.)
+    + Added escapeSGMLChar to Entities.hs. Modified escapeSGMLString to
+      use escapeSGMLChar.
+    + In SGML and Markdown output, escape unicode nonbreaking space as '&nbsp;',
+      since a unicode non-breaking space is impossible to distinguish visually
+      from a regular space. (Resolves issue #3.)
+    + Replaced all calls to stringToSGML and encodeEntities with calls to
+      escapeSGMLString.
+    + Rewrote escapeSGMLString for better performance.
+    + Added charToEntity and charToNumericalEntity to Entities.hs.
+      Removed encodeEntitiesNumerical.
+    + Use Data.Map for entityTable and (new) reverseEntityTable, for a
+      slight performance boost over the old association list.
+    + Removed unneeded decodeEntities from 'str' parser in HTML and
+      Markdown readers.
+
+  * Fixed several bugs in HTML reader (extractTagType, attribute parsing).
+
+  * Markdown reader:
+
+    + Fixed several bugs in smart quote recognition.
+    + Changed autoLink parsing to conform better to Markdown.pl's
+      behavior. is not treated as a link, but
+      , , and are.
+    + Cleaned up handling of embedded quotes in link titles. Now these are
+      stored as a '"' character, not as '&quot;'.
+    + Use lookAhead parser for the 'first pass' (looking for reference keys),
+      instead of parsing normally, then using setInput to reset input. This
+      yields a slight performance boost.
+
+  * Markdown writer: Use autolinks when possible. Instead of
+    [site.com](site.com), use .
+
+  * RST Reader:
+
+    + Allow the URI in a RST hyperlink target to start on the line
+      after the reference key.
+    + Added 'try' in front of 'string', where needed, or used a different parser,
+      in RST reader. This fixes a bug where ````` would not be correctly parsed as
+      a verbatim `.
+    + Fixed slow performance in parsing inline literals in RST reader. The
+      problem was that ``#`` was seen by 'inline' as a potential link or image.
+      Fix: inserted 'notFollowedBy (char '`')' in link parsers.
+      (Resolves issue #8.)
+    + Use lookAhead instead of getInput/setInput in RST reader. Removed
+      unneeded getState call, since lookAhead automatically saves and
+      restores the parser state.
+
+  * LaTeX Reader: replaced 'choice [(try (string ...), ...]' idiom with
+    'oneOfStrings' in LaTeX reader, for clarity.
+
+  * Modified LaTeX writer to insert '\,' between consecutive quotes.
+
+  * Text.ParserCombinators.Pandoc:
+
+    + Removed followedBy' parser, replacing it with the lookAhead parser from
+      Text/ParserCombinators/Parsec.
+    + Added some needed 'try's before multicharacter parsers, especially in
+      'option' contexts.
+    + Removed the 'try' from the 'end' parser in 'enclosed', so that
+      'enclosed' behaves like 'option', 'manyTill', etc.
+
+  * Improved website target:
+
+    + Use a subsidiary Makefile that can be run from the website
+      directory.
+    + Improved "Examples" page: added a templating system, syntax
+      highlighting of xml, tex, and html files, and a demo of
+      docbook postprocessed by xmlto.
+    + Download links now go to Google's download details page (with
+      SHA1 checksum) rather than directly to the files.
+
+  * Added FreeBSD port.
+
  -- Recai Oktaş Tue, 16 Jan 2007 00:37:21 +0200
 
 pandoc (0.3) unstable; urgency=low