c266734448
Previously we used our own homespun formatting. But this produces over-long lines that aren't ideal for diffs in tests. Easier to use something off-the-shelf and standard. Closes #7580. Performance is slower by about a factor of 10, but this isn't really a problem because native isn't suitable as a serialization format. (For serialization you should use json, because the reader is so much faster than native.)
1.1 KiB
1.1 KiB
Check that the commonmark reader handles the ascii_identifiers
extension properly.
% pandoc -f commonmark+gfm_auto_identifiers+ascii_identifiers -t native
# non ascii ⚠️ räksmörgås
^D
[ Header 1
( "non-ascii--raksmorgas", [], [] )
[ Str "non"
, Space
, Str "ascii"
, Space
, Str "\9888\65039"
, Space
, Str "r\228ksm\246rg\229s"
]
]
Note that the emoji here is actually a composite character, formed from \9888 and \65039. The latter is a combining mark, so it survives...
% pandoc -f commonmark+gfm_auto_identifiers-ascii_identifiers -t native
# non ascii ⚠️ räksmörgås
^D
[ Header 1
( "non-ascii-\65039-r\228ksm\246rg\229s", [], [] )
[ Str "non"
, Space
, Str "ascii"
, Space
, Str "\9888\65039"
, Space
, Str "r\228ksm\246rg\229s"
]
]
gfm
should have ascii_identifiers
disabled by default.
% pandoc -f gfm -t native
# non ascii ⚠️ räksmörgås
^D
[ Header 1
( "non-ascii-\65039-r\228ksm\246rg\229s", [], [] )
[ Str "non"
, Space
, Str "ascii"
, Space
, Str "\9888\65039"
, Space
, Str "r\228ksm\246rg\229s"
]
]