pandoc/test/command/4742.md
John MacFarlane c266734448 Use pretty-simple to format native output.
Previously we used our own homespun formatting.  But this
produces over-long lines that aren't ideal for diffs in tests.
Easier to use something off-the-shelf and standard.

Closes #7580.

Performance is slower by about a factor of 10, but this isn't
really a problem because native isn't suitable as a serialization
format. (For serialization you should use json, because the reader
is so much faster than native.)
2021-09-21 12:37:42 -07:00

1.1 KiB

Check that the commonmark reader handles the ascii_identifiers extension properly.

% pandoc -f commonmark+gfm_auto_identifiers+ascii_identifiers -t native
# non ascii ⚠️ räksmörgås
^D
[ Header 1
  ( "non-ascii--raksmorgas", [], [] )
  [ Str "non"
  , Space
  , Str "ascii"
  , Space
  , Str "\9888\65039"
  , Space
  , Str "r\228ksm\246rg\229s"
  ]
]

Note that the emoji here is actually a composite character, formed from \9888 and \65039. The latter is a combining mark, so it survives...

% pandoc -f commonmark+gfm_auto_identifiers-ascii_identifiers -t native
# non ascii ⚠️ räksmörgås
^D
[ Header 1
  ( "non-ascii-\65039-r\228ksm\246rg\229s", [], [] )
  [ Str "non"
  , Space
  , Str "ascii"
  , Space
  , Str "\9888\65039"
  , Space
  , Str "r\228ksm\246rg\229s"
  ]
]

gfm should have ascii_identifiers disabled by default.

% pandoc -f gfm -t native
# non ascii ⚠️ räksmörgås
^D
[ Header 1
  ( "non-ascii-\65039-r\228ksm\246rg\229s", [], [] )
  [ Str "non"
  , Space
  , Str "ascii"
  , Space
  , Str "\9888\65039"
  , Space
  , Str "r\228ksm\246rg\229s"
  ]
]