pandoc/test/command/4742.md
John MacFarlane 0bdcf415e4 Switch from pretty-simple to pretty-show for native output.
Update tests.

Reason:  it turns out that the native output generated by
pretty-simple isn't always readable by the native reader.
According to https://github.com/cdepillabout/pretty-simple/issues/99
it is not a design goal of the library that the rendered values
be readable using 'read'.  This makes it unsuitable for our
purposes.

pretty-show is a bit slower and it uses 4-space indents
(non-configurable), but it doesn't have this serious drawback.
2021-09-28 21:17:53 -07:00

1.1 KiB

Check that the commonmark reader handles the ascii_identifiers extension properly.

% pandoc -f commonmark+gfm_auto_identifiers+ascii_identifiers -t native
# non ascii ⚠️ räksmörgås
^D
[ Header 1
  ( "non-ascii--raksmorgas", [], [] )
  [ Str "non"
  , Space
  , Str "ascii"
  , Space
  , Str "\9888\65039"
  , Space
  , Str "r\228ksm\246rg\229s"
  ]
]

Note that the emoji here is actually a composite character, formed from \9888 and \65039. The latter is a combining mark, so it survives...

% pandoc -f commonmark+gfm_auto_identifiers-ascii_identifiers -t native
# non ascii ⚠️ räksmörgås
^D
[ Header
    1
    ( "non-ascii-\65039-r\228ksm\246rg\229s" , [] , [] )
    [ Str "non"
    , Space
    , Str "ascii"
    , Space
    , Str "\9888\65039"
    , Space
    , Str "r\228ksm\246rg\229s"
    ]
]

gfm should have ascii_identifiers disabled by default.

% pandoc -f gfm -t native
# non ascii ⚠️ räksmörgås
^D
[ Header
    1
    ( "non-ascii-\65039-r\228ksm\246rg\229s" , [] , [] )
    [ Str "non"
    , Space
    , Str "ascii"
    , Space
    , Str "\9888\65039"
    , Space
    , Str "r\228ksm\246rg\229s"
    ]
]