pandoc/benchmark/benchmark-pandoc.hs

import Text.Pandoc
import Text.Pandoc.Shared (readDataFile, normalize)
import Criterion.Main
import Data.List (isSuffixOf)
import Text.JSON.Generic

readerBench :: Pandoc
            -> (String, ReaderOptions -> String -> Pandoc)
            -> Benchmark
readerBench doc (name, reader) =
  let writer = case lookup name writers of
                     Just (PureStringWriter w) -> w
                     _ -> error $ "Could not find writer for " ++ name
      inp = writer def{ writerWrapText = True
                                       , writerLiterateHaskell =
                                          "+lhs" `isSuffixOf` name } doc
      -- we compute the length to force full evaluation
      getLength (Pandoc (Meta a b c) d) =
            length a + length b + length c + length d
  in  bench (name ++ " reader") $ whnf (getLength .
         reader def{ readerSmart = True
                   , readerLiterateHaskell = "+lhs" `isSuffixOf` name
                   }) inp

writerBench :: Pandoc
            -> (String, WriterOptions -> Pandoc -> String)
            -> Benchmark
writerBench doc (name, writer) = bench (name ++ " writer") $ nf
    (writer def{
                   writerWrapText = True
                  , writerLiterateHaskell = "+lhs" `isSuffixOf` name }) doc

normalizeBench :: Pandoc -> [Benchmark]
normalizeBench doc = [ bench "normalize - with" $ nf (encodeJSON . normalize) doc
                     , bench "normalize - without" $ nf encodeJSON doc
                     ]

main :: IO ()
main = do
  inp <- readDataFile (Just ".") "README"
  let opts = def{ readerSmart = True }
  let doc = readMarkdown opts inp
  let readerBs = map (readerBench doc) readers
  let writers' = [(n,w) | (n, PureStringWriter w) <- writers]
  defaultMain $
    map (writerBench doc) writers' ++ readerBs ++ normalizeBench doc
Added Benchmark.hs, testing all readers + writers using criterion. 2010-12-11 08:35:31 +01:00			`import Text.Pandoc`
Added normalize benchmark to Benchmark.hs. 2010-12-25 23:07:26 +01:00			`import Text.Pandoc.Shared (readDataFile, normalize)`
Added Benchmark.hs, testing all readers + writers using criterion. 2010-12-11 08:35:31 +01:00			`import Criterion.Main`
			`import Data.List (isSuffixOf)`
More accurate benchmark for normalize. 2010-12-31 00:32:34 +01:00			`import Text.JSON.Generic`
Added Benchmark.hs, testing all readers + writers using criterion. 2010-12-11 08:35:31 +01:00
			`readerBench :: Pandoc`
Fixed Benchmark to compile with latest changes. 2012-07-26 07:38:59 +02:00			`-> (String, ReaderOptions -> String -> Pandoc)`
Added Benchmark.hs, testing all readers + writers using criterion. 2010-12-11 08:35:31 +01:00			`-> Benchmark`
			`readerBench doc (name, reader) =`
			`let writer = case lookup name writers of`
Got rid of stateStandalone, which was hardly used anyway. The only possible effect will be with rst fragments that begin with an rst title block, which will now cause the header transform. 2012-07-26 05:08:42 +02:00			`Just (PureStringWriter w) -> w`
			`_ -> error $ "Could not find writer for " ++ name`
Moved WriterOptions and associated types Shared -> Options. 2012-07-27 07:59:56 +02:00			`inp = writer def{ writerWrapText = True`
Added Benchmark.hs, testing all readers + writers using criterion. 2010-12-11 08:35:31 +01:00			`, writerLiterateHaskell =`
			"+lhs" `isSuffixOf` name } doc
New HTML reader using tagsoup as a lexer. * The new reader is faster and more accurate. * API changes for Text.Pandoc.Readers.HTML: - removed rawHtmlBlock, anyHtmlBlockTag, anyHtmlInlineTag, anyHtmlTag, anyHtmlEndTag, htmlEndTag, extractTagType, htmlBlockElement, htmlComment - added htmlTag, htmlInBalanced, isInlineTag, isBlockTag, isTextTag * tagsoup is a new dependency. * Text.Pandoc.Parsing: Generalized type on readWith. * Benchmark.hs: Added length calculation to force full evaluation. * Updated HTML reader tests. * Updated markdown and textile readers to use the functions from the HTML reader. * Note: The markdown reader now correctly handles some cases it did not before. For example: <hr/> is reproduced without adding a space. <script> a = '<b>'; </script> is parsed correctly. 2010-12-23 05:25:15 +01:00			`-- we compute the length to force full evaluation`
			`getLength (Pandoc (Meta a b c) d) =`
			`length a + length b + length c + length d`
			`in bench (name ++ " reader") $ whnf (getLength .`
Fixed Benchmark to compile with latest changes. 2012-07-26 07:38:59 +02:00			`reader def{ readerSmart = True`
			, readerLiterateHaskell = "+lhs" `isSuffixOf` name
Moved stateLiterateHaskell to readerLiterateHaskell in Options. 2012-07-26 05:20:03 +02:00			`}) inp`
Added Benchmark.hs, testing all readers + writers using criterion. 2010-12-11 08:35:31 +01:00
			`writerBench :: Pandoc`
Benchmark: use nf for writers. whnf gives inaccurate results. 2010-12-13 08:24:02 +01:00			`-> (String, WriterOptions -> Pandoc -> String)`
Added Benchmark.hs, testing all readers + writers using criterion. 2010-12-11 08:35:31 +01:00			`-> Benchmark`
Benchmark: use nf for writers. whnf gives inaccurate results. 2010-12-13 08:24:02 +01:00			`writerBench doc (name, writer) = bench (name ++ " writer") $ nf`
Moved WriterOptions and associated types Shared -> Options. 2012-07-27 07:59:56 +02:00			`(writer def{`
Added Benchmark.hs, testing all readers + writers using criterion. 2010-12-11 08:35:31 +01:00			`writerWrapText = True`
			, writerLiterateHaskell = "+lhs" `isSuffixOf` name }) doc

More accurate benchmark for normalize. 2010-12-31 00:32:34 +01:00			`normalizeBench :: Pandoc -> [Benchmark]`
			`normalizeBench doc = [ bench "normalize - with" $ nf (encodeJSON . normalize) doc`
			`, bench "normalize - without" $ nf encodeJSON doc`
			`]`
Added normalize benchmark to Benchmark.hs. 2010-12-25 23:07:26 +01:00
Added type signature. 2012-07-26 19:02:00 +02:00			`main :: IO ()`
Added Benchmark.hs, testing all readers + writers using criterion. 2010-12-11 08:35:31 +01:00			`main = do`
			`inp <- readDataFile (Just ".") "README"`
Fixed Benchmark to compile with latest changes. 2012-07-26 07:38:59 +02:00			`let opts = def{ readerSmart = True }`
			`let doc = readMarkdown opts inp`
Added Benchmark.hs, testing all readers + writers using criterion. 2010-12-11 08:35:31 +01:00			`let readerBs = map (readerBench doc) readers`
Got rid of stateStandalone, which was hardly used anyway. The only possible effect will be with rst fragments that begin with an rst title block, which will now cause the header transform. 2012-07-26 05:08:42 +02:00			`let writers' = [(n,w) \| (n, PureStringWriter w) <- writers]`
Integrated benchmark into cabal. Can now do: cabal configure --enable-benchmarks && cabal build cabal bench --benchmark-option='markdown' --benchmark-option='-s 20' 2012-07-26 18:18:17 +02:00			`defaultMain $`
			`map (writerBench doc) writers' ++ readerBs ++ normalizeBench doc`
Added Benchmark.hs, testing all readers + writers using criterion. 2010-12-11 08:35:31 +01:00