gtp-benchmarks

Gradual Typing Performance benchmark programs.

This is a collection of Racket programs. Each program can run in exponentially-many configurations that differ in terms of their type annotations.

Companion software

Overview

The benchmarks are in the benchmarks/ folder. Each benchmark is made of 2-4 folders:

untyped/ a Racket version of the benchmark (the untyped configuration)
typed/ a Typed Racket version of the benchmark (the typed configuration)
(optional) both/ extra Racket / Typed Racket files
(optional) base/ extra libraries or data files

To Run

Quick route

Go to the benchmark's directory
Create a new directory
Copy in all the both/ files, if any
Copy in your choice of typed/ and untyped/ files
Run main.rkt

Official route

Install the gtp-measure package (raco pkg install gtp-measure)
Run raco gtp-measure <PATH-TO-BENCHMARK> (or run raco gtp-measure --help)
Follow its instructions to get the output

To run all benchmarks, copy and modify the sample manifest here and run via:

  PLTSTDERR="error info@gtp-measure" raco gtp-measure --output sample-data/ sample-gtp-measure-manifest.rkt

Results appear in a new directory ./sample-data/1/ and if you re-run the command new directories appear under ./sample-data/.

Semi-automatic route

Run racket utilities/make-configurations.rkt <PATH-TO-BENCHMARK>, this creates a directory with all typed/untyped configurations of the benchmark.
Go to one of the new directories, run main.rkt

Guidelines

The benchmarks try to meet the following "rock bottom" guidelines for giving reproducible performance data in reasonable time:

No I/O actions during timed computation
Able to run all typed/untyped configurations
Run for 1-5 seconds when untyped or fully-typed
Run for < 300 seconds in the worst case

Points 3 and 4 are in conflict. For forth in particular, the untyped configuration runs extremely quickly but some partially-typed configurations take close to 5 minutes.

Dependencies

require-typed-check

Cite

@inproceedings{g-rep-2023,
  author={Greenman, Ben},
  title={{GTP} Benchmarks for Gradual Typing Performance},
  booktitle={{REP}},
  publisher={{ACM}},
  pages={102--114},
  doi={10.1145/3589806.3600034},
  year={2023}
}

History

Original development: https://github.com/nuprl/gradual-typing-performance

Subsets / earlier-versions of these benchmarks have appeared in:

Is Sound Gradual Typing Dead?. Asumu Takikawa, Daniel Feltey, Ben Greenman, Max S. New, Jan Vitek, and Matthias Felleisen. POPL 2016.
How to Evaluate the Performance of Gradual Type Systems. Ben Greenman, Asumu Takikawa, Max S. New, Daniel Feltey, Robert Bruce Findler, Jan Vitek, and Matthias Felleisen. JFP 2019.
Sound Gradual Typing: Only Mostly Dead. Spenser Bauman, Carl Friedrich Bolz-Tereick, Jeremy Siek, and Sam Tobin-Hochstadt. OOPSLA 2017.

Name		Name	Last commit message	Last commit date
Latest commit History 161 Commits
benchmarks		benchmarks
patch		patch
scribblings		scribblings
utilities		utilities
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE.txt		LICENSE.txt
README.md		README.md
info.rkt		info.rkt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

benchmarks

benchmarks

patch

patch

scribblings

scribblings

utilities

utilities

.gitignore

.gitignore

.travis.yml

.travis.yml

LICENSE.txt

LICENSE.txt

README.md

README.md

info.rkt

info.rkt

Repository files navigation

gtp-benchmarks

Companion software

Overview

To Run

Quick route

Official route

Semi-automatic route

Guidelines

Dependencies

Cite

History

About

Releases 12

Packages

Contributors 5

Languages

License

utahplt/gtp-benchmarks

Folders and files

Latest commit

History

Repository files navigation

gtp-benchmarks

Companion software

Overview

To Run

Quick route

Official route

Semi-automatic route

Guidelines

Dependencies

Cite

History

About

Resources

License

Stars

Watchers

Forks

Languages