An Introduction to Property-Based Testing

Created by: Zac Hatfield-Dodds and Ryan Soklaski

This tutorial is designed to introduce attendees to (the wonderful world of) automated testing in Python – specifically for data science applications.

An Introduction to Property-Based Testing

Introduction

This tutorial is designed to promote simple but powerful methods for bolstering your software-based work/research with property-based testing, and thereby help to close the aperture on “the things that could go wrong” because of your code. It is structured as four blocks, each consisting of a short talk, live-coded demo, and extensive exercises for attendees:

Property-Based Testing 101: core concepts and the core of the Hypothesis library
Describe your Data: from numbers, to arrays, to recursive and more complicated things
Common Tests: from "does not crash" to "write+read == noop" to 'metamorphic relations'
Putting it into Practice: use what you've learned to find real bugs in a real project!

Each of these blocks runs for 40-60 minutes, consisting of a 5-15 minute presentation, occasional live-coded demos, and guided exercises. Along with a short break in the middle, we find this pattern balances content, practice, and focus well for most classes.

You can read through the slides here, as a reference for later. At the end of each section, please work through the exercises before reading ahead!

Syllabus

The material will be offered as a half-day session. There will be a mixture of lectures with hands-on exercises. The following topics will be explored:

Property-Based Testing 101

For in-person tutorials, this section is a talk rather than exercises.

In this section we will take a tour through the taxonomy of testing: encountering manual tests, parameterized tests, and property-based tests. In the meanwhile, we will be introduced to the testing library Hypothesis, which provides us with automated test-case generation and reduction.

Describe your Data

It is often the case that the process of describing our data is by far the heaviest burden that we must bear when writing tests. This process of assessing "what variety of values should I test?", "have I thought of all the important edge-cases?", and "how much is 'enough'?" will crop up with nearly every test that we write. Hypothesis is a powerful Python library that empowers us to write a description (specification, to be more precise) of the data that we want to use to exercise our test. It will then generate test cases that satisfy this description and will run our test on these cases. Let's witness this in its most basic form.

Common Tests

In this section, we will put Hypothesis strategies for describing data to good use and discuss some common tactics for writing effective property-based tests for our code. This will familiarize with certain patterns and properties - e.g. roundtrip pairs, equivalent functions, metamorphic relationships - to look for when crafting property-based tests for our code.

Putting it into Practice

Now that we know how to write property-based tests, in this final section we will cover practical tips for using them as part of a larger project. Test-suite design patterns, use of performance and determinism settings, reproducing test failures from a CI server, etc; they're all practical topics and typically left out of shorter introductions!

Prerequisites

This tutorial is for anybody who regularly writes tests in Python, and would like an easier and more effective way to do so. We assume that you are comfortable reading, running, and writing traditional unit tests; and familar with ideas like assertions. Most attendees will have heard "given, when, then" and "arrange, act, assert". You may or may not have heard of pre- and post-conditions - we will explain what "property-based" means without reference to Haskell or anything algebraic.

Attendees are expected to have access to a computer with with Python 3.7+ installed on it and should know how to run Jupyter notebooks. A complete description of how to do this is detailed in Module 1 of Python Like You Meant It.

It is also recommended that you prepare yourself to work in your IDE of choice for Python. If you do not have an IDE that you are comfortable with yet, Visual Studio code and PyCharm are both highly recommended.

Preparing a Python Environment for this Tutorial

To complete the tasks in this tutorial you will need to setup a proper python environment. These are the big steps you are going to follow:

Create a new virtual environment (optional but recomended)
Install all required external and local packages in your new virtual environment

Creating The New Environment

Create an environment using pip

The instructions can be found at create and activate a virtual environment

Tip: do not forget to activate your environment after creating it!

Create an environment using conda

To create a mini-conda environment for this tutorial, install mini-conda and then, in your terminal, execute:

> conda create -n test-tutorial python=3.10 pip
> conda activate test-tutorial

Installing Required Packages

At this step you are going to install some external packages and also install a local package located at src/pbt_tutorial, so the functions defined there are accessible by our notebooks.

The external packages below will be installed:

numpy
notebook
pytest
hypothesis

Navigate to the top level of this repo and than execute the command below to install all required packages:

pip install -r requirements.txt

Details on the package installation

The first line of the requirements.txt file provided is

-e .

which actually means

pip install -e .

This command is necessary because in this tutorial, you will be populating your own toy Python project, and will populate code under ./src/pbt_tutorial.

For this code to be available to import in our notebooks we need to make a "dev-mode" installation of this library in our environment. This is made by navigating to the top level of this repo and running:

pip install -e .

The command will install pbt_tutorial so that any code that you add to it will immediately be reflected in your installation.

After running the command above the code under ./src/pbt_tutorial will be available to import. so if you add to ./src/pbt_tutorial/__init__.py the function def my_func(): return 1, then you will be able to import that function:

>>> from pbt_tutorial import my_func
>>> my_func()
1

For Instructors

The source material for this tutorial is written in jupytext-markdown in lieu of native Jupyter notebooks. These are markdown files that can be opened as Jupyter notebooks. These markdown notebooks contain both the exercises and solutions. To convert these files to Jupyter notebooks to be distributed to the students, with the solutions excised, we are using the cogbooks tool, which can be installed with: pip install cogbooks

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
.github/workflows		.github/workflows
notebooks		notebooks
src/pbt_tutorial		src/pbt_tutorial
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
runtime.txt		runtime.txt
setup.py		setup.py

License

rsokl/testing-tutorial

Folders and files

Latest commit

History

Repository files navigation

An Introduction to Property-Based Testing

Introduction

Syllabus

Property-Based Testing 101

Describe your Data

Common Tests

Putting it into Practice

Prerequisites

Preparing a Python Environment for this Tutorial

Creating The New Environment

Installing Required Packages

For Instructors

About

Resources

License

Stars

Watchers

Forks

Languages