Weakly Supervised Learning

This is the source code for the book Weakly Supervied Learning an incomplete book from 2019-2020 by Russell Jurney. The book itself is open source and can be found at https://github.com/rjurney/weakly_supervised_learning :)

In my previous book, Agile Data Science 2.0 (O’Reilly Media, 2017), I setup EC2 and Vagrant environments in which to run the book’s code but since 2017 the Python ecosystem has developed to the point that I am going to refrain from providing thorough installation documentation for every requirement. In this book I provide a Docker setup that is easy to use and also provide Anaconda and PyPi environments if you wish the run the code yourself locally. The website for each library is a better resource than I can possibly create, and they are updated and maintained more frequently than this book. I will instead list requirements, link to the project pages and let the reader install the requirements themselves. If you want to use a pre-built environment, use the Dockerfile and docker-compose.yml files included in the code repository for the book will “just work” on any operating system that Docker runs on: Linux, Mac OS X, Windows.

Software Prerequisites

Linux, Mac OS X or Windows - any OS with a Docker implementation
Git is used to check out the book’s source code
Docker is used to run the book’s examples in the same environment I wrote them in

Running Docker via `docker-compose`

To run the examples using docker-compose simply run:

docker-compose up --build -d

The --build builds the container using the local directory the first time you run it. The -d puts the Jupyter web server in the background, and is optional.

Now visit http://localhost:8888

If you run into problems, remove the -d argument to run it in the foreground and file an issue on Github with the command you used and the complete error output.

Running Docker directly via the `Dockerfile`

You can also build and run the docker image directly via the docker command and the Dockerfile :

docker build --tag weakly_supervised_learning .
docker container run \
    --publish 8888:8888 \
    --detach \
    --name weakly_supervised_learning \
    -v .:/weakly_supervised_learning_code \
        weakly_supervised_learning

Now visit http://localhost:8888

If you run into problems, remove the --detach argument to run it in the foreground and file an issue on Github with the command you used and the complete error output.

Running via Docker Hub

You can also use Docker Hub to pull and run the image directly:

docker pull rjurney/weakly_supervised_learning
docker run weakly_supervised_learning # add a volume for .

Now visit http://localhost:8888

Bugs, Errors or other Problems

If you run into problems, make sure you have the latest code with git pull origin master and if it persist then search the Github issues for the error. If a fix isn’t in the issues, then create a ticket and include the command you ran and the complete output of that command. You can find the Book’s issues on Github here: https://github.com/rjurney/weakly_supervised_learning_code/issues.

Running the Code Locally

I’ve defined two Python environments for the book using Conda and a Virtual Environment. Once you have setup the requirements, you can easily reproduce the environment in which the book was written and tested.

Software Prerequisites

The following requirements are needed if you run the code locally:

Python 3.7+ - I recommend Anaconda Python, but any Python will do
conda or virtualenv to recreate the Python environment I wrote the examples in
Recommended: An NVIDIA graphics card - you can work the examples without one, but CPU training is painfully slow
Recommended: CUDA 10.0 - for GPU acceleration in CuPy and Tensorflow
Recommended: cuDNN - for GPU acceleration in Tensorflow

The file environment.yml lists them for the conda environment system used by Anaconda Python. The library dependencies for the book are also defined in requirements.in, which PyPi - the Python Package Index can use via the pip command to install them. I recommend PyPi users create a Virtual Environment to ensure you replicate the book’s environment accurately.

The examples in the book are run as Jupyter Notebooks. Jupyter is included in both conda and pip environments.

Anaconda Python 3

To create a conda environment for the book, run:

conda env create -f environment.yml
conda activate weak

To deactivate the environment, run:

conda deactivate

Virtual Environment

To create a Virtual Environment in which to install the PyPi dependencies, run:

pip install --upgrade virtualenv
virtualenv -p `which python3` weak
source weak/bin/activate
pip install -r requirements.in

To deactivate the Virtual Environment, run:

source deactivate

Running Jupyter

If you’re using Docker, the image will install and run Jupyter for you. If you’re using your own Python environment, you need to run Jupyter:

cd </path/to/weakly_supervised_learning_code>
jupyter notebook &

Then visit http://localhost:8888 and open Introduction.ipynb or select the chapter file you want to read and run.

Name		Name	Last commit message	Last commit date
Latest commit History 177 Commits
bert @ 88a817c		bert @ 88a817c
bin		bin
ch02		ch02
ch03		ch03
ch04		ch04
ch05		ch05
data		data
lib		lib
snorkel-tutorials @ 93fc777		snorkel-tutorials @ 93fc777
.dockerignore		.dockerignore
.gitignore		.gitignore
.gitmodules		.gitmodules
Dockerfile		Dockerfile
Introduction.ipynb		Introduction.ipynb
README.md		README.md
conda.env.yaml		conda.env.yaml
conda.pip.requirements.txt		conda.pip.requirements.txt
conda.requirements.txt		conda.requirements.txt
docker-compose.yml		docker-compose.yml
download.sh		download.sh
paths.json		paths.json
requirements.dev.in		requirements.dev.in
requirements.in		requirements.in
settings.json		settings.json

rjurney/weakly_supervised_learning_code

Folders and files

Latest commit

History

Repository files navigation

Weakly Supervised Learning

Software Prerequisites

Running Docker via docker-compose

Running Docker directly via the Dockerfile

Running via Docker Hub

Bugs, Errors or other Problems

Running the Code Locally

Software Prerequisites

Anaconda Python 3

Virtual Environment

Running Jupyter

About

Resources

Stars

Watchers

Forks

Languages

Running Docker via `docker-compose`

Running Docker directly via the `Dockerfile`