
Notebook testing #511

Merged
merged 99 commits into brainiak:master on Oct 3, 2023

Conversation

davidt0x
Contributor

This PR implements CI testing of brainiak and the example notebooks on Princeton's research computing cluster, della. I am still working out some issues with failing or hanging tests on della; for now, those tests are skipped.

Notebooks have been added under docs/examples. Examples that need
to fetch data before running can provide a download_data.sh script
in their working directory. When tests are run in the CI environment
on della, these scripts are invoked and the data is fetched into a
cache if it isn't already present. The data is then copied to the
working directory of the notebook. Code for handling this has been
added to the pr-check.sh script and only runs on the della host.
Since the notebooks now run under pytest, I will need to cache
their executed results so that they can be rendered to HTML for
the docs.
- Turn off notebook tests by default; pass --enable_notebook_tests to
  pytest to enable them (see the sketch after this list).
- Remove an extraneous print statement from test_gbrsa that was
  cluttering logs.
- Fixed an issue with the data caching that was causing examples to be
  copied repeatedly (yikes!).
- Updated pytest syntax for turning off notebook tests by default.
- Needed to disable the sm BTL for the new version of OpenMPI.
- Needed to put an upper bound on tensorflow_probability. The new
  version requires TF 2.8, which is causing issues on della.
- Fixed a bug with the pytest argument enable_notebook_tests.
- Fixed assorted numpy warnings about using deprecated np.* aliases
  instead of Python primitive types for numpy array dtypes.
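
For reference, a flag like --enable_notebook_tests is typically wired up in conftest.py along these lines (an illustrative sketch, not the PR's actual conftest; the 'notebook' marker name is an assumption):

import pytest

def pytest_addoption(parser):
    # Off by default; enable with: pytest --enable_notebook_tests
    parser.addoption(
        "--enable_notebook_tests",
        action="store_true",
        default=False,
        help="Run the example notebook tests (skipped by default).",
    )

def pytest_collection_modifyitems(config, items):
    if config.getoption("--enable_notebook_tests"):
        return
    skip_nb = pytest.mark.skip(reason="need --enable_notebook_tests to run")
    for item in items:
        # Assumes notebook tests carry a 'notebook' marker.
        if "notebook" in item.keywords:
            item.add_marker(skip_nb)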
@mihaic
Member

mihaic commented Feb 25, 2022

@davidt0x, thank you for the hard work! Did the tests run on Jenkins?

davidt0x and others added 19 commits March 7, 2022 12:06
ARMA has been removed from statsmodels. Need to convert to the ARIMA
API now. Tried to do it but was getting NaNs in the test, so this
needs to be looked into.

Got rid of the 0.12 pin and instead fixed the underlying issue. It
was caused by ARMA being removed in favor of ARIMA. Had to make a
couple of slight modifications to the code to extract the parameters
and set the order of the model correctly so as to get ARMA out
of ARIMA.
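
For context, a minimal sketch of the ARMA-to-ARIMA conversion in statsmodels (illustrative only, not the exact change made in brainiak; the placeholder series y is an assumption):

import numpy as np
from statsmodels.tsa.arima.model import ARIMA

rng = np.random.default_rng(0)
y = rng.standard_normal(200)        # placeholder time series

p, q = 1, 1
# An ARMA(p, q) model is the special case of ARIMA with differencing d = 0.
result = ARIMA(y, order=(p, 0, q)).fit()
print(result.params)                # fitted AR/MA coefficients plus sigma2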
- Adding libgcc as a host and run dependency to meta.yml. Hopefully
this fixes the conda build issue. It seems like the system C++
library is getting picked up instead of the conda libs. The only
other thing I can think to do is to set LD_LIBRARY_PATH after
activating the environment.

- Notebook HTML generation was erroring out because one of the notebooks
uses Markdown headers that skip levels. I turned off warnings-as-errors
for now. Let's see how it looks.
It looks like MyST-NB can't handle embedded images (attachments).
I have converted to an HTML img tag with the image extracted to a
file. Needed to override the gitignore to include the png. Not sure
if this is the best thing to do, but it should work for now.
Run the workflow on push.
It isn't needed apparently; it is already installed.
@mihaic mentioned this pull request Jul 24, 2023
@mihaic linked an issue Jul 24, 2023 that may be closed by this pull request
@CameronTEllis
Contributor

@mihaic @davidt0x Is this PR waiting for review or is there still more to do?

E231 missing whitespace after ','

and

E721 do not compare types, for exact checks use `is` / `is not`, for instance checks use `isinstance()`
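
Illustrative examples of what these two flake8 fixes typically look like (not the exact diffs from this PR):

# E231: missing whitespace after ','
coords = (1,2)              # flagged
coords = (1, 2)             # fixed

# E721: do not compare types with ==
value = 3
if type(value) == int:      # flagged
    pass
if isinstance(value, int):  # fixed: instance check
    pass
if type(value) is int:      # fixed: exact type check
    pass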
For some reason tmp is running out of space on della; let's use
scratch for now.
The following warning generates an error in the doc build.

Warning, treated as error:
/home/runner/work/brainiak/brainiak/brainiak/factoranalysis/htfa.py:docstring of brainiak.factoranalysis.htfa:1:undefined label: 'metadata_routing'

Seems to be discussed here:

scikit-learn/scikit-learn#26747
Bad reference warnings are being treated as errors. We have a ton
(over 1200!) and building up the exclusion list is too much work
for now.
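
For context, this class of reference warning is often silenced in a Sphinx conf.py along these lines (a hypothetical sketch; the PR may instead have simply stopped treating warnings as errors, e.g. by dropping -W from the docs build):

# Hypothetical docs conf.py snippet, not the exact change made in this PR:
# silence the Sphinx warning category for unresolved :ref: targets, which is
# what the inherited scikit-learn docstring label ('metadata_routing') triggers.
suppress_warnings = ["ref.ref"]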
@mihaic
Member

mihaic commented Aug 8, 2023

@CameronTEllis, the PR is ready for merging as soon as the tests pass. The only one missing now is the Conda build. Until we fix it, even if we extract the commit to fix fmrisim issue #520, we would likely still not be able to build the Conda packages.

@CameronTEllis
Contributor

@mihaic I will review it in a few days so that once the conda build is working the PR is ready to go.

Contributor

@CameronTEllis left a comment


I have completed a review of 50/75 files, with the remaining files being outside of my expertise (e.g., .conda/build.sh, .github/workflows/main.yml, setup.py). Moreover, I did not review the docs/examples/ notebooks because I assume they are copies from aperture. If this is not the case, let me know.

My main concern is that mixed in with a lot of needed and important changes (e.g., changing type calls, using get_fdata) are a lot of additions that are specific to Princeton's setup on Della. This seems like it could confuse users from other institutions and will lead to clutter in the code. I am not sure about the appropriate solution, but I wonder if it would be better to have a branch on brainiak that holds these Della-specific scripts.

@@ -0,0 +1 @@
(binary, unreadable file content)

Should this be shared?

## Things to do once
Before you can run this notebook, you will have to take the following steps to set up our software framework:

1. Clone the [brainiak aperture repo](https://github.com/brainiak/brainiak-aperture.git) and the [rtcloud framework repo](https://github.com/brainiak/rt-cloud.git). The location of the repositories does not matter, but you should make a note of the paths.

It is probably obvious to others, but I didn't follow why aperture is needed, since it isn't mentioned again in the install or the RT cloud repo

if [[ "$is_della" == true ]]; then
conda deactivate
else
source deactivate

I don't think this is della-specific; it is generally encouraged to use conda activate and conda deactivate.

@@ -18,6 +18,34 @@

set -e

# Check whether we are running on Princeton's della compute cluster.
is_della=false

This script has a lot of carve-outs for Princeton's setup, which isn't a versatile solution for other groups, nor will it future-proof Brainiak within Princeton (e.g., if Della changes). Is there a way instead to make a separate script that replaces this one with the della-specific commands that are needed?

if [[ "$is_della" == true ]]; then
echo "Running on della head node, need to request time on a compute node"
export BRAINIAKDEV_MPI_COMMAND=srun
salloc -t 03:00:00 -N 1 -n 16 sh run-tests.sh $sdist_mode || \

Seems like this isn't della-specific but just Slurm-specific.

mpi_notebooks = ["htfa", "FCMA", "SRM"]

nb_tests = []
for f in nb_files:

Outputs here are specific to della, but without a clear reason why.
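
For orientation, notebook tests like the ones collected above can be parametrized roughly as follows (a hypothetical sketch, not the conftest code under review; the docs/examples path, the mpi marker, and the nbconvert invocation are assumptions):

import subprocess
from pathlib import Path

import pytest

EXAMPLES_DIR = Path("docs/examples")      # assumed location of the notebooks
mpi_notebooks = ["htfa", "FCMA", "SRM"]   # names taken from the hunk above

nb_files = sorted(EXAMPLES_DIR.rglob("*.ipynb"))

# Tag MPI notebooks with a custom marker so they can get special handling.
nb_params = [
    pytest.param(nb, marks=pytest.mark.mpi) if nb.stem in mpi_notebooks
    else pytest.param(nb)
    for nb in nb_files
]

@pytest.mark.parametrize("nb_path", nb_params, ids=lambda p: p.stem)
def test_notebook(nb_path):
    # Execute the notebook and fail the test if any cell raises.
    subprocess.run(
        ["jupyter", "nbconvert", "--to", "notebook", "--execute", str(nb_path)],
        check=True,
    )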

@mihaic
Member

mihaic commented Aug 17, 2023

Thank you for your review, @CameronTEllis.

You are right, Princeton-specific code is an issue. When we decided to incorporate the Aperture notebooks for testing, we observed long test times with GitHub infrastructure. We identified Princeton infrastructure as an alternative that will provide fast test times. To improve clarity, I propose we document the use of Della at the beginning of pr-check.sh. Going forward, Della use will be separated, as started in .github/workflows/della_notebooks.yml.

@lcnature
Contributor

Hi, I was facing some package incompatibility issues and started what I now realize was redoing some of the work in this PR. Just wondering if there is anything I can help with to move this PR forward. Thanks!

@mihaic
Member

mihaic commented Sep 18, 2023

Thanks for your offer to help, @lcnature. The PR is looking good with all tests passing. @davidt0x is working hard on the Conda builds and we will merge the PR as soon as they are fixed. Currently, Conda is taking hours to resolve the dependencies on Linux, but Mamba seems to make faster progress. If you feel like it, please try building the macOS Conda package.

@lcnature
Contributor


Thanks! Ah, macOS is the one thing I cannot help much with, as I don't use a Mac. Indeed, conda is a pain when it comes to solving dependencies, and mamba often seems to be the way to go.

- Trying to use mamba/boa to speed things up.
- Specify llvm-openmp for mac.
- Numpy deprecated np.bool; this causes theano to error on first import,
  and subsequent imports then fail with a different error.
- Also, FastSRM fails with newer numpy for an unrelated reason: np.array
  no longer automatically handles ragged arrays (dtype=object) in new
  numpy, so these need to be made explicit now (see the sketch below).
  I tried making them dtype=object explicitly, but this was causing
  failures later during SVD. Need to look into this more.
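
For reference, the ragged-array behaviour mentioned in the last item looks roughly like this (an illustrative sketch, not the FastSRM code itself):

import numpy as np

a = np.zeros((10, 3))
b = np.zeros((12, 3))     # different first dimension, so [a, b] is ragged

# np.array([a, b]) now errors (it was a deprecation warning in older numpy);
# the object dtype has to be requested explicitly, e.g. by pre-allocating.
ragged = np.empty(2, dtype=object)
ragged[0], ragged[1] = a, b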
@mihaic
Member

mihaic commented Oct 3, 2023

@vineetbansal, thank you very much for the fixes.

Although Codecov did not report the coverage results, they look fine on their website:
https://app.codecov.io/gh/brainiak/brainiak/pull/511

@davidt0x, thank you for seeing this huge PR through. It is finally time to merge it!

@mihaic enabled auto-merge October 3, 2023 16:25
@mihaic added this pull request to the merge queue Oct 3, 2023
Merged via the queue into brainiak:master with commit 92793e0 Oct 3, 2023
9 checks passed
5 participants