
Tests #43 (Open)

wants to merge 18 commits into master
Conversation

@davidt0x

Set up pytest tests using testbook for all notebooks (see the sketch below).
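A minimal sketch of what one of these tests could look like; the notebook path and test name here are hypothetical, not the ones in this PR:

    # test_notebooks.py -- run with pytest
    from testbook import testbook

    # execute=True runs every cell up front; any uncaught exception
    # in a cell fails the test
    @testbook('notebooks/isc/isc.ipynb', execute=True)
    def test_isc_notebook(tb):
        # reaching this point means every cell executed without raising
        pass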

Any notebook that requires datasets to run should place the code for downloading/extracting that data in download_data.sh. This allows systematic downloading and caching of datasets for testing.
The path construction for the dataset was prepending the user's $HOME directory. I removed this to make it more portable and consistent with the other notebooks. This makes it easier to track whether the dataset has already been downloaded, so we can cache things on della.
The htfa notebook expects the dataset to be in /data, which is where it resides in the Docker image. I extracted the download commands for the data files into download_data.sh and modified them to extract the data into a local data/ folder. I then modified the notebook to look for the data in both of these locations and raise an exception otherwise (see the sketch below).
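Roughly, the lookup logic is the following (a sketch only, not the notebook's exact code):

    import os

    # Prefer the Docker image location (/data), then fall back to the
    # local data/ folder populated by download_data.sh.
    candidates = ['/data', 'data']
    data_dir = next((d for d in candidates if os.path.isdir(d)), None)
    if data_dir is None:
        raise FileNotFoundError(
            'Dataset not found in /data or ./data; run download_data.sh first.')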
The rt-cloud repo is a dependency of the notebook. It is not pip-installable, so I added it as a git submodule (see below). I have also extracted a dependency list from rt-cloud/environment.yml to include with the other notebook dependencies.
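For reference, adding a repo as a submodule looks roughly like this; the URL and target path are assumptions, not necessarily what this PR uses:

    # register the submodule and record it in .gitmodules
    git submodule add https://github.com/brainiak/rt-cloud.git rt-cloud
    # after cloning this repo, fetch the submodule contents
    git submodule update --init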
…into tests

Conflicts:
	notebooks/real-time/rtcloud_notebook.ipynb
One final issue to fix is that the last cell in the notebook executes a Python script on the command line. Even when this script fails, the cell in the notebook does not fail. Importing the main function and running it directly ensures that tests fail if there are errors in sample.py, because a command-line execution that fails inside a cell is not considered a cell failure by testbook (is this a bug?). The change is sketched below.
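The before/after shape of that change, assuming sample.py exposes a main() that takes no arguments (the signature is an assumption):

    # Before: a failing shell command in a cell is not a cell failure
    # under testbook
    # !python sample.py

    # After: an uncaught exception in main() fails the cell, and the test
    from sample import main
    main()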
@review-notebook-app

Check out this pull request on ReviewNB to see visual diffs and provide feedback on Jupyter Notebooks.

@mihaic (Member) left a comment
Great work, @davidt0x! I have a couple of suggestions.

@@ -92,8 +92,7 @@
 "source": [
 "# Download and extract example data from Zenodo\n",
 "!wget https://zenodo.org/record/4300904/files/brainiak-aperture-isc-data.tgz\n",
-"!tar -xzf brainiak-aperture-isc-data.tgz\n",
-"!rm brainiak-aperture-isc-data.tgz"
+"!tar -xzf brainiak-aperture-isc-data.tgz\n"
How about adding --skip-old-files to minimize I/O? Same goes for any other extraction code in notebooks.
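Applied to the extraction cell above, that would be (a sketch; --skip-old-files is a standard GNU tar option that leaves already-extracted files alone):

    !tar --skip-old-files -xzf brainiak-aperture-isc-data.tgz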

@@ -92,8 +92,7 @@
 "source": [
 "# Download and extract example data from Zenodo\n",
 "!wget https://zenodo.org/record/4300904/files/brainiak-aperture-isc-data.tgz\n",
Should we have --no-clobber so the file is not downloaded again? Same goes for any other download code in notebooks.
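For the download cell above, that would look like (a sketch; --no-clobber is a standard wget flag that skips the download if the file already exists):

    !wget --no-clobber https://zenodo.org/record/4300904/files/brainiak-aperture-isc-data.tgz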

requirements.txt Outdated
@@ -0,0 +1,32 @@
+testbook
I think it is easier for notebook authors in the future if each notebook has its own requirements file.
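Concretely, the layout could look something like this (directory names are illustrative, not necessarily the repo's actual ones):

    notebooks/isc/requirements.txt
    notebooks/htfa/requirements.txt
    notebooks/real-time/requirements.txt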

@@ -111,7 +111,7 @@
 "source": [
 "*1.2 Load participant data*<a id=\"load_ppt\"></a>\n",
 "\n",
-"Any 4 dimensional fMRI data that is readible by nibabel can be used as input to this pipeline. For this example, data is taken from the open access repository DataSpace: http://arks.princeton.edu/ark:/88435/dsp01dn39x4181. This file is unzipped and placed in the home directory with the name Corr_MVPA "
+"Any 4 dimensional fMRI data that is readible by nibabel can be used as input to this pipeline. For this example, data is taken from the open access repository DataSpace: http://arks.princeton.edu/ark:/88435/dsp01dn39x4181. This file is unzipped and placed same directory as this notebook with the name Corr_MVPA "
@manojneuro (Collaborator) commented on Sep 30, 2021
@CameronTEllis FYI: please note a minor update to the data directory to make it compatible for automated testing.

@davidt0x (Author)

Ok, I think I have addressed your comments @mihaic. Can you take a look?

"Clear any pre-existing plot for this run using 'clearRunPlot(runNum)'\n",
"###################################################################################\n",
"/tmp/notebook-simdata/labels.npy\n",
"Collected training data for TR 0\n",
A Contributor commented

Hi David, can you clear the "outputs" created from running the notebook before checking in? I think it's in the Jupyter menu: Cell -> All Output -> Clear.

@davidt0x (Author)

Done
