/nwb_datajoint

Neuroscience data pipeline for reproducible research used by Loren Frank Lab, UCSF

Primary LanguageJupyter NotebookMIT LicenseMIT

nwb_datajoint

The Frank lab Datajoint pipeline facilitates the storage, analysis, and sharing of neuroscience data to support reproducible research. It integrates existing open-source projects into a coherent framework so that they can be easily used.

Setup

Installing packages

  1. Clone this repository:

    git clone https://github.com/LorenFrankLab/nwb_datajoint.git
  2. Set up and activate a conda environment from environment.yml:

    cd nwb_datajoint
    conda env create -f environment.yml
    conda activate nwb_datajoint
  3. Install this repository:

    pip install -e .

Setting up database access

  1. Ask Loren or Eric to set up an account for you on the Frank lab database. Note that you have to be connected to UCSF LAN to access this server.

    If you're not affiliated with UCSF or if you are just looking to try out nwb_datajoint, then you will need to set up a different MySQL server. For example, you can set up your own local server with a Docker image of a MySQL server configured for Datajoint (see instructions)

  2. Add the following environment variables (e.g. in ~/.bashrc). This example assumes that you are interacting with the database on a computer that has mounted stelmo at /stelmo (if the mount location is different, change accordingly). For this to take effect, log out and log back in, or run source ~/.bashrc in the terminal.

    export NWB_DATAJOINT_BASE_DIR="/stelmo/nwb/"
    export SPIKE_SORTING_STORAGE_DIR="/stelmo/nwb/spikesorting"
    export ANALYSIS_LOCK_FILE="/home/lorenlab/nwb_datajoint_lock/.lockfile"
    export DJ_SUPPORT_FILEPATH_MANAGEMENT="TRUE"
    export KACHERY_DAEMON_HOST="typhoon"
    export KACHERY_DAEMON_PORT="14747"
    export KACHERY_TEMP_DIR="/stelmo/nwb/tmp"

    If you're not connected to UCSF network, then you will have to host your own kachery node for curating spike sorting. Go to kacheryhub, log in with your Google credentials, and then click on 'How to host a kachery node` for more information.

  3. Configure DataJoint. To connect to the Datajoint database, we have to specify information about it such as the hostname and the port. You should also change your password from the temporary one you were given. Go to the config directory, and run dj_config.py in the terminal with your username:

    cd config # change to the config directory
    python dj_config.py <username> # run the configuration script

    Again, if you're using your own MySQL server, then you may need to change the other settings as well.

Finally, open up a python console (e.g. run ipython from terminal) and import nwb_datajoint to check that the installation has worked.

Tutorials

The tutorials for nwb_datajoint is currently in the form of Jupyter Notebooks and can be found in the notebooks directory. We strongly recommend opening them in the context of jupyterlab.

  • 0_intro.ipynb: general introduction to the database
  • 1_spikesorting.ipynb: how to run spike sorting
  • 2_curation.ipynb: how to curate spike sorted data