This is a README describing the major analysis pipelines used in the Nolan Lab at the University of Edinburgh.
The main pipelines are designed to do "primary" data analysis, defined as transforming raw data into derived data that is used for further scientific analysis. For example, these pipelines turn raw electrophysiological data into spike trains, and videos into behavioural time series. The derived data is then in a simple format that should make further analysis by individual researchers straightforward.
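To illustrate what "derived data in a simple format" enables, here is a minimal sketch. The array-of-spike-times format is an assumption for illustration only (each experiment defines its own formats); the sketch bins spike times into a firing-rate time series with NumPy, the kind of downstream analysis the derived data is meant to support.

```python
import numpy as np

# Hypothetical derived data: spike times in seconds for one cell.
# (Illustrative toy values -- the real formats are documented per experiment.)
spike_times = np.array([0.05, 0.12, 0.13, 0.48, 0.51, 0.95])

# Bin the spike times into 100 ms bins over a 1 s window.
bin_edges = np.linspace(0.0, 1.0, 11)
counts, _ = np.histogram(spike_times, bins=bin_edges)

# Convert counts per 0.1 s bin into a firing rate in spikes per second.
firing_rate = counts * 10.0

print(firing_rate.tolist())
```

Because the derived data is just an array, a few lines like these are all that is needed to start a scientific analysis.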
Here is an example of a pipeline:
Raw data (red) is transformed into derived data (teal) by scripts (green). We try to stick to some simple rules to keep things easy:
- one script should produce one piece of output;
- each script is accessible by going to github.com/{script_name};
- it should be possible to run each script locally on your computer.
Each experimenter gets to decide how their input and output files are named and organised, but we try to follow the NeuroBlueprint format. The file naming and organisation must be documented in each experiment's documentation.
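For orientation, a NeuroBlueprint-style project layout looks roughly like this. This is a sketch only: the subject/session names and datatype folders are illustrative, so check the NeuroBlueprint specification and each experiment's documentation for the actual conventions.

```
project/
├── rawdata/
│   └── sub-001/
│       └── ses-001/
│           ├── ephys/      # raw electrophysiology recordings
│           └── behav/      # raw behavioural videos
└── derivatives/
    └── sub-001/
        └── ses-001/
            ├── ephys/      # e.g. spike trains from spike sorting
            └── behav/      # e.g. pose estimates from DeepLabCut
```

The key idea is that derived data mirrors the raw data hierarchy, so a script can find its inputs and write its outputs predictably.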
The pipelines have been used on experimental data obtained in the Lab. Each experiment is bespoke, and has its own peculiarities. As such, we keep separate documentation for each experiment although there is much overlap between them. The experiments with documentation are (note: you might need to be a member of the NolanLab GitHub organisation to view these):
These pages should contain all the information needed to understand the data obtained by each experiment, and run the pipeline. The pipeline consists of several scripts. The scripts are stored in GitHub repositories (repos).
We keep a repo at https://github.com/MattNolanLab/ for each of the major steps in the pipeline. The most important ones are:
- Spike sorting and ephys quality control: https://github.com/MattNolanLab/nolanlab-ephys
- Using DeepLabCut: https://github.com/MattNolanLab/nolanlab-dlc
These are simple "template" repos. For complex experiments, you can make a copy of the repo (e.g. by creating a branch or fork) and modify it to suit your needs. On the README of each repo linked above, there are links to the repos used by each individual experiment, so you can see how each experiment has modified the template code for its needs. This system is designed so that there is a good base to work from (e.g. MattNolanLab/nolanlab-ephys has a functional spike sorting pipeline in it) but each researcher can easily customise the pipeline if needed.
If this is all a bit overwhelming: don't worry. There is documentation, advice and instructions on each repo. Go have a look!
All analysis pipelines are version controlled using Git and hosted on GitHub. They are mostly written in Python; package management is organised to work well with uv (though you can use venv or conda if desired), and the pipelines are set up to run on the Edinburgh EDDIE supercomputer. Here are some general resources to help with these tools:
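As a sketch of what environment setup with uv might look like (assuming uv is installed; the package and script names below are placeholders, not the lab's actual ones):

```shell
# Create a virtual environment in .venv (uv's default location)
uv venv

# Install dependencies into it -- package names here are illustrative
uv pip install numpy pandas

# Run a pipeline script inside the environment
uv run python my_analysis_script.py
```

On EDDIE you would typically run these commands inside a job script or an interactive session; see the "uv on EDDIE" resource above for cluster-specific details.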
- What is Git?
- Making a project on GitHub
- Zen of Python
- Intro to Python
- uv on EDDIE
And we maintain some helper packages:
- Helpers for the EDDIE supercomputer: eddie-helper
- Loading data easily: loadi
We use many open source packages, primarily:
Many thanks to those who maintain these packages!
