CONP-PCNO / conp-dataset

Licence: MIT license
📂 A DataLad dataset for CONP

CONP dataset

CONP dataset is a repository containing the datasets available in the Canadian Open Neuroscience Platform. It leverages DataLad to store metadata and references to data files distributed in various storage spaces and accessible depending on each data owner's policy.

The instructions below explain how to find and get data from the dataset. You can also add data by following the instructions in our contribution guidelines. We welcome your feedback! 😃

Dataset structure

The projects directory contains one sub-dataset per project.

Projects are responsible for the management and curation of their own sub-datasets.

Installing required software

git

sudo apt-get install git

It is useful to configure your git credentials to avoid having to enter them repeatedly:

git config --global user.name "yourusername"
git config --global user.email "[email protected]"

git-annex

First install the neurodebian package repository:

sudo apt-get install neurodebian

Then install the version of git-annex included in this repository:

sudo apt-get install git-annex-standalone

The version of git-annex installed can be verified with:

git annex version

As of May 12, 2020, this installs git-annex version 8.20200330, which works with CONP datasets. Earlier versions of git-annex may not work.
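The version check can also be scripted. The following is a minimal sketch, not part of the CONP tooling: the function names are hypothetical, and it assumes the output of git annex version contains a line of the form "git-annex version: 8.20200330-g971791563".

```python
import re

# Version that the text above reports as working with CONP datasets.
MIN_GIT_ANNEX = "8.20200330"


def parse_git_annex_version(output):
    """Return (major, date) parsed from `git annex version` output.

    Assumption: the output contains a line such as
    'git-annex version: 8.20200330-g971791563'.
    """
    match = re.search(r"git-annex version:\s*(\d+)\.(\d+)", output)
    if match is None:
        raise ValueError("no git-annex version line found")
    return int(match.group(1)), int(match.group(2))


def is_recent_enough(output, minimum=MIN_GIT_ANNEX):
    """True if the reported version is at least `minimum`."""
    major, date = minimum.split(".")
    return parse_git_annex_version(output) >= (int(major), int(date))
```

To check a real installation, pass in the text captured from the command, for example subprocess.run(["git", "annex", "version"], capture_output=True, text=True).stdout.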

DataLad

sudo apt-get install datalad

Getting the data

Install the main CONP dataset on your computer:

datalad install -r http://github.com/CONP-PCNO/conp-dataset

Get the files you are interested in:

datalad get <file_name>

This may require authentication depending on the data owner's configuration.
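If you want to drive the install and get steps from a script, one option is to wrap the same command lines in Python. This is a rough sketch only: the helper names are hypothetical, it uses nothing beyond the datalad commands shown above, and it assumes datalad is on your PATH.

```python
import subprocess

# URL taken from the install command above.
CONP_URL = "http://github.com/CONP-PCNO/conp-dataset"


def install_command(url=CONP_URL):
    """Command line that installs the dataset and its sub-datasets."""
    return ["datalad", "install", "-r", url]


def get_command(file_name):
    """Command line that fetches the content of one file."""
    return ["datalad", "get", file_name]


def run_in(directory, command):
    """Run a command inside `directory`; raises CalledProcessError on failure."""
    subprocess.run(command, cwd=directory, check=True)
```

For example, run_in(".", install_command()) followed by run_in("conp-dataset", get_command("<file_name>")) mirrors the two steps above. Authentication prompts, if any, still appear in the terminal.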

You can also search for relevant files and sub-datasets:

datalad search T1

Tests

  1. Execute python tests/create_tests.py from the root of the conp-dataset repository.
  2. Run pytest tests/ to execute the tests for all datasets in projects and investigators.
  3. To run the tests for a specific dataset, run pytest tests/test_<name of dataset>, for example pytest tests/test_projects_SIMON-dataset.
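The test target in step 3 appears to be derived from the dataset's path. The sketch below encodes that guess; the naming rule is inferred only from the single SIMON-dataset example, and pytest_target is a hypothetical helper, not part of the test suite.

```python
def pytest_target(dataset_path):
    """Map a dataset path to its pytest target.

    Assumed rule (inferred from one example): join the path components
    with underscores and prefix the result with 'tests/test_'.
    """
    parts = [part for part in dataset_path.split("/") if part]
    if not parts:
        raise ValueError("dataset_path must not be empty")
    return "tests/test_" + "_".join(parts)
```

With this rule, pytest_target("projects/SIMON-dataset") gives the target used in the example above. Check the files generated by tests/create_tests.py before relying on it for other datasets.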

For detailed explanations of the tests, please consult the test suite documentation.

Coding standards

To keep the Python code maintainable and readable, a suite of QA pipelines checks that the code meets the project's coding standards. Pull requests trigger a GitHub workflow that executes pre-commit.

To execute pre-commit locally, you will need to install pre-commit using your favorite method. Then, run:

pre-commit install

pre-commit run --all-files

Pre-commit will not let you commit until the reported issues are fixed. If this is a problem, you can skip pre-commit for a local commit by passing the --no-verify flag when committing; the QA checks will still run on your pull request.
