All Projects → BioMeCIS-Lab → OpenOmics

BioMeCIS-Lab / OpenOmics

Licence: MIT license
A bioinformatics API and web-app to integrate multi-omics datasets & interface with public databases.

Programming Languages

python
139335 projects - #7 most used programming language
CSS
56736 projects
TeX
3793 projects

Projects that are alternatives of or similar to OpenOmics

bacnet
BACNET is a Java based platform to develop website for multi-omics analysis
Stars: ✭ 12 (-45.45%)
Mutual labels:  genomics, multi-omics
vrs-python
GA4GH Variation Representation Python Implementation
Stars: ✭ 35 (+59.09%)
Mutual labels:  genomics
cosmosR
COSMOS (Causal Oriented Search of Multi-Omic Space) is a method that integrates phosphoproteomics, transcriptomics, and metabolomics data sets.
Stars: ✭ 30 (+36.36%)
Mutual labels:  data-integration
genepattern-server
The GenePattern Server web application
Stars: ✭ 26 (+18.18%)
Mutual labels:  genomics
LRSDAY
LRSDAY: Long-read Sequencing Data Analysis for Yeasts
Stars: ✭ 26 (+18.18%)
Mutual labels:  genomics
genoiser
use the noise
Stars: ✭ 15 (-31.82%)
Mutual labels:  genomics
snp-sites
Finds SNP sites from a multi-FASTA alignment file
Stars: ✭ 182 (+727.27%)
Mutual labels:  genomics
human genomics pipeline
A Snakemake workflow to process single samples or cohorts of paired-end sequencing data (WGS or WES) using trim galore/bwa/GATK4/parabricks.
Stars: ✭ 19 (-13.64%)
Mutual labels:  genomics
cryfa
A secure encryption tool for genomic data
Stars: ✭ 53 (+140.91%)
Mutual labels:  genomics
DriverPower
DriverPower
Stars: ✭ 22 (+0%)
Mutual labels:  genomics
gubbins
Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins
Stars: ✭ 103 (+368.18%)
Mutual labels:  genomics
DRAM
Distilled and Refined Annotation of Metabolism: A tool for the annotation and curation of function for microbial and viral genomes
Stars: ✭ 159 (+622.73%)
Mutual labels:  genomics
instaGRAAL
Large genome reassembly based on Hi-C data, continuation of GRAAL
Stars: ✭ 32 (+45.45%)
Mutual labels:  genomics
mity
mity: A highly sensitive mitochondrial variant analysis pipeline for whole genome sequencing data
Stars: ✭ 27 (+22.73%)
Mutual labels:  genomics
data-product-streaming
Template to deploy a Data Product for data stream processing into a Data Landing Zone of the Data Management & Analytics Scenario (former Enterprise-Scale Analytics). The Data Product template can be used by cross-functional teams to ingest, provide and create new data assets within the platform.
Stars: ✭ 32 (+45.45%)
Mutual labels:  data-integration
bigly
a pileup library that embraces the huge
Stars: ✭ 38 (+72.73%)
Mutual labels:  genomics
psmc
Implementation of the Pairwise Sequentially Markovian Coalescent (PSMC) model
Stars: ✭ 121 (+450%)
Mutual labels:  genomics
rl trading
No description or website provided.
Stars: ✭ 14 (-36.36%)
Mutual labels:  data-manipulation
dqlab-career-track
A collection of scripts written to complete DQLab Data Analyst Career Track 📊
Stars: ✭ 53 (+140.91%)
Mutual labels:  data-manipulation
CuteVCF
simple viewer for variant call format using htslib
Stars: ✭ 30 (+36.36%)
Mutual labels:  genomics

PyPI version Documentation Status pyOpenSci status DOI OpenOmics codecov

OpenOmics is currently under active development and we may break API compatibility in the future.

This Python package provide a series of tools to integrate and explore the genomics, transcriptomics, proteomics, and clinical data (aka multi-omics data). With interfaces to popular annotation databases and scalable data-frame manipulation tools, OpenOmics facilitates the common data wrangling tasks when preparing data for RNA-seq bioinformatics analysis.

Documentation (Latest | Stable) | OpenOmics at a glance

Features

OpenOmics assist in integration of heterogeneous multi-omics bioinformatics data. The library provides a Python API as well as an interactive Dash web interface. It features support for:

  • Genomics, Transcriptomics, Proteomics, and Clinical data.
  • Harmonization with 20+ popular annotation, interaction, disease-association databases.

OpenOmics also has an efficient data pipeline that bridges the popular data manipulation Pandas library and Dask distributed processing to address the following use cases:

  • Providing a standard pipeline for dataset indexing, table joining and querying, which are transparent and customizable for end-users.
  • Providing Efficient disk storage for large multi-omics dataset with Parquet data structures.
  • Integrating various data types including interactions and sequence data, then exporting to NetworkX graphs or data generators for down-stream machine learning.
  • Accessible by both developers and scientists with a Python API that works seamlessly with an external Galaxy tool interface or the built-in Dash web interface (WIP).

Installation via pip:

$ pip install openomics

Citations

The journal paper for this scientific package was reviewed by JOSS at https://joss.theoj.org/papers/10.21105/joss.03249#, and can be cited with:

# BibTeX
@article{Tran2021,
  doi = {10.21105/joss.03249},
  url = {https://doi.org/10.21105/joss.03249},
  year = {2021},
  publisher = {The Open Journal},
  volume = {6},
  number = {61},
  pages = {3249},
  author = {Nhat C. Tran and Jean X. Gao},
  title = {OpenOmics: A bioinformatics API to integrate multi-omics datasets and interface with public databases.},
  journal = {Journal of Open Source Software}
}

Credits

Thank you for extremely helpful feedback and guidance from the pyOpenSci reviewers. This package was created with the pyOpenSci/cookiecutter-pyopensci project template, based off audreyr/cookiecutter-pypackage.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].