nanxstats / Rcpi

Licence: Artistic-2.0 License
Molecular informatics toolkit with a comprehensive integration of bioinformatics and cheminformatics tools for drug discovery.

Rcpi

Overview

Rcpi is a molecular informatics toolkit that comprehensively integrates bioinformatics and cheminformatics tools for drug discovery. For more information, please see our paper (DOI: 10.1093/bioinformatics/btu624).

Paper Citation

Formatted citation:

Dong-Sheng Cao, Nan Xiao, Qing-Song Xu, and Alex F. Chen. (2015). Rcpi: R/Bioconductor package to generate various descriptors of proteins, compounds and their interactions. Bioinformatics 31 (2), 279-281.

BibTeX entry:

@article{Rcpi2015,
  author  = {Cao, Dong-Sheng and Xiao, Nan and Xu, Qing-Song and Chen, Alex F.},
  title   = {{Rcpi: R/Bioconductor package to generate various descriptors of proteins, compounds and their interactions}},
  journal = {Bioinformatics},
  year    = {2015},
  volume  = {31},
  number  = {2},
  pages   = {279--281},
  doi     = {10.1093/bioinformatics/btu624}
}

Installation

To install the Rcpi package:

install.packages("BiocManager")
BiocManager::install("Rcpi")

To make the package fully functional (especially the Open Babel related functions), we recommend installing the Enhances packages by:

BiocManager::install("Rcpi", dependencies = c("Imports", "Enhances"))

Several dependencies of the Rcpi package may require system-level libraries; check the corresponding manuals of these packages for detailed installation guides.
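Before relying on the Open Babel related functions, it can help to check which optional packages are actually installed. The following is a minimal base R sketch, not part of Rcpi: the helper name `check_optional_pkgs` is hypothetical, and `ChemmineOB` is only an assumed example of an Enhances package, so consult the package DESCRIPTION file for the actual list.

```r
# Hypothetical helper (not an Rcpi function): report which packages are installed.
# Note: requireNamespace() loads a package's namespace if it is available.
check_optional_pkgs <- function(pkgs) {
  vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
}

# "ChemmineOB" is an assumed example; replace with the packages you need.
status <- check_optional_pkgs(c("stats", "ChemmineOB"))
missing_pkgs <- names(status)[!status]
if (length(missing_pkgs) > 0) {
  message("Not installed: ", paste(missing_pkgs, collapse = ", "))
}
```

Any package reported as missing can then be installed with `BiocManager::install()` as shown above.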

For a quick start, browse the package vignettes: [1], [2].

Features

Rcpi implements and integrates state-of-the-art protein sequence descriptors and molecular descriptors/fingerprints in R. For protein sequences, the Rcpi package can:

  • Calculate six protein descriptor groups composed of fourteen types of commonly used structural and physicochemical descriptors that include 9920 descriptors.

  • Calculate six types of generalized scales-based descriptors derived by various dimensionality reduction methods for proteochemometric (PCM) modeling.

  • Parallelized pairwise similarity computation derived by protein sequence alignment and Gene Ontology (GO) semantic similarity measures within a list of proteins.
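To make the idea of a protein sequence descriptor concrete, the sketch below computes amino acid composition (the fraction of each of the 20 standard residues), one of the simplest commonly used sequence descriptors. It is written in base R and does not call Rcpi; the function name `aa_composition` is hypothetical, and the package's own implementation may differ in details such as input validation.

```r
# Hypothetical base R sketch (not an Rcpi function): amino acid composition.
aa_composition <- function(sequence) {
  standard_aa <- c("A", "R", "N", "D", "C", "E", "Q", "G", "H", "I",
                   "L", "K", "M", "F", "P", "S", "T", "W", "Y", "V")
  residues <- strsplit(toupper(sequence), "")[[1]]
  # Reject empty input and non-standard residue codes.
  stopifnot(length(residues) > 0, all(residues %in% standard_aa))
  counts <- table(factor(residues, levels = standard_aa))
  # Return a named numeric vector of 20 fractions that sum to 1.
  setNames(as.numeric(counts) / length(residues), standard_aa)
}

composition <- aa_composition("MKVLAAGG")
round(composition[c("A", "G", "M")], 3)
```

Each sequence, whatever its length, is mapped to a fixed-length numeric vector, which is what makes such descriptors usable as model inputs.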

For small molecules, the Rcpi package can:

  • Calculate 307 molecular descriptors (2D/3D), including constitutional, topological, geometrical, and electronic descriptors, etc.

  • Calculate more than ten types of molecular fingerprints, including FP4 keys, E-state fingerprints, MACCS keys, etc., and parallelized chemical similarity search.

  • Parallelized pairwise similarity computation derived by fingerprints and maximum common substructure search within a list of small molecules.
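As an illustration of fingerprint-based pairwise similarity, the sketch below computes a Tanimoto similarity matrix for binary fingerprints in base R. The Tanimoto coefficient is one common choice for comparing fingerprints; this section does not state which measures Rcpi uses, so treat the choice, the function name `pairwise_tanimoto`, and the toy fingerprints as assumptions.

```r
# Hypothetical base R sketch (not an Rcpi function).
# fps: matrix with one binary fingerprint per row.
pairwise_tanimoto <- function(fps) {
  fps <- (as.matrix(fps) != 0) * 1          # coerce to a 0/1 numeric matrix
  shared <- tcrossprod(fps)                 # bits set in both fingerprints
  on_bits <- rowSums(fps)                   # bits set in each fingerprint
  either <- outer(on_bits, on_bits, "+") - shared  # bits set in at least one
  sim <- shared / either
  sim[either == 0] <- 0                     # two empty fingerprints: define as 0
  sim
}

# Toy 4-bit fingerprints for three molecules (made-up data).
fps <- rbind(mol1 = c(1, 1, 0, 0),
             mol2 = c(1, 0, 1, 0),
             mol3 = c(1, 1, 0, 0))
pairwise_tanimoto(fps)
```

The matrix formulation computes all pairs at once; for very large molecule lists, the pairs could instead be split across workers, which is the kind of workload the parallelized functions target.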

By combining various types of descriptors for drugs and proteins in different ways, interaction descriptors representing protein-protein or compound-protein interactions can be conveniently generated with Rcpi, including:

  • Two types of compound-protein interaction (CPI) descriptors

  • Three types of protein-protein interaction (PPI) descriptors
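The sketch below shows two generic ways to combine a compound descriptor vector with a protein descriptor vector into a single interaction descriptor: concatenation and the tensor (outer) product. This section does not name the combination methods Rcpi provides, so both the methods and the function names `cpi_concatenate` and `cpi_tensor` are assumptions made for illustration.

```r
# Hypothetical base R sketches (not Rcpi functions).

# Concatenation: length is length(compound) + length(protein).
cpi_concatenate <- function(compound, protein) {
  c(compound, protein)
}

# Tensor product: every compound feature multiplied by every protein feature,
# flattened to a vector of length(compound) * length(protein).
cpi_tensor <- function(compound, protein) {
  as.vector(outer(compound, protein))
}

compound_desc <- c(1, 2)      # made-up compound descriptor
protein_desc  <- c(3, 4, 5)   # made-up protein descriptor

length(cpi_concatenate(compound_desc, protein_desc))  # 5
length(cpi_tensor(compound_desc, protein_desc))       # 6
```

Concatenation keeps the feature count small, while the tensor product encodes pairwise feature interactions at the cost of a much longer vector.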

Several useful auxiliary utilities are also shipped with Rcpi:

  • Parallelized molecule and protein sequence retrieval from several online databases, like PubChem, ChEMBL, KEGG, DrugBank, UniProt, RCSB PDB, etc.

  • Loading molecules stored in SMILES/SDF files and loading protein sequences from FASTA/PDB files

  • Molecular file format conversion
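For readers unfamiliar with the FASTA format mentioned above, the following base R sketch parses FASTA-formatted lines into a named character vector of sequences. It does not use Rcpi's own loading functions; the name `parse_fasta` is hypothetical, and real FASTA files may need extra handling (comments, lowercase residues, very large files).

```r
# Hypothetical base R sketch (not an Rcpi function).
# lines: character vector, e.g. from readLines("proteins.fasta").
parse_fasta <- function(lines) {
  lines <- trimws(lines)
  lines <- lines[nzchar(lines)]             # drop blank lines
  is_header <- startsWith(lines, ">")
  stopifnot(length(lines) > 0, is_header[1])
  ids <- sub("^>", "", lines[is_header])
  record <- cumsum(is_header)               # record index for every line
  # Group sequence lines by record; records without sequence lines stay empty.
  chunks <- split(lines[!is_header],
                  factor(record[!is_header], levels = seq_along(ids)))
  seqs <- vapply(chunks, paste, character(1), collapse = "")
  names(seqs) <- ids
  seqs
}

# Made-up example records.
fasta_lines <- c(">seq1 example", "MKV", "LAA", "", ">seq2", "GGH")
parse_fasta(fasta_lines)
```

Sequences wrapped over several lines are joined, and the header text (without the leading `>`) becomes the element name.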

The computed protein sequence descriptors, molecular descriptors/fingerprints, interaction descriptors, and pairwise similarities are widely used in research fields relevant to drug discovery, primarily bioinformatics, cheminformatics, proteochemometrics, and chemogenomics.

Contribute

To contribute to this project, please take a look at the Contributing Guidelines first. Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.
