All Projects → zhengxwen → HIBAG

zhengxwen / HIBAG

Licence: other
R package – HLA Genotype Imputation with Attribute Bagging (development version only)

Programming Languages

C++
36643 projects - #6 most used programming language
r
7636 projects

Projects that are alternatives of or similar to HIBAG

sciblox
sciblox - Easier Data Science and Machine Learning
Stars: ✭ 48 (+108.7%)
Mutual labels:  imputation
SSIM Seq2Seq
SSIM - A Deep Learning Approach for Recovering Missing Time Series Sensor Data
Stars: ✭ 32 (+39.13%)
Mutual labels:  imputation
kourami
Kourami: Graph-guided assembly for HLA alleles
Stars: ✭ 31 (+34.78%)
Mutual labels:  hla
ALRA
Imputation method for scRNA-seq based on low-rank approximation
Stars: ✭ 48 (+108.7%)
Mutual labels:  imputation
OpenRTI
Mirror of OpenRTI on SourceForge. Do not create PR's. Instead send patches or git pull commands to the mailing list on SF.
Stars: ✭ 48 (+108.7%)
Mutual labels:  hla
impute-me
This is the code behind the www.impute.me site. It contains algorithms for personal genome analysis, including imputation and polygenic risk score calculation
Stars: ✭ 96 (+317.39%)
Mutual labels:  imputation
imputationserver
Michigan Imputation Server: A new web-based service for imputation that facilitates access to new reference panels and greatly improves user experience and productivity
Stars: ✭ 41 (+78.26%)
Mutual labels:  imputation
simputation
Making imputation easy
Stars: ✭ 71 (+208.7%)
Mutual labels:  imputation
missRanger
R package "missRanger" for fast imputation of missing values by random forests.
Stars: ✭ 42 (+82.61%)
Mutual labels:  imputation
TotalLeastSquares.jl
Solve many kinds of least-squares and matrix-recovery problems
Stars: ✭ 23 (+0%)
Mutual labels:  imputation
missCompare
missCompare R package - intuitive missing data imputation framework
Stars: ✭ 31 (+34.78%)
Mutual labels:  imputation
BetaML.jl
Beta Machine Learning Toolkit
Stars: ✭ 64 (+178.26%)
Mutual labels:  imputation
icn3d
web-based protein structure viewer and analysis tool interactively or in batch mode
Stars: ✭ 95 (+313.04%)
Mutual labels:  snp
SNPRelate
R package: parallel computing toolset for relatedness and principal component analysis of SNP data (Development Version)
Stars: ✭ 74 (+221.74%)
Mutual labels:  snp
hlatyping
Precision HLA typing from next-generation sequencing data
Stars: ✭ 28 (+21.74%)
Mutual labels:  hla

HLA Genotype Imputation with Attribute Bagging

Kernel Version: 1.5

GPLv3 GNU General Public License, GPLv3

Availability Years-in-BioC Build Status Build status codecov.io

Features

HIBAG is a state of the art software package for imputing HLA types using SNP data, and it relies on a training set of HLA and SNP genotypes. HIBAG can be used by researchers with published parameter estimates instead of requiring access to large training sample datasets. It combines the concepts of attribute bagging, an ensemble classifier method, with haplotype inference for SNPs and HLA types. Attribute bagging is a technique which improves the accuracy and stability of classifier ensembles using bootstrap aggregating and random variable selection.

Bioconductor Package

Release Version: 1.34.0

http://www.bioconductor.org/packages/HIBAG/

Changes in Bioconductor Version (since v1.26.0, Y2020):

  • Kernel Version: v1.5
  • The kernel v1.5 generates the same training model as v1.4, but 2-6x faster, by taking advantage of Intel AVX, AVX2 and AVX512 intrinsics if available

Changes in Bioconductor Version (since v1.14.0, Y2017):

  • Kernel Version: v1.4
  • The kernel v1.4 outputs exactly the same model parameter estimates as v1.3, and the model training with v1.4 is 1.2 times faster than v1.3
  • Modify the kernel to support the GPU extension

Changes in Bioconductor Version (since v1.3.0, Y2013):

  • Kernel Version: v1.3
  • Optimize the calculation of hamming distance using SSE2 and hardware POPCNT instructions if available
  • Hardware POPCNT: 2.4x speedup for large-scale data, compared to the implementation in v1.2.4
  • SSE2 popcount implementation without hardware POPCNT: 1.5x speedup for large-scale data, compared to the implementation in v1.2.4

Package Author & Maintainer

Dr. Xiuwen Zheng

Pre-fit Model Download

Citation

Zheng, X. et al. HIBAG-HLA genotype imputation with attribute bagging. Pharmacogenomics Journal 14, 192-200 (2014). doi: 10.1038/tpj.2013.18

Zheng, X. (2018) Imputation-Based HLA Typing with SNPs in GWAS Studies. In: Boegel S. (eds) HLA Typing. Methods in Molecular Biology, Vol 1802. Humana Press, New York, NY. doi: 10.1007/978-1-4939-8546-3_11

Installation

  • Bioconductor repository:
source("http://bioconductor.org/biocLite.R")
biocLite("HIBAG")
  • Development version from Github (for developers/testers only):
library("devtools")
install_github("zhengxwen/HIBAG")

The install_github() approach requires that you build from source, i.e. make and compilers must be installed on your system -- see the R FAQ for your operating system; you may also need to install dependencies manually.

Acceleration

CPU with Intel Intrinsics

  • GCC (>= v6.0) is strongly recommended to compile the HIBAG package (Intel ICC is not suggested).

  • HIBAG::hlaSetKernelTarget("max") can be used to maximize the algorithm efficiency.

GPU with OpenCL

Archive

https://github.com/zhengxwen/Archive/tree/master/HIBAG

https://bioconductor.org/about/release-announcements

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].