pneuvial / adjclust

Licence: other

Adjacency-constrained hierarchical clustering of a similarity matrix

Programming Languages

7636 projects

50402 projects - #5 most used programming language

C++

36643 projects - #6 most used programming language

Projects that are alternatives of or similar to adjclust

echolocatoR

Automated statistical and functional fine-mapping pipeline with extensive API access to datasets.

Stars: ✭ 13 (-13.33%)

Mutual labels: gwas, linkage-disequilibrium

gchromVAR

Cell type specific enrichments using finemapped variants and quantitative epigenetic data

Stars: ✭ 31 (+106.67%)

Mutual labels: gwas

hic

Analysis of Chromosome Conformation Capture data (Hi-C)

Stars: ✭ 45 (+200%)

Mutual labels: hi-c

GenAMap

Visual Machine Learning of Genome-Phenome Associations

Stars: ✭ 22 (+46.67%)

Mutual labels: gwas

HiC data

A (continuously updated) collection of references to Hi-C data. Predominantly human/mouse Hi-C data, with replicates.

Stars: ✭ 107 (+613.33%)

Mutual labels: hi-c

hickit

TAD calling, phase imputation, 3D modeling and more for diploid single-cell Hi-C (Dip-C) and general Hi-C

Stars: ✭ 79 (+426.67%)

Mutual labels: hi-c

rvtests

Rare variant test software for next generation sequencing data

Stars: ✭ 114 (+660%)

Mutual labels: gwas

gcMapExplorer

Genome Contact Map Explorer - gcMapExplorer. Visit:

Stars: ✭ 15 (+0%)

Mutual labels: hi-c

TADLib

A Library to Explore Chromatin Interaction Patterns for Topologically Associating Domains

Stars: ✭ 23 (+53.33%)

Mutual labels: hi-c

qmplot

A Python package for creating high-quality manhattan and Q-Q plots from GWAS results.

Stars: ✭ 25 (+66.67%)

Mutual labels: gwas

docker-4dn-hic

Docker for 4DN Hi-C processing pipeline

Stars: ✭ 42 (+180%)

Mutual labels: hi-c

hierarchical-clustering

A Python implementation of divisive and hierarchical clustering algorithms. The algorithms were tested on the Human Gene DNA Sequence dataset and dendrograms were plotted.

Stars: ✭ 62 (+313.33%)

Mutual labels: hierarchical-clustering

mustache

Multi-scale Detection of Chromatin Loops from Hi-C and Micro-C Maps using Scale-Space Representation

Stars: ✭ 38 (+153.33%)

Mutual labels: hi-c

3d-genome-processing-tutorial

A 3D genome data processing tutorial for ISMB/ECCB 2017

Stars: ✭ 44 (+193.33%)

Mutual labels: hi-c

clodius

Clodius is a tool for breaking up large data sets into smaller tiles that can subsequently be displayed using an appropriate viewer.

Stars: ✭ 32 (+113.33%)

Mutual labels: hi-c

deepvis

machine learning algorithms in Swift

Stars: ✭ 54 (+260%)

Mutual labels: hierarchical-clustering

instaGRAAL

Large genome reassembly based on Hi-C data, continuation of GRAAL

Stars: ✭ 32 (+113.33%)

Mutual labels: hi-c

lme4qtl

Mixed models @lme4 + custom covariances + parameter constraints

Stars: ✭ 39 (+160%)

Mutual labels: gwas

faster lmm d

A faster lmm for GWAS. Supports GPU backend.

Stars: ✭ 12 (-20%)

Mutual labels: gwas

FarmCPUpp

Perform GWAS using the FarmCPU model.

Stars: ✭ 16 (+6.67%)

Mutual labels: gwas

View All Similar Projects ➔

adjclust: Adjacency-constrained clustering

adjclust is a package that provides methods to perform adjacency-constrained hierarchical agglomerative clustering. Adjacency-constrained hierarchical agglomerative clustering is hierarchical agglomerative clustering (HAC) in which each observation is associated to a position, and the clustering is constrained so as only adjacent clusters are merged. It is useful in bioinformatics (e.g. Genome Wide Association Studies or Hi-C data analysis).

adjclust provides three user level functions: adjClust, snpClust and hicClust, which are briefly explained below.

Installation

You can install adjclust from github with:

# install.packages("devtools")
devtools::install_github("pneuvial/adjclust")

`adjClust`

adjClust performs adjacency-constrained HAC for standard and sparse, similarity and dissimilarity matrices and dist objects. Matrix::dgCMatrix and Matrix::dsCMatrix are the supported sparse matrix classes. Let’s look at a basic example

library("adjclust")

sim <- matrix(c(1.0, 0.5, 0.2, 0.1,
                0.5, 1.0, 0.1, 0.2,
                0.2, 0.1, 1.0, 0.6,
                0.1, 0.2 ,0.6 ,1.0), nrow=4)
h <- 3
fit <- adjClust(sim, "similarity", h)
plot(fit)

The result is of class chac. It can be plotted as a dendrogram (as shown above). Successive merge and heights of clustering can be obtained by fit$merge and fit$height respectively.

`snpClust`

snpClust performs adjacency-constrained HAC for specific application of Genome Wide Association Studies (GWAS). A minimal example is given below. See GWAS Vignette for details.

library("snpStats")
#> Loading required package: survival
#> Loading required package: Matrix
data("ld.example", package = "snpStats")
geno <- ceph.1mb[, -316]  ## drop one SNP leading to one missing LD value
h <- 100
ld.ceph <- ld(geno, stats = "R.squared", depth = h)
image(ld.ceph, lwd = 0)

fit <- snpClust(geno, stats = "R.squared", h = h)
#> Warning in run.snpClust(x, h = h, stats = stats): Forcing the LD similarity to
#> be smaller than or equal to 1
#> Note: 133 merges with non increasing heights.
plot(fit)
#> Warning in plot.chac(fit): 
#> Detected reversals in dendrogram: mode = 'corrected', 'within-disp' or 'total-disp' might be more relevant.

sel_clust <- select(fit, "bs")
plotSim(as.matrix(ld.ceph), clustering = sel_clust, dendro = fit)

`hicClust`

hicClust performs adjacency-constrained HAC for specific application of Hi-C data analysis. A minimal example is given below. See Hi-C Vignette for details.

library("HiTC")
#> Warning: package 'BiocGenerics' was built under R version 4.0.5
#> Warning: package 'GenomeInfoDb' was built under R version 4.0.5

load(system.file("extdata", "hic_imr90_40_XX.rda", package = "adjclust"))
binned <- binningC(hic_imr90_40_XX, binsize = 5e5)
#> Bin size 'xgi' =500488 [1x500488]
#> Bin size 'ygi' =500488 [1x500488]
mapC(binned)
#> minrange= 104  - maxrange= 36776.8

fitB <- hicClust(binned)
#> Note: 5 merges with non increasing heights.
plot(fitB)
#> Warning in plot.chac(fitB): 
#> Detected reversals in dendrogram: mode = 'corrected', 'within-disp' or 'total-disp' might be more relevant.

plotSim(intdata(binned), dendro = fitB) # default: log scale for colors

Credits

Version 0.4.0 of this package was completed by Shubham Chaturvedi as a part of the Google Summer of Code 2017 program.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

pneuvial / adjclust

Programming Languages

Labels

Projects that are alternatives of or similar to adjclust

adjclust: Adjacency-constrained clustering

Installation

`adjClust`

`snpClust`

`hicClust`

Credits