All Projects → lmweber → Cytometry Clustering Comparison

lmweber / Cytometry Clustering Comparison

Licence: mit
R scripts to reproduce analyses in our paper comparing clustering methods for high-dimensional cytometry data

Programming Languages

r
7636 projects

Projects that are alternatives of or similar to Cytometry Clustering Comparison

Survivor
Toolset for SV simulation, comparison and filtering
Stars: ✭ 180 (+500%)
Mutual labels:  bioinformatics, comparison
Swarm
A robust and fast clustering method for amplicon-based studies
Stars: ✭ 88 (+193.33%)
Mutual labels:  bioinformatics, clustering
micca
micca - MICrobial Community Analysis
Stars: ✭ 19 (-36.67%)
Mutual labels:  bioinformatics, clustering
Awesome Single Cell
Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.
Stars: ✭ 1,937 (+6356.67%)
Mutual labels:  bioinformatics, clustering
Vsearch
Versatile open-source tool for microbiome analysis
Stars: ✭ 444 (+1380%)
Mutual labels:  bioinformatics, clustering
Data mining
The Ruby DataMining Gem, is a little collection of several Data-Mining-Algorithms
Stars: ✭ 10 (-66.67%)
Mutual labels:  clustering
Awesome Sequencing Tech Papers
A collection of publications on comparison of high-throughput sequencing technologies.
Stars: ✭ 21 (-30%)
Mutual labels:  bioinformatics
Gpusorting
Implementation of a few sorting algorithms in OpenCL
Stars: ✭ 9 (-70%)
Mutual labels:  comparison
Adapt
Advanced Developer Async Programming Toolkit
Stars: ✭ 26 (-13.33%)
Mutual labels:  clustering
Rasusa
Randomly subsample sequencing reads to a specified coverage
Stars: ✭ 28 (-6.67%)
Mutual labels:  bioinformatics
Densitycluster
Machine learning. Clustering by fast search and find of density peaks.
Stars: ✭ 27 (-10%)
Mutual labels:  clustering
Events
Repository for *SEM Paper on Event Coreference Resolution in ECB+
Stars: ✭ 20 (-33.33%)
Mutual labels:  clustering
Scanpy
Single-Cell Analysis in Python. Scales to >1M cells.
Stars: ✭ 858 (+2760%)
Mutual labels:  bioinformatics
Crypto Compare
Comparison chart between popular cryptocurrencies.
Stars: ✭ 21 (-30%)
Mutual labels:  comparison
Scispacy
A full spaCy pipeline and models for scientific/biomedical documents.
Stars: ✭ 855 (+2750%)
Mutual labels:  bioinformatics
Sevenbridges R
Seven Bridges API Client, CWL Schema, Meta Schema, and SDK Helper in R
Stars: ✭ 27 (-10%)
Mutual labels:  bioinformatics
Nonpareil
Estimate metagenomic coverage and sequence diversity
Stars: ✭ 26 (-13.33%)
Mutual labels:  bioinformatics
Tribuo
Tribuo - A Java machine learning library
Stars: ✭ 882 (+2840%)
Mutual labels:  clustering
Vdjviz
A lightweight immune repertoire browser
Stars: ✭ 21 (-30%)
Mutual labels:  bioinformatics
Clustering
fast clustering algorithms
Stars: ✭ 14 (-53.33%)
Mutual labels:  clustering

Comparison of clustering methods for high-dimensional single-cell flow and mass cytometry (CyTOF) data

This repository contains R scripts to reproduce the analyses and figures in our paper comparing clustering methods for high-dimensional flow cytometry and mass cytometry (CyTOF) data.

  • Weber L.M. and Robinson M.D. (2016) Comparison of clustering methods for high-dimensional single-cell flow and mass cytometry data. Cytometry Part A, 89A: 1084–1096. Open access available here.

    Note: Supplementary files (Supporting Information PDF and Supporting Information Table S1) are available via the journal link above, or from the supplementary_files directory (latest version: November 18, 2016).

Overview

In this study, we compared the performance of 18 clustering methods for automated detection of cell populations in high-dimensional flow cytometry and mass cytometry (CyTOF) data, using 6 publicly available data sets from experiments in immunology as benchmarks. These results extend previously published comparisons by focusing on high-dimensional data and including new methods developed for CyTOF data.

A list of the clustering methods can be found in Table 1 in the paper. A list of the data sets is provided in Table 2.

The comparisons showed that several methods performed well, including FlowSOM, X-shift, PhenoGraph, Rclusterpp, and flowMeans. Among these, FlowSOM had extremely fast runtimes, making this method well-suited for interactive, exploratory analysis of large, high-dimensional data sets on a standard laptop or desktop computer.

Based on our results, we recommend the use of FlowSOM (with manual selection of the number of clusters; see paper) as a first choice for this type of analysis, since this method gave best or near-best performance across all data sets, together with extremely fast runtimes.

See the paper for more details, in particular regarding the advantages of the different methods for different clustering tasks (detecting multiple cell populations vs. detecting a single rare population).

FlowSOM and Rtsne example code

FlowSOM (Van Gassen et al., 2015) is available as a Bioconductor package for the R statistical programming language.

A worked example showing how to use FlowSOM for clustering and Rtsne for visualization is available in the FlowSOM-Rtsne-example repository.

Updates

Updated results for new clustering algorithms or new reference data sets will be published on this website.

The following updates are currently available:

Contents of this repository

R scripts are organized into the following subdirectories. All scripts contain comments explaining the overall purpose and individual steps.

Supplementary files from the published paper are included in the following directory:

R scripts and summary reports for updated results are included in the following directory:

  • updates: updated results for new clustering methods or new reference data sets

Data files

Pre-processed data files for the benchmark data sets are available from FlowRepository (repository FR-FCM-ZZPH).

Original data files can be obtained through the references listed in Table 2 in the paper.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].