davisidarta / topometry

Licence: MIT license
A comprehensive dimensional reduction framework to recover the latent topology from high-dimensional data.

TopOMetry - Topologically Optimized geoMetry

Documentation available at Read The Docs.

A global framework for dimensionality reduction: learning topologic metrics, orthogonal bases and graph layouts

TopOMetry is a high-level Python library for exploring data topology. It learns topological metrics, dimensionality-reduced bases and graphs from data, and visualizes them with different layout optimization algorithms. The main aim is to achieve sequential approximations of the Laplace-Beltrami Operator, a natural way to describe data geometry and its high-dimensional topology. For more information, see the manuscript.

TopOMetry is designed to handle large-scale data matrices containing extreme topological diversity, such as those generated from single-cell omics, and can be used to perform topology-preserving visualizations.

TopOMetry's main class is the TopOGraph object. In a TopOGraph, topological metrics are recovered with diffusion harmonics, fuzzy simplicial sets or Continuous-k-Nearest-Neighbors, and used to obtain orthogonal bases (multiscale Diffusion Maps and/or fuzzy or continuous versions of Laplacian Eigenmaps) that emphasize topological features and are robust to noise. On top of these bases, new graphs can be learned using k-nearest-neighbors graphs or with new topological metrics. The learned similarity metrics, bases and graphs are stored in the TopOGraph object.
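As a rough illustration of the pipeline described above (similarity metric, then orthogonal basis, then a new graph), the following NumPy-only sketch builds a multiscale diffusion-map basis by hand. It does not use TopOMetry's API: the helper names (`knn`, `diffusion_basis`), the self-tuning kernel and the parameter values are assumptions chosen for illustration, and the dense matrices used here would not scale to large datasets.

```python
import numpy as np


def knn(X, k):
    """Indices and distances of the k nearest neighbors of each row (self excluded)."""
    # Dense pairwise Euclidean distances; acceptable only for small examples.
    d = np.sqrt(((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1))
    np.fill_diagonal(d, np.inf)
    idx = np.argsort(d, axis=1)[:, :k]
    return idx, np.take_along_axis(d, idx, axis=1)


def diffusion_basis(X, k=10, n_components=3):
    """Hypothetical helper: multiscale diffusion-map coordinates from a kNN kernel."""
    n = X.shape[0]
    idx, dist = knn(X, k)
    sigma = dist[:, -1]  # adaptive bandwidth: distance to the k-th neighbor
    rows, cols = np.repeat(np.arange(n), k), idx.ravel()
    K = np.zeros((n, n))
    K[rows, cols] = np.exp(-dist.ravel() ** 2 / (sigma[rows] * sigma[cols]))
    K = np.maximum(K, K.T)  # symmetrize the kNN kernel
    # Anisotropic normalization (alpha = 1) removes the effect of sampling density,
    # so the operator approximates the Laplace-Beltrami operator.
    q = K.sum(axis=1)
    K = K / np.outer(q, q)
    # Symmetric conjugate of the Markov matrix P = D^-1 K (same eigenvalues as P).
    deg = K.sum(axis=1)
    S = K / np.sqrt(np.outer(deg, deg))
    evals, evecs = np.linalg.eigh(S)
    order = np.argsort(evals)[::-1]
    evals, evecs = evals[order], evecs[:, order]
    psi = evecs / np.sqrt(deg)[:, None]  # right eigenvectors of P
    # Drop the trivial constant eigenvector (eigenvalue 1).
    lam = evals[1:n_components + 1]
    psi = psi[:, 1:n_components + 1]
    # Multiscale weighting: summing lam**t over all t >= 1 gives lam / (1 - lam).
    return psi * (lam / (1.0 - lam)), lam


# Toy data: a noisy circle embedded in 10 dimensions.
rng = np.random.default_rng(0)
t = np.linspace(0.0, 2.0 * np.pi, 200, endpoint=False)
X = np.zeros((200, 10))
X[:, 0], X[:, 1] = np.cos(t), np.sin(t)
X += 0.02 * rng.normal(size=X.shape)

basis, lam = diffusion_basis(X, k=10, n_components=3)
# A new kNN graph learned on top of the basis instead of the raw data.
new_idx, _ = knn(basis, k=10)
```

In this sketch the basis plays the role of the "orthogonal basis" and `new_idx` the role of the graph learned on top of it; the library itself offers several alternative metrics and bases at each step.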

Finally, different graph layout optimization algorithms built into TopOMetry can be used for visualization:

  • MAP (Manifold Approximation and Projection) - a lighter UMAP with looser assumptions
  • MDE (Minimum Distortion Embedding) - the ultimate swiss-army knife for graph layout optimization
  • tSNE (t-distributed Stochastic Neighbor Embedding) - a classic of visualization, with parallelization
  • TriMAP - dimensionality reduction using triplets
  • NCVis (Noise Contrastive Visualization) - for blazing fast performance
  • PaCMAP (Pairwise-controlled Manifold Approximation and Projection) - for balanced visualizations
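All of the methods listed above turn a graph into low-dimensional coordinates. As a minimal sketch of that idea, the NumPy code below computes a spectral layout (Laplacian Eigenmaps) of a small ring graph. This is a stand-in chosen because it has a closed-form solution; it is not one of the listed algorithms, which rely on iterative optimizers, and `spectral_layout` is a hypothetical helper rather than part of TopOMetry.

```python
import numpy as np


def spectral_layout(W, dim=2):
    """Spectral layout of a connected graph with symmetric affinity matrix W."""
    deg = W.sum(axis=1)
    inv_sqrt = 1.0 / np.sqrt(deg)
    # Symmetric normalized Laplacian: L = I - D^-1/2 W D^-1/2
    L = np.eye(len(W)) - inv_sqrt[:, None] * W * inv_sqrt[None, :]
    evals, evecs = np.linalg.eigh(L)  # eigenvalues in ascending order
    # Skip the trivial eigenvector (eigenvalue 0) and rescale by D^-1/2.
    return evecs[:, 1:dim + 1] * inv_sqrt[:, None]


# Toy graph: a ring of 60 nodes, each connected to its two neighbors.
n = 60
i = np.arange(n)
W = np.zeros((n, n))
W[i, (i + 1) % n] = 1.0
W[(i + 1) % n, i] = 1.0

layout = spectral_layout(W, dim=2)
# Distance of every node from the layout's center.
radii = np.linalg.norm(layout - layout.mean(axis=0), axis=1)
```

For the ring graph the layout places every node at the same distance from the center, so the circular topology of the graph is visible in two dimensions.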

The following image summarizes the TopOMetry workflow:

Figure: TopOMetry at a glance

Contributing

Contributions are very welcome! If you're interested in adding a new feature, just let me know in the Issues section.

License

MIT License

Citation

@article {Sidarta-Oliveira2022.03.14.484134,
	author = {Sidarta-Oliveira, Davi and Velloso, Licio A},
	title = {A comprehensive dimensional reduction framework to learn single-cell phenotypic topology uncovers T cell diversity},
	elocation-id = {2022.03.14.484134},
	year = {2022},
	doi = {10.1101/2022.03.14.484134},
	publisher = {Cold Spring Harbor Laboratory},
	URL = {https://www.biorxiv.org/content/early/2022/03/17/2022.03.14.484134},
	eprint = {https://www.biorxiv.org/content/early/2022/03/17/2022.03.14.484134.full.pdf},
	journal = {bioRxiv}
}