All Projects → ContextLab → Hypertools

ContextLab / Hypertools

Licence: mit
A Python toolbox for gaining geometric insights into high-dimensional data

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Hypertools

R Novice Gapminder
R for Reproducible Scientific Analysis
Stars: ✭ 127 (-92.43%)
Mutual labels:  data-wrangling, data-visualization
Moderndive book
Statistical Inference via Data Science: A ModernDive into R and the Tidyverse
Stars: ✭ 527 (-68.59%)
Mutual labels:  data-wrangling, data-visualization
R Ecology Lesson
Data Analysis and Visualization in R for Ecologists
Stars: ✭ 218 (-87.01%)
Mutual labels:  data-wrangling, data-visualization
Uplot
📈 A small, fast chart for time series, lines, areas, ohlc & bars
Stars: ✭ 6,808 (+305.72%)
Mutual labels:  time-series, data-visualization
R Raster Vector Geospatial
Introduction to Geospatial Raster and Vector Data with R
Stars: ✭ 76 (-95.47%)
Mutual labels:  data-wrangling, data-visualization
Imputets
CRAN R Package: Time Series Missing Value Imputation
Stars: ✭ 114 (-93.21%)
Mutual labels:  time-series, data-visualization
Data Science
Collection of useful data science topics along with code and articles
Stars: ✭ 315 (-81.23%)
Mutual labels:  time-series, data-visualization
Data Forge Ts
The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 967 (-42.37%)
Mutual labels:  data-wrangling, data-visualization
Openrefine
OpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (+408.4%)
Mutual labels:  data-wrangling, data-visualization
Python Novice Gapminder
Plotting and Programming in Python
Stars: ✭ 109 (-93.5%)
Mutual labels:  data-wrangling, data-visualization
Python Ecology Lesson
Data Analysis and Visualization in Python for Ecologists
Stars: ✭ 116 (-93.09%)
Mutual labels:  data-wrangling, data-visualization
Simplestockanalysispython
Stock Analysis Tutorial in Python
Stars: ✭ 126 (-92.49%)
Mutual labels:  time-series
Datasist
A Python library for easy data analysis, visualization, exploration and modeling
Stars: ✭ 123 (-92.67%)
Mutual labels:  data-visualization
Pbpython
Code, Notebooks and Examples from Practical Business Python
Stars: ✭ 1,724 (+2.74%)
Mutual labels:  data-visualization
Exoplanet
Fast & scalable MCMC for all your exoplanet needs!
Stars: ✭ 122 (-92.73%)
Mutual labels:  time-series
Imhotep
Imhotep is a large-scale analytics platform built by Indeed.
Stars: ✭ 125 (-92.55%)
Mutual labels:  time-series
Dep viz
A visual tool to help developers understand Elixir recompilation in their projects
Stars: ✭ 121 (-92.79%)
Mutual labels:  data-visualization
Eon Chart
Realtime animated graphs with PubNub and C3.
Stars: ✭ 121 (-92.79%)
Mutual labels:  time-series
Openhistorian
The Open Source Time-Series Data Historian
Stars: ✭ 120 (-92.85%)
Mutual labels:  time-series
Npmcharts.com
Compare npm package downloads over time
Stars: ✭ 129 (-92.31%)
Mutual labels:  data-visualization

Hypertools logo

"To deal with hyper-planes in a 14 dimensional space, visualize a 3D space and say 'fourteen' very loudly. Everyone does it." - Geoff Hinton

Hypertools example

Overview

HyperTools is designed to facilitate dimensionality reduction-based visual explorations of high-dimensional data. The basic pipeline is to feed in a high-dimensional dataset (or a series of high-dimensional datasets) and, in a single function call, reduce the dimensionality of the dataset(s) and create a plot. The package is built atop many familiar friends, including matplotlib, scikit-learn and seaborn. Our package was recently featured on Kaggle's No Free Hunch blog. For a general overview, you may find this talk useful (given as part of the MIND Summer School at Dartmouth).

Try it!

Click the badge to launch a binder instance with example uses:

Binder

or

Check the repo of Jupyter notebooks from the HyperTools paper.

Installation

To install the latest stable version run:

pip install hypertools

To install the latest unstable version directly from GitHub, run:

pip install -U git+https://github.com/ContextLab/hypertools.git

Or alternatively, clone the repository to your local machine:

git clone https://github.com/ContextLab/hypertools.git

Then, navigate to the folder and type:

pip install -e .

(These instructions assume that you have pip installed on your system)

NOTE: If you have been using the development version of 0.5.0, please clear your data cache (/Users/yourusername/hypertools_data).

Requirements

  • python 2.7, 3.5+
  • PPCA>=0.0.2
  • scikit-learn>=0.18.1
  • pandas>=0.18.0
  • seaborn>=0.8.1
  • matplotlib>=1.5.1
  • scipy>=0.17.1
  • numpy>=1.10.4
  • future
  • requests
  • deepdish
  • pytest (for development)
  • ffmpeg (for saving animations)

If installing from github (instead of pip), you must also install the requirements: pip install -r requirements.txt

Troubleshooting

If you encounter an error related to installing deepdish (hdf5) on a MacOS system, try installing hdf5 directly using homebrew:

$ brew tap homebrew/science
$ brew install hdf5

and then re-start the installation.

Documentation

Check out our readthedocs page for further documentation, complete API details, and additional examples.

Citing

We wrote a short JMLR paper about HyperTools, which you can read here, or you can check out a (longer) preprint here. We also have a repository with example notebooks from the paper here.

Please cite as:

Heusser AC, Ziman K, Owen LLW, Manning JR (2018) HyperTools: A Python toolbox for gaining geometric insights into high-dimensional data. Journal of Machine Learning Research, 18(152): 1--6.

Here is a bibtex formatted reference:

@ARTICLE {,
    author  = {Andrew C. Heusser and Kirsten Ziman and Lucy L. W. Owen and Jeremy R. Manning},    
    title   = {HyperTools: a Python Toolbox for Gaining Geometric Insights into High-Dimensional Data},    
    journal = {Journal of Machine Learning Research},
    year    = {2018},
    volume  = {18},	
    number  = {152},	
    pages   = {1-6},	
    url     = {http://jmlr.org/papers/v18/17-434.html}	
}

Contributing

Join the chat at https://gitter.im/hypertools/Lobby

If you'd like to contribute, please first read our Code of Conduct.

For specific information on how to contribute to the project, please see our Contributing page.

Testing

Build Status

To test HyperTools, install pytest (pip install pytest) and run pytest in the HyperTools folder

Examples

See here for more examples.

Plot

import hypertools as hyp
hyp.plot(list_of_arrays, '.', group=list_of_labels)

Plot example

Align

import hypertools as hyp
hyp.plot(list_of_arrays, align='hyper')

BEFORE

Align before example

AFTER

Align after example

Cluster

import hypertools as hyp
hyp.plot(array, '.', n_clusters=10)

Cluster Example

Describe

import hypertools as hyp
hyp.tools.describe(list_of_arrays, reduce='PCA', max_dims=14)

Describe Example

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].