StingrayAnything can happen in the next half hour (including spectral timing made easy)!
PyastronomyA collection of astronomy-related routines in Python
Fklearnfklearn: Functional Machine Learning
HiringCreate WOW Moments. Create superfans.
FlyteAccelerate your ML and Data workflows to production. Flyte is a production grade orchestration system for your Data and ML workloads. It has been battle tested at Lyft, Spotify, freenome and others and truly open-source.
DexDex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Gopup数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字稿,影视票房数据,高校名单,疫情数据…
ExportifyExport Spotify playlists using the Web API. Analyze them in the Jupyter notebook.
SetlA simple Spark-powered ETL framework that just works 🍺
Hyperlearn50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Tsrepr TSrepr: R package for time series representations
Dream3dData Analysis program and framework for materials science data analytics, based on the managing framework SIMPL framework.
PygmmisGaussian mixture model for incomplete (missing or truncated) and noisy data
DatatableA Python package for manipulating 2-dimensional tabular data structures
Datacamp🍧 A repository that contains courses I have taken on DataCamp
StartrA template for data journalism in R
GraphiaA visualisation tool for the creation and analysis of graphs
Daru Viewdaru-view is for easy and interactive plotting in web application & IRuby notebook. daru-view is a plugin gem to the existing daru gem.
Pydata Pandas WorkshopMaterial for my PyData Jupyter & Pandas Workshops, I'm also available for personal in-house trainings on request
JumbuneJumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,
Tsne CudaGPU Accelerated t-SNE for CUDA with Python bindings
WarpConvert and analyze large data sets at light speed, on Mac and iOS.
DatacomparerdataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.
OpenrefineOpenRefine is a free, open source power tool for working with messy data and improving it
Imexamimexam is a python tool for simple image examination, and plotting, with similar functionality to IRAF's imexamine
PycmMulti-class confusion matrix library in Python
TiledbThe Universal Storage Engine
MetrotwitterWhat Twitter reveals about the differences between cities and the monoculture of the Bay Area
Topicmodelstopics Models extension for Mallet & scikit-learn
CultivarMultidimensional data explorer and visualization tool.
MathematicavsrExample projects, code, and documents for comparing Mathematica with R.
Data SelfieData Selfie - a browser extension to track yourself on Facebook and analyze your data.
Ether sqlA python library to push ethereum blockchain data into an sql database.
Pytima python package for the interfacial analysis of molecular simulations
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Janitorsimple tools for data cleaning in R
Drugs Recommendation Using ReviewsAnalyzing the Drugs Descriptions, conditions, reviews and then recommending it using Deep Learning Models, for each Health Condition of a Patient.
ApogeeTools for dealing with APOGEE data
RshrfrsHRF: A Toolbox for Resting State HRF Deconvolution and Connectivity Analysis (MATLAB)
Data Forge TsThe JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
PandasFlexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
WildfirepyWildfirePy, a Python library for Wildfire GIS data analysis.