Imexamimexam is a python tool for simple image examination, and plotting, with similar functionality to IRAF's imexamine
Stars: ✭ 57 (-46.73%)
Topicmodelstopics Models extension for Mallet & scikit-learn
Stars: ✭ 50 (-53.27%)
Facebook Data Analyzer📊Python script to analyze the contents of your Facebook data export
Stars: ✭ 71 (-33.64%)
Tsne CudaGPU Accelerated t-SNE for CUDA with Python bindings
Stars: ✭ 1,120 (+946.73%)
Musictaster一种song2vec、artist2vec的实践
Stars: ✭ 38 (-64.49%)
Hyperlearn50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Stars: ✭ 1,204 (+1025.23%)
TiledbThe Universal Storage Engine
Stars: ✭ 1,072 (+901.87%)
PyastronomyA collection of astronomy-related routines in Python
Stars: ✭ 91 (-14.95%)
Datacamp🍧 A repository that contains courses I have taken on DataCamp
Stars: ✭ 69 (-35.51%)
Pydata Pandas WorkshopMaterial for my PyData Jupyter & Pandas Workshops, I'm also available for personal in-house trainings on request
Stars: ✭ 65 (-39.25%)
Janitorsimple tools for data cleaning in R
Stars: ✭ 981 (+816.82%)
ExportifyExport Spotify playlists using the Web API. Analyze them in the Jupyter notebook.
Stars: ✭ 80 (-25.23%)
DatacomparerdataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.
Stars: ✭ 58 (-45.79%)
StingrayAnything can happen in the next half hour (including spectral timing made easy)!
Stars: ✭ 94 (-12.15%)
Running pageMake your own running home page
Stars: ✭ 1,078 (+907.48%)
Dream3dData Analysis program and framework for materials science data analytics, based on the managing framework SIMPL framework.
Stars: ✭ 73 (-31.78%)
LogationAnalyse your NGINX access logs and create beautiful maps of the locations from which people access your service.
Stars: ✭ 99 (-7.48%)
DatatableA Python package for manipulating 2-dimensional tabular data structures
Stars: ✭ 1,166 (+989.72%)
Data SelfieData Selfie - a browser extension to track yourself on Facebook and analyze your data.
Stars: ✭ 1,009 (+842.99%)
HiringCreate WOW Moments. Create superfans.
Stars: ✭ 85 (-20.56%)
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+821.5%)
Daru Viewdaru-view is for easy and interactive plotting in web application & IRuby notebook. daru-view is a plugin gem to the existing daru gem.
Stars: ✭ 65 (-39.25%)
Drugs Recommendation Using ReviewsAnalyzing the Drugs Descriptions, conditions, reviews and then recommending it using Deep Learning Models, for each Health Condition of a Patient.
Stars: ✭ 35 (-67.29%)
Gopup数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字稿,影视票房数据,高校名单,疫情数据…
Stars: ✭ 1,229 (+1048.6%)
JumbuneJumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,
Stars: ✭ 64 (-40.19%)
WarpConvert and analyze large data sets at light speed, on Mac and iOS.
Stars: ✭ 62 (-42.06%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-26.17%)
OpenrefineOpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (+7872.9%)
RootThe official repository for ROOT: analyzing, storing and visualizing big data, scientifically
Stars: ✭ 1,377 (+1186.92%)
Tsrepr TSrepr: R package for time series representations
Stars: ✭ 75 (-29.91%)
PycmMulti-class confusion matrix library in Python
Stars: ✭ 1,076 (+905.61%)
ElectrophysiologysoftwareA list of openly available software tools for (mostly human) electrophysiology.
Stars: ✭ 54 (-49.53%)
MetrotwitterWhat Twitter reveals about the differences between cities and the monoculture of the Bay Area
Stars: ✭ 52 (-51.4%)
CubesLight-weight Python OLAP framework for multi-dimensional data analysis
Stars: ✭ 1,393 (+1201.87%)
PygmmisGaussian mixture model for incomplete (missing or truncated) and noisy data
Stars: ✭ 70 (-34.58%)
CultivarMultidimensional data explorer and visualization tool.
Stars: ✭ 46 (-57.01%)
Fklearnfklearn: Functional Machine Learning
Stars: ✭ 1,305 (+1119.63%)
MathematicavsrExample projects, code, and documents for comparing Mathematica with R.
Stars: ✭ 41 (-61.68%)
Countly Sdk CordovaCountly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-35.51%)
Ether sqlA python library to push ethereum blockchain data into an sql database.
Stars: ✭ 41 (-61.68%)
Blog文章列表
Stars: ✭ 96 (-10.28%)
Pytima python package for the interfacial analysis of molecular simulations
Stars: ✭ 38 (-64.49%)
StartrA template for data journalism in R
Stars: ✭ 69 (-35.51%)
FlyteAccelerate your ML and Data workflows to production. Flyte is a production grade orchestration system for your Data and ML workloads. It has been battle tested at Lyft, Spotify, freenome and others and truly open-source.
Stars: ✭ 1,242 (+1060.75%)
TabixTabix.io UI
Stars: ✭ 1,152 (+976.64%)
GdlGDL - GNU Data Language
Stars: ✭ 104 (-2.8%)
100 Pandas Puzzles100 data puzzles for pandas, ranging from short and simple to super tricky (60% complete)
Stars: ✭ 1,382 (+1191.59%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+1150.47%)
DexDex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (+1057.01%)
GraphiaA visualisation tool for the creation and analysis of graphs
Stars: ✭ 67 (-37.38%)