GopGoPlus - The Go+ language for engineering, STEM education, and data science
Stars: ✭ 7,829 (+630.32%)
MatplotplusplusMatplot++: A C++ Graphics Library for Data Visualization 📊🗾
Stars: ✭ 2,433 (+126.96%)
SeaweedfsSeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.
Stars: ✭ 13,380 (+1148.13%)
KneedKnee point detection in Python 📈
Stars: ✭ 328 (-69.4%)
CollapseAdvanced and Fast Data Transformation in R
Stars: ✭ 184 (-82.84%)
Rumble⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (-94.59%)
datajoint-pythonRelational data pipelines for the science lab
Stars: ✭ 140 (-86.94%)
ArticlesA repository for the source code, notebooks, data, files, and other assets used in the data science and machine learning articles on LearnDataSci
Stars: ✭ 350 (-67.35%)
DatacleanerThe premier open source Data Quality solution
Stars: ✭ 391 (-63.53%)
CortxCORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (-60.26%)
Scikit Mobilityscikit-mobility: mobility analysis in Python
Stars: ✭ 339 (-68.38%)
Quantitative NotebooksEducational notebooks on quantitative finance, algorithmic trading, financial modelling and investment strategy
Stars: ✭ 356 (-66.79%)
DataexplorerAutomate Data Exploration and Treatment
Stars: ✭ 362 (-66.23%)
PandapyPandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai)
Stars: ✭ 474 (-55.78%)
MathematicavsrExample projects, code, and documents for comparing Mathematica with R.
Stars: ✭ 41 (-96.18%)
Knowledge RepoA next-generation curated knowledge sharing platform for data scientists and other technical professions.
Stars: ✭ 4,956 (+362.31%)
Cookbook 2nd CodeCode of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (-49.53%)
Jupyter pivottablejsDrag’n’drop Pivot Tables and Charts for Jupyter/IPython Notebook, care of PivotTable.js
Stars: ✭ 428 (-60.07%)
GonumGonum is a set of numeric libraries for the Go programming language. It contains libraries for matrices, statistics, optimization, and more
Stars: ✭ 5,384 (+402.24%)
ReflowA language and runtime for distributed, incremental data processing in the cloud
Stars: ✭ 706 (-34.14%)
AkshareAKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 4,334 (+304.29%)
Pandas SummaryAn extension to pandas dataframes describe function.
Stars: ✭ 361 (-66.32%)
Aws.s3Amazon Simple Storage Service (S3) API Client
Stars: ✭ 302 (-71.83%)
PrettypandasA Pandas Styler class for making beautiful tables
Stars: ✭ 376 (-64.93%)
Data ScienceCollection of useful data science topics along with code and articles
Stars: ✭ 315 (-70.62%)
Ai Learn人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
Stars: ✭ 4,387 (+309.24%)
PyodA Python Toolbox for Scalable Outlier Detection (Anomaly Detection)
Stars: ✭ 5,083 (+374.16%)
Cluster PackA library on top of either pex or conda-pack to make your Python code easily available on a cluster
Stars: ✭ 23 (-97.85%)
Awesome RA curated list of awesome R packages, frameworks and software.
Stars: ✭ 4,858 (+353.17%)
CoursesQuiz & Assignment of Coursera
Stars: ✭ 454 (-57.65%)
RumaleRumale is a machine learning library in Ruby
Stars: ✭ 526 (-50.93%)
DapyEasy-to-use data analysis / manipulation framework for humans
Stars: ✭ 523 (-51.21%)
PachydermReproducible Data Science at Scale!
Stars: ✭ 5,305 (+394.87%)
Pydataroadopen source for wechat-official-account (ID: PyDataLab)
Stars: ✭ 302 (-71.83%)
DataprooferA proofreader for your data
Stars: ✭ 628 (-41.42%)
NfstreamNFStream: a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (-41.98%)
Cookbook 2ndIPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (-34.33%)
ElkiELKI Data Mining Toolkit
Stars: ✭ 613 (-42.82%)
SkdataPython tools for data analysis
Stars: ✭ 16 (-98.51%)
DataframeC++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types, continuous memory storage, and no pointers are involved
Stars: ✭ 828 (-22.76%)
Model Describermodel-describer : Making machine learning interpretable to humans
Stars: ✭ 22 (-97.95%)
Imbalanced LearnA Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Stars: ✭ 5,617 (+423.97%)
Pytima python package for the interfacial analysis of molecular simulations
Stars: ✭ 38 (-96.46%)
DataflowjavasdkGoogle Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (-20.34%)
Pandas ProfilingCreate HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+676.96%)
SocratA Dynamic Web Toolbox for Interactive Data Processing, Analysis, and Visualization
Stars: ✭ 26 (-97.57%)
Data Science On GcpSource code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (-19.4%)
Janitorsimple tools for data cleaning in R
Stars: ✭ 981 (-8.49%)