Top 1642 data-science open source projects

Labeled Tweet Generator
Search for tweets and download the data labeled with its polarity in CSV format
Responsible Ai Widgets
This project provides responsible AI user interfaces for Fairlearn, interpret-community, and Error Analysis, as well as foundational building blocks that they rely on.
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Ml Da Coursera Yandex Mipt
Machine Learning and Data Analysis Coursera Specialization from Yandex and MIPT
Tflearn
Deep learning library featuring a higher-level API for TensorFlow.
Tennis Crystal Ball
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Skpro
Supervised domain-agnostic prediction framework for probabilistic modelling
Toolbox
A Java Toolbox for Scalable Probabilistic Machine Learning
Python Data Science Handbook
A Chinese translation of Jake VanderPlas's "Python Data Science Handbook", available online as Jupyter notebooks
Azureml Examples
Official community-driven Azure Machine Learning examples, tested with GitHub Actions
Kaggle Past Solutions
A searchable compilation of Kaggle past solutions
Pulsar
Turn large websites into tables and charts using simple SQL queries.
Vizuka
Explore high-dimensional datasets and how your algo handles specific regions.
Drsa
Deep Recurrent Survival Analysis, an auto-regressive deep model for time-to-event data analysis that handles censored data. An implementation of our AAAI 2019 paper and a benchmark for several survival analysis methods implemented in Python.
D2l En
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 300 universities from 55 countries including Stanford, MIT, Harvard, and Cambridge.
Oreilly reactive python for data
Resources for the O'Reilly online video "Reactive Python for Data"
Flurs
🌊 FluRS: A Python library for streaming recommendation algorithms
Sspipe
Simple Smart Pipe: a Python productivity tool for rapid data manipulation
Ohayo
ohayo is a fast and free data science studio.
Spark Py Notebooks
Apache Spark & Python (PySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Probflow
A Python package for building Bayesian models with TensorFlow or PyTorch
Aethos
Automated Data Science and Machine Learning library to optimize workflow.
R Course
An introduction to data analysis with R and RStudio
Bayesian Cognitive Modeling In Pymc3
PyMC3 code for Lee and Wagenmakers' "Bayesian Cognitive Modeling: A Practical Course"
Ml Pyxis
Tool for reading and writing datasets of tensors in a Lightning Memory-Mapped Database (LMDB). Designed to manage machine learning datasets with fast reading speeds.
Breakdown
Model-agnostic breakDown plots
Ds With Pysimplegui
Data science and machine learning GUI programs and desktop apps built with the PySimpleGUI package
Pyvtreat
vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.
Pytorch Wrapper
Provides a systematic and extensible way to build, train, evaluate, and tune deep learning models using PyTorch.
Danfojs
danfo.js is an open-source JavaScript library providing high-performance, intuitive, and easy-to-use data structures for manipulating and processing structured data.
H2o Tutorials
Tutorials and training material for the H2O Machine Learning Platform
Fklearn
fklearn: Functional Machine Learning
Sci Pype
A Machine Learning API with native Redis caching and export and import using S3. Analyze entire datasets using an API for building, training, testing, analyzing, extracting, importing, and archiving. This repository can run from a Docker container or directly from the repository.