All Categories → Data Processing → data-science

Top 1642 data-science open source projects

Python For Data Science
A collection of Jupyter Notebooks for learning Python for Data Science.
Laurae
Advanced High Performance Data Science Toolbox for R by Laurae
Estadistica Con R
Apuntes personales sobre estadística, machine learning y lenguaje de programación R
Instascrape
Powerful and flexible Instagram scraping library for Python, providing easy-to-use and expressive tools for accessing data programmatically
Trump Lies
Tutorial: Web scraping in Python with Beautiful Soup
Achoo
Achoo uses a Raspberry Pi to predict if my son will need his inhaler on any given day using weather, pollen, and air quality data. If the prediction for a given day is above a specified threshold, the Pi will email his school nurse, and myself, notifying her that he may need preemptive treatment. Community-sourced health monitoring!
Radio
RadIO is a library for data science research of computed tomography imaging
Data Science Projects With Python
A Case Study Approach to Successful Data Science Projects Using Python, Pandas, and Scikit-Learn
Climate Change Data
🌍 A curated list of APIs, open data and ML/AI projects on climate change
Tad
A desktop application for viewing and analyzing tabular data
Data Science Notebook
📖 每一个伟大的思想和行动都有一个微不足道的开始
Imodels
Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).
Finance
Here you can find all the quantitative finance algorithms that I've worked on and refined over the past year!
Awesome Ensemble Learning
Ensemble learning related books, papers, videos, and toolboxes
Machinelearningnotebooks
Python notebooks with ML and deep learning examples with Azure Machine Learning Python SDK | Microsoft
Data Science Live Book
An open source book to learn data science, data analysis and machine learning, suitable for all ages!
Plynx
PLynx is a domain agnostic platform for managing reproducible experiments and data-oriented workflows.
Online Courses Learning
Contains the online course about Data Science, Machine Learning, Programming Language, Operating System, Mechanial Engineering, Mathematics and Robotics provided by Coursera, Udacity, Linkedin Learning, Udemy and edX.
Speedml
Speedml is a Python package to speed start machine learning projects.
Uci Ml Api
Simple API for UCI Machine Learning Dataset Repository (search, download, analyze)
Zebras
Data analysis library for JavaScript built with Ramda
Klib
Easy to use Python library of customized functions for cleaning and analyzing data.
Delbot
It understands your voice commands, searches news and knowledge sources, and summarizes and reads out content to you.
Deon
A command line tool to easily add an ethics checklist to your data science projects.
Observations
Tools for loading standard data sets in machine learning
Pytorch Lightning
The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
Dataaspirant codes
Complete machine learning model codes
Pca Magic
PCA that iteratively replaces missing data
Homlr
Supplementary material for Hands-On Machine Learning with R, an applied book covering the fundamentals of machine learning with R.
Awesome R Learning Resources
A curated collection of free resources to help deepen your understanding of the R programming language. Updated regularly. Contributions encouraged via pull request (see contributing.md).
Imbalanced Algorithms
Python-based implementations of algorithms for learning on imbalanced data.
Lets Plot Kotlin
Kotlin API for Lets-Plot - an open-source plotting library for statistical data.
Computationalhealthcare
A platform for analysis & development of machine learning models using large de-identified healthcare datasets.
Docker Galaxy Stable
🐳📊📚 Docker Images tracking the stable Galaxy releases.
Soda Sql
Metric collection, data testing and monitoring for SQL accessible data
Metrics
Machine learning metrics for distributed, scalable PyTorch applications.
61-120 of 1642 data-science projects