All Categories → Data Processing → data-science

Top 1642 data-science open source projects

Socrat
A Dynamic Web Toolbox for Interactive Data Processing, Analysis, and Visualization
Pretzel
Javascript full-stack framework for Big Data visualisation and analysis
Rmarkdown Website Tutorial
Tutorial for creating websites w/ R Markdown
Tiledb Vcf
Efficient variant-call data storage and retrieval library using the TileDB storage library.
Datacleaner
A Python tool that automatically cleans data sets and readies them for analysis.
R Notes
Notes for using R language to do data mining and machine learning (Chinese)
Docker Images
Out-of-box Data Science / AI platform | AI/数据科学的瑞士军刀
Blogr
Scripts + data to recreate analyses published on http://benjaminlmoore.wordpress.com and http://blm.io
Looper
A resource list for causality in statistics, data science and physics
Boltzmannclean
Fill missing values in Pandas DataFrames using Restricted Boltzmann Machines
4th Place Home Credit Default Risk
Codes and dashboards for 4th place solution for Kaggle's Home Credit Default Risk competition
Lambdaschooldatascience
Completed assignments and coding challenges from the Lambda School Data Science program.
Mds
Modern Data Science
Pygooglenews
If Google News had a Python library
Biolitmap
Code for the paper "BIOLITMAP: a web-based geolocated and temporal visualization of the evolution of bioinformatics publications" in Oxford Bioinformatics.
Lux
Python API for Intelligent Visual Data Discovery
Errormoji
®️ errors, in emoji
Dataframe
C++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types, continuous memory storage, and no pointers are involved
Nlp
📝 This repository recorded my NLP journey.
Machine Learning With Python
Small scale machine learning projects to understand the core concepts . Give a Star 🌟If it helps you. BONUS: Interview Bank coming up..!
Threatpursuit Vm
Threat Pursuit Virtual Machine (VM): A fully customizable, open-sourced Windows-based distribution focused on threat intelligence analysis and hunting designed for intel and malware analysts as well as threat hunters to get up and running quickly.
Osint collection
Maintained collection of OSINT related resources. (All Free & Actionable)
Dalex
moDel Agnostic Language for Exploration and eXplanation
Python Machine Learning Book 2nd Edition
The "Python Machine Learning (2nd edition)" book code repository and info resource
Pycall.rb
Calling Python functions from the Ruby language
Awesome Streamlit
The purpose of this project is to share knowledge on how awesome Streamlit is and can be
Machine learning refined
Notes, examples, and Python demos for the textbook "Machine Learning Refined" (published by Cambridge University Press).
Mit 15 003 Data Science Tools
Study guides for MIT's 15.003 Data Science Tools
Rows
A common, beautiful interface to tabular data, no matter the format
Hitchhikers Guide
The Hitchhiker's Guide to Data Science for Social Good
Python Small Examples
告别枯燥,致力于打造 Python 实用小例子,更多Python良心教程见 Python中文网 http://www.zglg.work
Industry Machine Learning
A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
Statistical Rethinking With Python And Pymc3
Python/PyMC3 port of the examples in " Statistical Rethinking A Bayesian Course with Examples in R and Stan" by Richard McElreath
Reflow
A language and runtime for distributed, incremental data processing in the cloud
H1st
The AI Application Platform We All Need. Human AND Machine Intelligence. Based on experience building AI solutions at Panasonic: robotics predictive maintenance, cold-chain energy optimization, Gigafactory battery mfg, avionics, automotive cybersecurity, and more.
541-600 of 1642 data-science projects