All Categories → Data Processing → data-science

Top 1642 data-science open source projects

Chefboost
A Lightweight Decision Tree Framework supporting regular algorithms: ID3, C4,5, CART, CHAID and Regression Trees; some advanced techniques: Gradient Boosting (GBDT, GBRT, GBM), Random Forest and Adaboost w/categorical features support for Python
Book list
Python, Machine Learning, Deep Learning and Data Science Books
Scikit Plot
An intuitive library to add plotting functionality to scikit-learn objects.
Kd lib
A Pytorch Knowledge Distillation library for benchmarking and extending works in the domains of Knowledge Distillation, Pruning, and Quantization.
Datasets For Good
List of datasets to apply stats/machine learning/technology to the world of social good.
Dstack
An open-source tool to rapidly develop data applications with Python
Data Science Resources
👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Covid19 Severity Prediction
Extensive and accessible COVID-19 data + forecasting for counties and hospitals. 📈
Fedmsg
Federated Messaging with ZeroMQ
Handout
Turn Python scripts into handouts with Markdown and figures
Fixy
Amacımız Türkçe NLP literatüründeki birçok farklı sorunu bir arada çözebilen, eşsiz yaklaşımlar öne süren ve literatürdeki çalışmaların eksiklerini gideren open source bir yazım destekleyicisi/denetleyicisi oluşturmak. Kullanıcıların yazdıkları metinlerdeki yazım yanlışlarını derin öğrenme yaklaşımıyla çözüp aynı zamanda metinlerde anlamsal analizi de gerçekleştirerek bu bağlamda ortaya çıkan yanlışları da fark edip düzeltebilmek.
Learnpythonforresearch
This repository provides everything you need to get started with Python for (social science) research.
Awesome Ai
A curated list of artificial intelligence resources (Courses, Tools, App, Open Source Project)
Bookstore
📚 Notebook storage and publishing workflows for the masses
Presentations
Slide show presentations regarding data driven investing.
Lazynlp
Library to scrape and clean web pages to create massive datasets.
Datascience Pizza
🍕 Repositório para juntar informações sobre materiais de estudo em análise de dados e áreas afins, empresas que trabalham com dados e dicionário de conceitos
Danmf
A sparsity aware implementation of "Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection" (CIKM 2018).
Pzad
Курс "Прикладные задачи анализа данных" (ВМК, МГУ имени М.В. Ломоносова)
Primehub
A toil-free multi-tenancy machine learning platform in your Kubernetes cluster
Ghactions
GitHub actions for R and accompanying R package
Scalable Data Science Platform
Content for architecting a data science platform for products using Luigi, Spark & Flask.
Visualizingtwitchcommunities
Graphing communities on Twitch.tv in a visually intuitive way
Pygm
🐍 Python library implementing sorted containers with state-of-the-art query performance and compressed memory usage
Zigzag
Python library for identifying the peaks and valleys of a time series.
Programming With Data
🐍 Learn Python and Pandas from the ground up
Batchflow
BatchFlow helps you conveniently work with random or sequential batches of your data and define data processing and machine learning workflows even for datasets that do not fit into memory.
Py Quantmod
Powerful financial charting library based on R's Quantmod | http://py-quantmod.readthedocs.io/en/latest/
Metaprob
An embedded language for probabilistic programming and meta-programming.
Rbbjson
Flexible JSON traversal for rapid prototyping.
Pyfts
An open source library for Fuzzy Time Series in Python
Color recognition
🎨 Color recognition & classification & detection on webcam stream / on video / on single image using K-Nearest Neighbors (KNN) is trained with color histogram features by OpenCV.
Data Science Stack Cookiecutter
🐳📊🤓Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyster, Superset, Postgres, Minio, AirFlow & API Star)
Datasciencevm
Tools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
Go Tsne
t-Distributed Stochastic Neighbor Embedding (t-SNE) in Go
Graspologic
Python package for graph statistics
121-180 of 1642 data-science projects