ChefboostA Lightweight Decision Tree Framework supporting regular algorithms: ID3, C4,5, CART, CHAID and Regression Trees; some advanced techniques: Gradient Boosting (GBDT, GBRT, GBM), Random Forest and Adaboost w/categorical features support for Python
Book listPython, Machine Learning, Deep Learning and Data Science Books
Scikit PlotAn intuitive library to add plotting functionality to scikit-learn objects.
Kd libA Pytorch Knowledge Distillation library for benchmarking and extending works in the domains of Knowledge Distillation, Pruning, and Quantization.
Datasets For GoodList of datasets to apply stats/machine learning/technology to the world of social good.
DstackAn open-source tool to rapidly develop data applications with Python
100 Days Of Ml CodeA day to day plan for this challenge. Covers both theoritical and practical aspects
Data Science Resources👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
JaxnetConcise deep learning for JAX
AirbyteAirbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
AuptimizerAn automatic ML model optimization tool.
AulasAulas da Escola de Inteligência Artificial de São Paulo
FedmsgFederated Messaging with ZeroMQ
HandoutTurn Python scripts into handouts with Markdown and figures
FixyAmacımız Türkçe NLP literatüründeki birçok farklı sorunu bir arada çözebilen, eşsiz yaklaşımlar öne süren ve literatürdeki çalışmaların eksiklerini gideren open source bir yazım destekleyicisi/denetleyicisi oluşturmak. Kullanıcıların yazdıkları metinlerdeki yazım yanlışlarını derin öğrenme yaklaşımıyla çözüp aynı zamanda metinlerde anlamsal analizi de gerçekleştirerek bu bağlamda ortaya çıkan yanlışları da fark edip düzeltebilmek.
LearnpythonforresearchThis repository provides everything you need to get started with Python for (social science) research.
Sweetie DataThis repo contains logstash of various honeypots
Awesome AiA curated list of artificial intelligence resources (Courses, Tools, App, Open Source Project)
Bookstore📚 Notebook storage and publishing workflows for the masses
PresentationsSlide show presentations regarding data driven investing.
LazynlpLibrary to scrape and clean web pages to create massive datasets.
Datascience Pizza🍕 Repositório para juntar informações sobre materiais de estudo em análise de dados e áreas afins, empresas que trabalham com dados e dicionário de conceitos
DanmfA sparsity aware implementation of "Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection" (CIKM 2018).
PzadКурс "Прикладные задачи анализа данных" (ВМК, МГУ имени М.В. Ломоносова)
PrimehubA toil-free multi-tenancy machine learning platform in your Kubernetes cluster
GhactionsGitHub actions for R and accompanying R package
FastbookThe fastai book, published as Jupyter Notebooks
GensimTopic Modelling for Humans
Awesome Pytorch ListA comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
GeniA Clojure dataframe library that runs on Spark
Pygm🐍 Python library implementing sorted containers with state-of-the-art query performance and compressed memory usage
ZigzagPython library for identifying the peaks and valleys of a time series.
BatchflowBatchFlow helps you conveniently work with random or sequential batches of your data and define data processing and machine learning workflows even for datasets that do not fit into memory.
Py QuantmodPowerful financial charting library based on R's Quantmod | http://py-quantmod.readthedocs.io/en/latest/
MetaprobAn embedded language for probabilistic programming and meta-programming.
RbbjsonFlexible JSON traversal for rapid prototyping.
PyftsAn open source library for Fuzzy Time Series in Python
Color recognition🎨 Color recognition & classification & detection on webcam stream / on video / on single image using K-Nearest Neighbors (KNN) is trained with color histogram features by OpenCV.
Data Science Stack Cookiecutter🐳📊🤓Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyster, Superset, Postgres, Minio, AirFlow & API Star)
DatasciencevmTools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
Go Tsnet-Distributed Stochastic Neighbor Embedding (t-SNE) in Go
MarianaThe Cutest Deep Learning Framework which is also a wonderful Declarative Language