Lazynlp: Library to scrape and clean web pages to create massive datasets.
Stars: ✭ 1,985 (+1002.78%)
Sweetie Data: This repo contains logstash of various honeypots
Stars: ✭ 163 (-9.44%)
Zigzag: Python library for identifying the peaks and valleys of a time series.
Stars: ✭ 156 (-13.33%)
Anndata: Annotated data.
Stars: ✭ 171 (-5%)
Primehub: A toil-free multi-tenancy machine learning platform in your Kubernetes cluster
Stars: ✭ 160 (-11.11%)
Chefboost: A lightweight decision tree framework for Python with categorical feature support. Covers regular algorithms (ID3, C4.5, CART, CHAID and regression trees) and some advanced techniques (gradient boosting (GBDT, GBRT, GBM), random forest and AdaBoost).
Stars: ✭ 176 (-2.22%)
Auptimizer: An automatic ML model optimization tool.
Stars: ✭ 166 (-7.78%)
Py Quantmod: Powerful financial charting library based on R's Quantmod | http://py-quantmod.readthedocs.io/en/latest/
Stars: ✭ 155 (-13.89%)
Dstack: An open-source tool to rapidly develop data applications with Python
Stars: ✭ 174 (-3.33%)
Bookstore: 📚 Notebook storage and publishing workflows for the masses
Stars: ✭ 162 (-10%)
Soda Sql: Metric collection, data testing and monitoring for SQL-accessible data
Stars: ✭ 173 (-3.89%)
Danmf: A sparsity-aware implementation of "Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection" (CIKM 2018).
Stars: ✭ 161 (-10.56%)
Data Science Resources: 👨🏽🏫 You can learn about what data science is and why it's important in today's modern world. Are you interested in data science? 🔋
Stars: ✭ 171 (-5%)
Ghactions: GitHub Actions for R and accompanying R package
Stars: ✭ 159 (-11.67%)
Fastbook: The fastai book, published as Jupyter Notebooks
Stars: ✭ 13,998 (+7676.67%)
Matplotplusplus: Matplot++, a C++ graphics library for data visualization 📊🗾
Stars: ✭ 2,433 (+1251.67%)
Geni: A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-15.56%)
Scikit Plot: An intuitive library to add plotting functionality to scikit-learn objects.
Stars: ✭ 2,162 (+1101.11%)
Batchflow: BatchFlow helps you conveniently work with random or sequential batches of your data and define data processing and machine learning workflows even for datasets that do not fit into memory.
Stars: ✭ 156 (-13.33%)
Fedmsg: Federated Messaging with ZeroMQ
Stars: ✭ 165 (-8.33%)
Fixy: Our goal is to build an open-source writing assistant/checker that tackles many different problems in the Turkish NLP literature together, proposes unique approaches, and fills the gaps in existing work. It aims to correct spelling mistakes in users' texts with a deep learning approach while also performing semantic analysis of the texts, so that errors arising in that context can be detected and corrected as well.
Stars: ✭ 165 (-8.33%)
Metaprob: An embedded language for probabilistic programming and meta-programming.
Stars: ✭ 155 (-13.89%)
Datasets For Good: List of datasets to apply stats/machine learning/technology to the world of social good.
Stars: ✭ 174 (-3.33%)
Learnpythonforresearch: This repository provides everything you need to get started with Python for (social science) research.
Stars: ✭ 163 (-9.44%)
Awesome Ai: A curated list of artificial intelligence resources (courses, tools, apps, open source projects)
Stars: ✭ 161 (-10.56%)
Deep Spying: Spying using Smartwatch and Deep Learning
Stars: ✭ 172 (-4.44%)
Presentations: Slide show presentations regarding data-driven investing.
Stars: ✭ 162 (-10%)
Andrew Ng Notes: Handwritten notes from Andrew Ng's Coursera course.
Stars: ✭ 180 (+0%)
Datascience Pizza: 🍕 Repository gathering information on study materials for data analysis and related fields, companies that work with data, and a dictionary of concepts
Stars: ✭ 2,043 (+1035%)
100 Days Of Ml Code: A day-to-day plan for this challenge. Covers both theoretical and practical aspects.
Stars: ✭ 172 (-4.44%)
Pzad: Course "Applied Problems of Data Analysis" (Faculty of Computational Mathematics and Cybernetics, Lomonosov Moscow State University)
Stars: ✭ 160 (-11.11%)
Metrics: Machine learning metrics for distributed, scalable PyTorch applications.
Stars: ✭ 162 (-10%)
Jaxnet: Concise deep learning for JAX
Stars: ✭ 171 (-5%)
Computationalhealthcare: A platform for analysis & development of machine learning models using large de-identified healthcare datasets.
Stars: ✭ 180 (+0%)
Data Science Toolkit: Collection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (-6.11%)
Gensim: Topic Modelling for Humans
Stars: ✭ 12,763 (+6990.56%)
Book list: Python, Machine Learning, Deep Learning and Data Science Books
Stars: ✭ 176 (-2.22%)
Awesome Pytorch List: A comprehensive list of PyTorch-related content on GitHub, such as different models, implementations, helper libraries, tutorials, etc.
Stars: ✭ 12,475 (+6830.56%)
Airbyte: Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+2632.78%)
Pygm: 🐍 Python library implementing sorted containers with state-of-the-art query performance and compressed memory usage
Stars: ✭ 156 (-13.33%)
Aulas: Classes from the Escola de Inteligência Artificial de São Paulo (São Paulo School of Artificial Intelligence)
Stars: ✭ 166 (-7.78%)
Handout: Turn Python scripts into handouts with Markdown and figures
Stars: ✭ 1,973 (+996.11%)
Lets Plot Kotlin: Kotlin API for Lets-Plot, an open-source plotting library for statistical data.
Stars: ✭ 181 (+0.56%)
Ml Glossary: Machine learning glossary
Stars: ✭ 2,338 (+1198.89%)
Deep Rules: Ten Quick Tips for Deep Learning in Biology
Stars: ✭ 179 (-0.56%)
Kd lib: A PyTorch knowledge distillation library for benchmarking and extending works in the domains of knowledge distillation, pruning, and quantization.
Stars: ✭ 173 (-3.89%)
Boostaroota: A fast XGBoost feature selection algorithm
Stars: ✭ 165 (-8.33%)