Top 27 pydata open source projects

Kartothek
A consistent table management library in python
Scattertext Pydata
Notebooks for the Seattle PyData 2017 talk on Scattertext
Pydata Chicago2016 Ml Tutorial
Machine learning with scikit-learn tutorial at PyData Chicago 2016
Pymapd
Python client for OmniSci GPU-accelerated SQL engine and analytics platform
Pyvtreat
vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.
Distributed
A distributed task scheduler for Dask
Dask
Parallel computing with task scheduling
Pydata.kr
PyData Korea 공식 홈페이지입니다. (준비중)
Neural Image Captioning
Implementation of Neural Image Captioning model using Keras with Theano backend
Array Api
RFC document, tooling and other content related to the array API standard
Pyjanitor
Clean APIs for data cleaning. Python implementation of R package Janitor
ml at awslambda pydatabln2018
Material for working alongside my workshop session at PyData Berlin 2018
pydataberlin-2017
Repo for my talk at the PyData Berlin 2017 conference
array-api-comparison
Data and tooling to compare the API surfaces of various array libraries.
meetup-slides
Speaker slides from monthly meetups and conference
PyData-Pseudolabelling-Keynote
Accompanying notebook and sources to "A Guide to Pseudolabelling: How to get a Kaggle medal with only one model" (Dec. 2020 PyData Boston-Cambridge Keynote)
mapshader
Simple Python GIS Web Services
sktime-tutorial-pydata-amsterdam-2020
Introduction to Machine Learning with Time Series at PyData Festival Amsterdam 2020
1-27 of 27 pydata projects