Top 471 pandas open source projects

Jardin
A pandas.DataFrame-based ORM.
Covid 19 jhu data web scrap and cleaning
This repository contains data and code used to get and clean data from https://github.com/CSSEGISandData/COVID-19 and https://www.worldometers.info/coronavirus/
Docker Alpine Python Machinelearning
Small Docker image with Python Machine Learning tools (~180MB) https://hub.docker.com/r/frolvlad/alpine-python-machinelearning/
Locopy
locopy: Loading/Unloading to Redshift and Snowflake using Python.
Fecon236
Tools for financial economics. Curated wrapper over Python ecosystem. Source code for fecon235 Jupyter notebooks.
Pangres
SQL upsert using pandas DataFrames for PostgreSQL, SQlite and MySQL with extra features
Seaborn
Statistical data visualization in Python
Gsee
GSEE: Global Solar Energy Estimator
Pydata Pandas Workshop
Material for my PyData Jupyter & Pandas Workshops, I'm also available for personal in-house trainings on request
Python
Jupyter notebooks and datasets for the interesting pandas/python/data science video series.
Head Pose Estimator.caffe
Head Pose Estimator on Caffe
Dask
Parallel computing with task scheduling
Skoot
A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn friendly interface in an effort to expedite the modeling process.
Iml
Курс "Введение в машинное обучение" (ВМК, МГУ имени М.В. Ломоносова)
Xyzpy
Efficiently generate and analyse high dimensional data.
Django Rest Pandas
📊📈 Serves up Pandas dataframes via the Django REST Framework for use in client-side (i.e. d3.js) visualizations and offline analysis (e.g. Excel)
10 Simple Hacks To Speed Up Your Data Analysis In Python
Some useful Tips and Tricks to speed up the data analysis process in Python.
Abu
阿布量化交易系统(股票,期权,期货,比特币,机器学习) 基于python的开源量化交易,量化投资架构
Pandas Ml Quant
Master repository for the pandas-ml modules
Python stock github
Python 量化投资及 Github 管理学习笔记
Pandas Ta
Technical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 130+ Indicators
Pandas basics
basic pandas tutorials
Python for ml
brief introduction to Python for machine learning
Pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Pythondatasciencehandbook
The book was written and tested with Python 3.5, though other Python versions (including Python 2.7) should work in nearly all cases.
Pandas Validation
A small Python library for validating data with pandas
Mgflappy Bird
飞翔的小鸟:是一个飞翔的小鸟通过障碍物得分的小游戏和熊猫(Panda):是一款以熊猫为主题的游戏,你将会化身行动敏捷神速的熊猫
Dupandas
📊 python package for performing deduplication using flexible text matching and cleaning in pandas dataframe
Kodiak
Enhance your feature engineering workflow with Kodiak
Yelp dataset challenge
Play around with Yelp dataset in Python (in progress and very messy repo)
Numsharp
High Performance Computation for N-D Tensors in .NET, similar API to NumPy.
Pyda 2e Zh
📖 [译] 利用 Python 进行数据分析 · 第 2 版
Disatbot
DABOT: Disaster Attention Bot
S3bp
Read and write Python objects to S3, caching them on your hard drive to avoid unnecessary IO.
Boltzmannclean
Fill missing values in Pandas DataFrames using Restricted Boltzmann Machines
Finta
Common financial technical indicators implemented in Pandas.
Quickviz
Visualize a pandas dataframe in a few clicks
Lux
Python API for Intelligent Visual Data Discovery
Dataframe
C++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types, continuous memory storage, and no pointers are involved
Modin
Modin: Speed up your Pandas workflows by changing a single line of code
Fecon235
Notebooks for financial economics. Keywords: Jupyter notebook pandas Federal Reserve FRED Ferbus GDP CPI PCE inflation unemployment wage income debt Case-Shiller housing asset portfolio equities SPX bonds TIPS rates currency FX euro EUR USD JPY yen XAU gold Brent WTI oil Holt-Winters time-series forecasting statistics econometrics