A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn friendly interface in an effort to expedite the modeling process.

Stars: ✭ 50 (-60.32%)

Mutual labels: pandas

Pandapy

PandaPy has the speed of NumPy and the usability of Pandas, and is 10x to 50x faster (by @firmai)

Stars: ✭ 474 (+276.19%)

Mutual labels: pandas

Loghouse

Ready-to-use log management solution for Kubernetes that stores data in ClickHouse and provides a web UI.

Stars: ✭ 805 (+538.89%)

Mutual labels: clickhouse

pywedge

Makes interactive chart widgets, cleans raw data, runs baseline models, and supports interactive hyperparameter tuning and tracking

Stars: ✭ 49 (-61.11%)

Mutual labels: dataframe

tsa-tutorial

Material for the tutorial, "Time series analysis with pandas" at T-Academy

Stars: ✭ 21 (-83.33%)

Mutual labels: pandas

Swifter

A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner

Stars: ✭ 1,844 (+1363.49%)

Mutual labels: pandas

google classroom

Google Classroom Data Pipeline

Stars: ✭ 17 (-86.51%)

Mutual labels: pandas

Clickhouse Grafana

ClickHouse datasource for Grafana

Stars: ✭ 462 (+266.67%)

Mutual labels: clickhouse

Information-Retrieval

Information Retrieval algorithms developed in Python. To follow the blog posts, click on the link:

Stars: ✭ 103 (-18.25%)

Mutual labels: pandas

Xyzpy

Efficiently generate and analyse high dimensional data.

Stars: ✭ 45 (-64.29%)

Mutual labels: pandas

dflib

In-memory Java DataFrame library

Stars: ✭ 50 (-60.32%)

Mutual labels: dataframe

Jqdatasdk

A simple, easy-to-use quantitative finance data package (easy utility for getting financial market data of China)

Stars: ✭ 457 (+262.7%)

Mutual labels: pandas

machine-learning-capstone-project

This is the final project for the Udacity Machine Learning Nanodegree: Predicting article retweets and likes based on the title using Machine Learning

Stars: ✭ 28 (-77.78%)

Mutual labels: pandas

five-minute-midas

Stars: ✭ 41 (-67.46%)

Mutual labels: pandas

Data Science Ipython Notebooks

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

Stars: ✭ 22,048 (+17398.41%)

Mutual labels: pandas

ydata-quality

Data Quality assessment with one line of code

Stars: ✭ 311 (+146.83%)

Mutual labels: pandas

Data Science Complete Tutorial

For extensive instructor-led learning

Stars: ✭ 1,027 (+715.08%)

Mutual labels: pandas

degiro-trading-tracker

Simplified tracking of your investments

Stars: ✭ 16 (-87.3%)

Mutual labels: pandas

Pydata Notebook

Chinese translation notes for Python for Data Analysis, 2nd Edition (2017)

Stars: ✭ 4,300 (+3312.7%)

Mutual labels: pandas

DS-Cookbook101

A Jupyter notebook with the most frequently used code snippets for daily data science operations

Stars: ✭ 59 (-53.17%)

Mutual labels: pandas

Seaborn Tutorial

This repository is my attempt to help Data Science aspirants gain the Data Visualization skills required to progress in their careers. It includes all the types of plot offered by Seaborn, applied to random datasets.

Stars: ✭ 114 (-9.52%)

Mutual labels: pandas

jcasts

Simple podcast MVP

Stars: ✭ 27 (-78.57%)

Mutual labels: pandas

Dovpanda

Directions overlay for working with pandas in an analysis environment

Stars: ✭ 419 (+232.54%)

Mutual labels: pandas

dataquest-guided-projects-solutions

My dataquest project solutions

Stars: ✭ 35 (-72.22%)

Mutual labels: pandas

Abu

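Abu quantitative trading system (stocks, options, futures, Bitcoin, machine learning): an open-source, Python-based framework for quantitative trading and quantitative investment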

Stars: ✭ 8,589 (+6716.67%)

Mutual labels: pandas

obsplus

A Pandas-Centric ObsPy Expansion Pack

Stars: ✭ 28 (-77.78%)

Mutual labels: pandas

Finance Go

📊 Financial markets data library implemented in Go.

Stars: ✭ 392 (+211.11%)

Mutual labels: pandas

PandasVersusExcel

Introduction to data analysis with Python; a primer for data analysts

Stars: ✭ 120 (-4.76%)

Mutual labels: pandas

Pymc Example Project

Example PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning.

Stars: ✭ 90 (-28.57%)

Mutual labels: pandas

Python-Data-Visualization

D-Lab's 3-hour introduction to data visualization with Python. Learn how to create histograms, bar plots, box plots, scatter plots, compound figures, and more, using matplotlib and seaborn.

Stars: ✭ 42 (-66.67%)

Mutual labels: pandas

Pandas Technical Indicators

Technical Indicators implemented in Python using Pandas

Stars: ✭ 388 (+207.94%)

Mutual labels: pandas

tutorials

Short programming tutorials pertaining to data analysis.

Stars: ✭ 14 (-88.89%)

Mutual labels: pandas

Lambda Packs

Precompiled packages for AWS Lambda

Stars: ✭ 997 (+691.27%)

Mutual labels: pandas

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Stars: ✭ 13,870 (+10907.94%)

Mutual labels: pandas

Arquero

Query processing and transformation of array-backed data tables.

Stars: ✭ 384 (+204.76%)

Mutual labels: dataframe

awesome-clickhouse

A curated list of awesome ClickHouse software.

Stars: ✭ 71 (-43.65%)

Mutual labels: clickhouse

Pbpython

Code, Notebooks and Examples from Practical Business Python

Stars: ✭ 1,724 (+1268.25%)

Mutual labels: pandas

Stats Maths With Python

General statistics, mathematical programming, and numerical/scientific computing scripts and notebooks in Python

Stars: ✭ 381 (+202.38%)

Mutual labels: pandas

Data-Analyst-Nanodegree

Kai Sheng Teh - Udacity Data Analyst Nanodegree

Stars: ✭ 42 (-66.67%)

Mutual labels: pandas

Flink Learning

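Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, principles, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, and more, along with large-scale production case studies (PV/UV, log storage, real-time deduplication of ten-billion-scale data, monitoring and alerting). You are welcome to support my column "Big Data Real-Time Computing Engine: Flink in Practice and Performance Optimization".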

Stars: ✭ 11,378 (+8930.16%)

Mutual labels: clickhouse

Fecon236

Tools for financial economics. Curated wrapper over Python ecosystem. Source code for fecon235 Jupyter notebooks.

Stars: ✭ 72 (-42.86%)

Mutual labels: pandas

Spark Redis

A connector for Spark that allows reading from and writing to a Redis cluster

Stars: ✭ 773 (+513.49%)

Mutual labels: dataframe

Python-Data-Wrangling

D-Lab's 3-hour introduction to data wrangling in Python. Learn how to import and manipulate dataframes using pandas in Python.

Stars: ✭ 41 (-67.46%)

Mutual labels: pandas

appmetrica-logsapi-loader

A tool for automatic data loading from AppMetrica LogsAPI into (local) ClickHouse

Stars: ✭ 18 (-85.71%)

Mutual labels: clickhouse

Credit Risk Modelling

Credit risk analysis using Python and ML

Stars: ✭ 91 (-27.78%)

Mutual labels: pandas

datahub

DataHub - Synthetic data library

Stars: ✭ 66 (-47.62%)

Mutual labels: pandas

Pandas exercises

Practice your pandas skills!

Stars: ✭ 7,140 (+5566.67%)

Mutual labels: pandas

framequery

SQL on dataframes - pandas and dask

Stars: ✭ 63 (-50%)

Mutual labels: pandas

AlphaVantageAPI

An Opinionated AlphaVantage API Wrapper in Python 3.9. Compatible with Pandas TA (pip install pandas_ta). Get your FREE API Key at https://www.alphavantage.co/support/

Stars: ✭ 77 (-38.89%)

Mutual labels: pandas

Disease Prediction From Symptoms

Disease Prediction based on Symptoms.

Stars: ✭ 70 (-44.44%)

Mutual labels: pandas

Vaex

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualize and explore big tabular data at a billion rows per second 🚀

Stars: ✭ 6,793 (+5291.27%)

Mutual labels: dataframe

pandas-stubs

Pandas type stubs. Helps you type-check your code.