Or Pandas【运筹OR帷幄|数据科学】pandas教程系列电子书
Stars: ✭ 492 (+290.48%)
SspipeSimple Smart Pipe: python productivity-tool for rapid data manipulation
Stars: ✭ 96 (-23.81%)
datatileA library for managing, validating, summarizing, and visualizing data.
Stars: ✭ 419 (+232.54%)
rle-arrayRun-length encoded arrays for pandas.
Stars: ✭ 20 (-84.13%)
SkootA package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn friendly interface in an effort to expedite the modeling process.
Stars: ✭ 50 (-60.32%)
PandapyPandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai)
Stars: ✭ 474 (+276.19%)
LoghouseReady to use log management solution for Kubernetes storing data in ClickHouse and providing web UI.
Stars: ✭ 805 (+538.89%)
pywedgeMakes Interactive Chart Widget, Cleans raw data, Runs baseline models, Interactive hyperparameter tuning & tracking
Stars: ✭ 49 (-61.11%)
tsa-tutorialMaterial for the tutorial, "Time series analysis with pandas" at T-Academy
Stars: ✭ 21 (-83.33%)
SwifterA package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
Stars: ✭ 1,844 (+1363.49%)
Information-RetrievalInformation Retrieval algorithms developed in python. To follow the blog posts, click on the link:
Stars: ✭ 103 (-18.25%)
XyzpyEfficiently generate and analyse high dimensional data.
Stars: ✭ 45 (-64.29%)
dflibIn-memory Java DataFrame library
Stars: ✭ 50 (-60.32%)
Jqdatasdk简单易用的量化金融数据包(easy utility for getting financial market data of China)
Stars: ✭ 457 (+262.7%)
machine-learning-capstone-projectThis is the final project for the Udacity Machine Learning Nanodegree: Predicting article retweets and likes based on the title using Machine Learning
Stars: ✭ 28 (-77.78%)
five-minute-midasPredicting Profitable Day Trading Positions using Decision Tree Classifiers. scikit-learn | Flask | SQLite3 | pandas | MLflow | Heroku | Streamlit
Stars: ✭ 41 (-67.46%)
Data Science Ipython NotebooksData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+17398.41%)
ydata-qualityData Quality assessment with one line of code
Stars: ✭ 311 (+146.83%)
DS-Cookbook101A jupyter notebook having all most frequent used code snippet for daily data scienceoperations
Stars: ✭ 59 (-53.17%)
Seaborn TutorialThis repository is my attempt to help Data Science aspirants gain necessary Data Visualization skills required to progress in their career. It includes all the types of plot offered by Seaborn, applied on random datasets.
Stars: ✭ 114 (-9.52%)
jcastsSimple podcast MVP
Stars: ✭ 27 (-78.57%)
DovpandaDirections overlay for working with pandas in an analysis environment
Stars: ✭ 419 (+232.54%)
Abu阿布量化交易系统(股票,期权,期货,比特币,机器学习) 基于python的开源量化交易,量化投资架构
Stars: ✭ 8,589 (+6716.67%)
obsplusA Pandas-Centric ObsPy Expansion Pack
Stars: ✭ 28 (-77.78%)
Finance Go📊 Financial markets data library implemented in go.
Stars: ✭ 392 (+211.11%)
Pymc Example ProjectExample PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning.
Stars: ✭ 90 (-28.57%)
Python-Data-VisualizationD-Lab's 3 hour introduction to data visualization with Python. Learn how to create histograms, bar plots, box plots, scatter plots, compound figures, and more, using matplotlib and seaborn.
Stars: ✭ 42 (-66.67%)
tutorialsShort programming tutorials pertaining to data analysis.
Stars: ✭ 14 (-88.89%)
Lambda PacksPrecompiled packages for AWS Lambda
Stars: ✭ 997 (+691.27%)
datasets🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+10907.94%)
ArqueroQuery processing and transformation of array-backed data tables.
Stars: ✭ 384 (+204.76%)
PbpythonCode, Notebooks and Examples from Practical Business Python
Stars: ✭ 1,724 (+1268.25%)
Stats Maths With PythonGeneral statistics, mathematical programming, and numerical/scientific computing scripts and notebooks in Python
Stars: ✭ 381 (+202.38%)
Flink Learningflink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
Stars: ✭ 11,378 (+8930.16%)
Fecon236Tools for financial economics. Curated wrapper over Python ecosystem. Source code for fecon235 Jupyter notebooks.
Stars: ✭ 72 (-42.86%)
Spark RedisA connector for Spark that allows reading and writing to/from Redis cluster
Stars: ✭ 773 (+513.49%)
Python-Data-WranglingD-Lab's 3 hour introduction to data wrangling in Python. Learn how to import and manipulate dataframes using pandas in Python.
Stars: ✭ 41 (-67.46%)
appmetrica-logsapi-loaderA tool for automatic data loading from AppMetrica LogsAPI into (local) ClickHouse
Stars: ✭ 18 (-85.71%)
datahubDataHub - Synthetic data library
Stars: ✭ 66 (-47.62%)
framequerySQL on dataframes - pandas and dask
Stars: ✭ 63 (-50%)
AlphaVantageAPIAn Opinionated AlphaVantage API Wrapper in Python 3.9. Compatible with Pandas TA (pip install pandas_ta). Get your FREE API Key at https://www.alphavantage.co/support/
Stars: ✭ 77 (-38.89%)
VaexOut-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualize and explore big tabular data at a billion rows per second 🚀
Stars: ✭ 6,793 (+5291.27%)
pandas-stubsPandas type stubs. Helps you type-check your code.
Stars: ✭ 84 (-33.33%)
open-data-anonimizerPython Data Anonymization & Masking Library For Data Science Tasks
Stars: ✭ 36 (-71.43%)