Baby Names AnalysisData ETL & Analysis on the dataset 'Baby Names from Social Security Card Applications - National Data'.
Stars: ✭ 557 (+2685%)
Mexican Government ReportText Mining on the 2019 Mexican Government Report, covering from extracting text from a PDF file to plotting the results.
Stars: ✭ 473 (+2265%)
Jdata京东JData算法大赛-高潜用户购买意向预测入门程序(starter code)
Stars: ✭ 662 (+3210%)
Csvs To SqliteConvert CSV files into a SQLite database
Stars: ✭ 568 (+2740%)
ModinModin: Speed up your Pandas workflows by changing a single line of code
Stars: ✭ 6,639 (+33095%)
YfinanceDownload market data from Yahoo! Finance's API
Stars: ✭ 6,148 (+30640%)
FintaCommon financial technical indicators implemented in Pandas.
Stars: ✭ 901 (+4405%)
Subreddit AnalyzerA comprehensive Data and Text Mining workflow for submissions and comments from any given public subreddit.
Stars: ✭ 447 (+2135%)
SdcIntel® Scalable Dataframe Compiler for Pandas*
Stars: ✭ 623 (+3015%)
TalismanStraightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Stars: ✭ 584 (+2820%)
JdupesA powerful duplicate file finder and an enhanced fork of 'fdupes'.
Stars: ✭ 790 (+3850%)
SequoiaA股自动选股程序,实现了海龟交易法则、缠中说禅牛市买点,以及其他若干种技术形态
Stars: ✭ 564 (+2720%)
KopiaCross-platform backup tool for Windows, macOS & Linux with fast, incremental backups, client-side end-to-end encryption, compression and data deduplication. CLI and GUI included.
Stars: ✭ 507 (+2435%)
Machine Learning머신러닝 입문자 혹은 스터디를 준비하시는 분들에게 도움이 되고자 만든 repository입니다. (This repository is intented for helping whom are interested in machine learning study)
Stars: ✭ 705 (+3425%)
Dataframe GoDataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
Stars: ✭ 487 (+2335%)
Pyda 2e Zh📖 [译] 利用 Python 进行数据分析 · 第 2 版
Stars: ✭ 866 (+4230%)
Jqdatasdk简单易用的量化金融数据包(easy utility for getting financial market data of China)
Stars: ✭ 457 (+2185%)
PingouinStatistical package in Python based on Pandas
Stars: ✭ 651 (+3155%)
Pytablewriterpytablewriter is a Python library to write a table in various formats: CSV / Elasticsearch / HTML / JavaScript / JSON / LaTeX / LDJSON / LTSV / Markdown / MediaWiki / NumPy / Excel / Pandas / Python / reStructuredText / SQLite / TOML / TSV.
Stars: ✭ 422 (+2010%)
QuickvizVisualize a pandas dataframe in a few clicks
Stars: ✭ 18 (-10%)
Finance Go📊 Financial markets data library implemented in go.
Stars: ✭ 392 (+1860%)
SiubaPython library for using dplyr like syntax with pandas and SQL
Stars: ✭ 605 (+2925%)
Prince👑 Python factor analysis library (PCA, CA, MCA, MFA, FAMD)
Stars: ✭ 591 (+2855%)
CudfcuDF - GPU DataFrame Library
Stars: ✭ 4,370 (+21750%)
DataframeC++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types, continuous memory storage, and no pointers are involved
Stars: ✭ 828 (+4040%)
IexfinancePython SDK for IEX Cloud
Stars: ✭ 573 (+2765%)
S3bpRead and write Python objects to S3, caching them on your hard drive to avoid unnecessary IO.
Stars: ✭ 24 (+20%)
AlphapyAutomated Machine Learning [AutoML] with Python, scikit-learn, Keras, XGBoost, LightGBM, and CatBoost
Stars: ✭ 564 (+2720%)
Data Science PortfolioPortfolio of data science projects completed by me for academic, self learning, and hobby purposes.
Stars: ✭ 559 (+2695%)
Pandas ProfilingCreate HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+41545%)
RecordlinkageA toolkit for record linkage and duplicate detection in Python
Stars: ✭ 532 (+2560%)
Fecon235Notebooks for financial economics. Keywords: Jupyter notebook pandas Federal Reserve FRED Ferbus GDP CPI PCE inflation unemployment wage income debt Case-Shiller housing asset portfolio equities SPX bonds TIPS rates currency FX euro EUR USD JPY yen XAU gold Brent WTI oil Holt-Winters time-series forecasting statistics econometrics
Stars: ✭ 708 (+3440%)
PanderaA light-weight, flexible, and expressive pandas data validation library
Stars: ✭ 506 (+2430%)
BoltzmanncleanFill missing values in Pandas DataFrames using Restricted Boltzmann Machines
Stars: ✭ 23 (+15%)
Or Pandas【运筹OR帷幄|数据科学】pandas教程系列电子书
Stars: ✭ 492 (+2360%)
RdedupData deduplication engine, supporting optional compression and public key encryption.
Stars: ✭ 690 (+3350%)
PandapyPandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai)
Stars: ✭ 474 (+2270%)
Yelp dataset challengePlay around with Yelp dataset in Python (in progress and very messy repo)
Stars: ✭ 15 (-25%)
PynamicalPynamical is a Python package for modeling and visualizing discrete nonlinear dynamical systems, chaos, and fractals.
Stars: ✭ 458 (+2190%)
Data Science Ipython NotebooksData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+110140%)
BorgmaticSimple, configuration-driven backup software for servers and workstations
Stars: ✭ 902 (+4410%)
PyjanitorClean APIs for data cleaning. Python implementation of R package Janitor
Stars: ✭ 647 (+3135%)
DovpandaDirections overlay for working with pandas in an analysis environment
Stars: ✭ 419 (+1995%)
DisatbotDABOT: Disaster Attention Bot
Stars: ✭ 26 (+30%)
AlertmanagerPrometheus Alertmanager
Stars: ✭ 4,574 (+22770%)
Bamboolibbamboolib - a GUI for pandas DataFrames
Stars: ✭ 622 (+3010%)
Pandas JsPandas in JavaScript for data analysis and visualization
Stars: ✭ 389 (+1845%)
FoxcrossAsyncIO serving for data science models
Stars: ✭ 18 (-10%)
DatasheetsRead data from, write data to, and modify the formatting of Google Sheets
Stars: ✭ 593 (+2865%)
KodiakEnhance your feature engineering workflow with Kodiak
Stars: ✭ 20 (+0%)
NumsharpHigh Performance Computation for N-D Tensors in .NET, similar API to NumPy.
Stars: ✭ 882 (+4310%)
PhildbTimeseries database
Stars: ✭ 25 (+25%)
LuxPython API for Intelligent Visual Data Discovery
Stars: ✭ 787 (+3835%)
PdpipeEasy pipelines for pandas DataFrames.
Stars: ✭ 590 (+2850%)