Data Forge Js: JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 139 (+379.31%)
fairlens: Identify bias and measure fairness of your data
Stars: ✭ 51 (+75.86%)
Data Forge Ts: The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 967 (+3234.48%)
Pandas Datareader: Extract data from a wide range of Internet sources into a pandas DataFrame.
Stars: ✭ 2,183 (+7427.59%)
Pandera: A light-weight, flexible, and expressive pandas data validation library
Stars: ✭ 506 (+1644.83%)
Data Science Hacks: Tips and tricks to help you become a better data scientist, for everyone from beginner to advanced. Covers Python, Jupyter Notebook, pandas hacks, and so on.
Stars: ✭ 273 (+841.38%)
Pyjanitor: Clean APIs for data cleaning. Python implementation of the R package janitor.
Stars: ✭ 647 (+2131.03%)
Apogee: Tools for dealing with APOGEE data
Stars: ✭ 34 (+17.24%)
visions: Type System for Data Analysis in Python
Stars: ✭ 136 (+368.97%)
GreyNSights: Privacy-Preserving Data Analysis using Pandas
Stars: ✭ 18 (-37.93%)
Cubes: Light-weight Python OLAP framework for multi-dimensional data analysis
Stars: ✭ 1,393 (+4703.45%)
Airbyte: Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+16862.07%)
Data Science Resources: Learn what data science is and why it's important in today's world. Are you interested in data science? 🔋
Stars: ✭ 171 (+489.66%)
Pdpipe: Easy pipelines for pandas DataFrames.
Stars: ✭ 590 (+1934.48%)
Datasheets: Read data from, write data to, and modify the formatting of Google Sheets
Stars: ✭ 593 (+1944.83%)
Riceteacatpanda: Repo with challenge material for riceteacatpanda (2020)
Stars: ✭ 18 (-37.93%)
Metabase: The simplest, fastest way to get business intelligence and analytics to everyone in your company 😋
Stars: ✭ 26,803 (+92324.14%)
Locopy: Loading and unloading to Redshift and Snowflake using Python.
Stars: ✭ 73 (+151.72%)
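Gopup: Data interfaces: Baidu, Google, Toutiao, and Weibo indices, macroeconomic data, interest rate data, currency exchange rates, Qianlima and unicorn companies, Xinwen Lianbo transcripts, film and TV box office data, university lists, epidemic data…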
Stars: ✭ 1,229 (+4137.93%)
Openrefine: OpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (+29317.24%)
Pandas Gbq: Pandas Google BigQuery
Stars: ✭ 243 (+737.93%)
kobe-every-shot-ever: A Los Angeles Times analysis of every shot in Kobe Bryant's NBA career
Stars: ✭ 66 (+127.59%)
PracticalMachineLearning: A collection of ML-related material including notebooks, code, and a curated list of useful resources such as books and software. Almost everything mentioned here is free (as in speech, not free food) or open-source.
Stars: ✭ 60 (+106.9%)
whyqd: Data wrangling simplicity, complete audit transparency, and at speed
Stars: ✭ 16 (-44.83%)
Data-Science-101: Notes and tutorials on how to use Python, pandas, seaborn, NumPy, Matplotlib, and SciPy for data science.
Stars: ✭ 19 (-34.48%)
DataProfiler: What's in your data? Extract schema, statistics and entities from datasets
Stars: ✭ 843 (+2806.9%)
Iexfinance: Python SDK for IEX Cloud
Stars: ✭ 573 (+1875.86%)
Knowledge Repo: A next-generation curated knowledge sharing platform for data scientists and other technical professions.
Stars: ✭ 4,956 (+16989.66%)
Valid.js: 📝 A library for data validation.
Stars: ✭ 604 (+1982.76%)
Finance Go: 📊 Financial markets data library implemented in Go.
Stars: ✭ 392 (+1251.72%)
Skdata: Python tools for data analysis
Stars: ✭ 16 (-44.83%)
Pycm: Multi-class confusion matrix library in Python
Stars: ✭ 1,076 (+3610.34%)
Datacleaner: The premier open source Data Quality solution
Stars: ✭ 391 (+1248.28%)
Startr: A template for data journalism in R
Stars: ✭ 69 (+137.93%)
Graphia: A visualisation tool for the creation and analysis of graphs
Stars: ✭ 67 (+131.03%)
Flyte: Accelerate your ML and data workflows to production. Flyte is a production-grade orchestration system for your data and ML workloads. It has been battle-tested at Lyft, Spotify, Freenome and others, and is truly open-source.
Stars: ✭ 1,242 (+4182.76%)
Datacomparer: dataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.
Stars: ✭ 58 (+100%)
Datacompy: Pandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (+406.9%)
Meza: A Python toolkit for processing tabular data
Stars: ✭ 374 (+1189.66%)
datatile: A library for managing, validating, summarizing, and visualizing data.
Stars: ✭ 419 (+1344.83%)
Weld: High-performance runtime for data analytics applications
Stars: ✭ 2,709 (+9241.38%)
Gspread Pandas: A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
Stars: ✭ 226 (+679.31%)
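Tianyancha: A pip-installable Tianyancha scraper API that saves the business registration information of one or more specified companies to Excel/JSON format in one click. A battery-included scraper API of Tianyancha, the best Chinese business data and investigation platform.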
Stars: ✭ 206 (+610.34%)
Datscan: DatScan is an initiative to build an open-source CMS capable of solving any problem using data analysis, with the help of various modules and a vast standardized module library.
Stars: ✭ 13 (-55.17%)
tempo: API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation
Stars: ✭ 212 (+631.03%)
pandas-workshop: An introductory workshop on pandas with notebooks and exercises for following along.
Stars: ✭ 161 (+455.17%)
online-course-recommendation-system: Built on data fetched from Pluralsight's course API. Works with a model trained with the K-means unsupervised clustering algorithm.
Stars: ✭ 31 (+6.9%)
Product-Categorization-NLP: Multi-class text classification for products based on their description, using machine learning algorithms and neural networks (MLP, CNN, DistilBERT).
Stars: ✭ 30 (+3.45%)
ipython-notebooks: A collection of Jupyter notebooks exploring different datasets.
Stars: ✭ 43 (+48.28%)
pyvaru: Rule-based data validation library for Python 3.
Stars: ✭ 17 (-41.38%)
Akshare: AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.
Stars: ✭ 4,334 (+14844.83%)
Volbx: Graphical tool for data manipulation written in C++/Qt
Stars: ✭ 187 (+544.83%)