validada
Another library for defensive data analysis.
Stars: ✭ 29 (-43.14%)
Pandas Datareader
Extract data from a wide range of Internet sources into a pandas DataFrame.
Stars: ✭ 2,183 (+4180.39%)
Pycm
Multi-class confusion matrix library in Python
Stars: ✭ 1,076 (+2009.8%)
Data Forge Js
JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 139 (+172.55%)
Sweetviz
Visualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (+3529.41%)
Data Forge Ts
The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 967 (+1796.08%)
Pandas Profiling
Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+16231.37%)
Data Science Hacks
Data Science Hacks consists of tips and tricks to help you become a better data scientist. The hacks are for everyone, from beginner to advanced, and cover Python, Jupyter Notebook, pandas, and more.
Stars: ✭ 273 (+435.29%)
Cubes
Light-weight Python OLAP framework for multi-dimensional data analysis
Stars: ✭ 1,393 (+2631.37%)
Datacompy
Pandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (+188.24%)
Datacomparer
dataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.
Stars: ✭ 58 (+13.73%)
Graphia
A visualisation tool for the creation and analysis of graphs
Stars: ✭ 67 (+31.37%)
Flyte
Accelerate your ML and data workflows to production. Flyte is a production-grade orchestration system for your data and ML workloads. It has been battle-tested at Lyft, Spotify, Freenome and others, and is truly open source.
Stars: ✭ 1,242 (+2335.29%)
Categoricalarrays.jl
Arrays for working with categorical data (both nominal and ordinal)
Stars: ✭ 71 (+39.22%)
Data Science Resources
👨🏽🏫 You can learn about what data science is and why it's important in today's modern world. Are you interested in data science? 🔋
Stars: ✭ 171 (+235.29%)
Volbx
Graphical tool for data manipulation written in C++/Qt
Stars: ✭ 187 (+266.67%)
Pandas Gbq
Pandas Google BigQuery
Stars: ✭ 243 (+376.47%)
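Tianyancha
A pip-installable Tianyancha scraper API that saves the business registration information of one or more specified companies to Excel/JSON format in one step. A batteries-included scraper API for Tianyancha, the best Chinese business data and investigation platform.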
Stars: ✭ 206 (+303.92%)
kobe-every-shot-ever
A Los Angeles Times analysis of every shot in Kobe Bryant's NBA career
Stars: ✭ 66 (+29.41%)
GreyNSights
Privacy-Preserving Data Analysis using Pandas
Stars: ✭ 18 (-64.71%)
Data-Science-101
Notes and tutorials on how to use Python, pandas, seaborn, NumPy, Matplotlib, and SciPy for data science.
Stars: ✭ 19 (-62.75%)
pandas-workshop
An introductory workshop on pandas with notebooks and exercises for following along.
Stars: ✭ 161 (+215.69%)
tutorials
Short programming tutorials pertaining to data analysis.
Stars: ✭ 14 (-72.55%)
Openrefine
OpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (+16627.45%)
Startr
A template for data journalism in R
Stars: ✭ 69 (+35.29%)
Apogee
Tools for dealing with APOGEE data
Stars: ✭ 34 (-33.33%)
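Gopup
Data interfaces: Baidu, Google, Toutiao, and Weibo indices; macroeconomic data; interest rate data; currency exchange rates; "Qianlima" (fast-growing) and unicorn companies; Xinwen Lianbo news transcripts; film and TV box office data; university lists; epidemic data…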
Stars: ✭ 1,229 (+2309.8%)
Locopy
locopy: Loading/Unloading to Redshift and Snowflake using Python.
Stars: ✭ 73 (+43.14%)
LFM1b-analyses
Python scripts for studying bias in recommender systems
Stars: ✭ 18 (-64.71%)
facerec-bias-bfw
Source code and notebooks to reproduce experiments and benchmarks on Bias Faces in the Wild (BFW).
Stars: ✭ 40 (-21.57%)
ipython-notebooks
A collection of Jupyter notebooks exploring different datasets.
Stars: ✭ 43 (-15.69%)
Airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+9545.1%)
Stats
A well-tested and comprehensive Golang statistics library package with no dependencies.
Stars: ✭ 2,196 (+4205.88%)
Weld
High-performance runtime for data analytics applications
Stars: ✭ 2,709 (+5211.76%)
Gspread Pandas
A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
Stars: ✭ 226 (+343.14%)
Riceteacatpanda
Repo with challenge material for riceteacatpanda (2020)
Stars: ✭ 18 (-64.71%)
Datscan
DatScan is an initiative to build an open-source CMS capable of solving any problem through data analysis, with the help of various modules and a vast standardized module library.
Stars: ✭ 13 (-74.51%)
DataProfiler
What's in your data? Extract schema, statistics and entities from datasets
Stars: ✭ 843 (+1552.94%)
tempo
API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc.), AS OF joins, downsampling, and interpolation
Stars: ✭ 212 (+315.69%)
online-course-recommendation-system
Built on data fetched from Pluralsight's course API. Works with a model trained using the K-means unsupervised clustering algorithm.
Stars: ✭ 31 (-39.22%)
datatile
A library for managing, validating, summarizing, and visualizing data.
Stars: ✭ 419 (+721.57%)
whyqd
Data wrangling with simplicity, complete audit transparency, and speed
Stars: ✭ 16 (-68.63%)
hdfe
No description or website provided.
Stars: ✭ 22 (-56.86%)
Algorithmic-Trading
I have been deeply interested in algorithmic trading and systematic trading algorithms. This repository contains the code for what I have learnt along the way. It starts from some basic statistics and will lead up to complex machine learning algorithms.
Stars: ✭ 47 (-7.84%)
veridical-flow
Making it easier to build stable, trustworthy data-science pipelines.
Stars: ✭ 28 (-45.1%)
FairAI
This is a collection of papers and other resources related to fairness.
Stars: ✭ 55 (+7.84%)
yt-channels-DS-AI-ML-CS
A comprehensive list of 180+ YouTube Channels for Data Science, Data Engineering, Machine Learning, Deep learning, Computer Science, programming, software engineering, etc.
Stars: ✭ 1,038 (+1935.29%)
Dominando-Pandas
This repository is intended for learning the Pandas library.
Stars: ✭ 22 (-56.86%)
visions
Type System for Data Analysis in Python
Stars: ✭ 136 (+166.67%)
Metabase
The simplest, fastest way to get business intelligence and analytics to everyone in your company 😋
Stars: ✭ 26,803 (+52454.9%)
Skdata
Python tools for data analysis
Stars: ✭ 16 (-68.63%)
PracticalMachineLearning
A collection of ML-related material, including notebooks, code, and a curated list of useful resources such as books and software. Almost everything mentioned here is free (as in speech, not free food) or open source.
Stars: ✭ 60 (+17.65%)