toucan-connectors: Connectors available to retrieve data in Toucan Toco small apps
Stars: ✭ 13 (-35%)
Data-Wrangling-with-Python: Simplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices
Stars: ✭ 90 (+350%)
pandas twitter: Analyzing Trump's tweets using Python (Pandas + Twitter workshop)
Stars: ✭ 81 (+305%)
ydata-quality: Data Quality assessment with one line of code
Stars: ✭ 311 (+1455%)
trackanimation: Track Animation is a Python 2 and 3 library that provides an easy, user-adjustable way to create visualizations from GPS data.
Stars: ✭ 74 (+270%)
datasets: 🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+69250%)
saddle: SADDLE (Scala Data Library)
Stars: ✭ 23 (+15%)
jcasts: Simple podcast MVP
Stars: ✭ 27 (+35%)
fal: do more with dbt. fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.
Stars: ✭ 567 (+2735%)
five-minute-midas: Predicting Profitable Day Trading Positions using Decision Tree Classifiers. scikit-learn | Flask | SQLite3 | pandas | MLflow | Heroku | Streamlit
Stars: ✭ 41 (+105%)
hamilton: A scalable general-purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, Python objects, ML models, etc.
Stars: ✭ 612 (+2960%)
obsplus: A Pandas-Centric ObsPy Expansion Pack
Stars: ✭ 28 (+40%)
pyjanitor: Clean APIs for data cleaning. Python implementation of R package Janitor
Stars: ✭ 970 (+4750%)
Information-Retrieval: Information Retrieval algorithms developed in Python. To follow the blog posts, click on the link:
Stars: ✭ 103 (+415%)
grailer: web scraping tool for grailed.com
Stars: ✭ 30 (+50%)
Python-Data-Visualization: D-Lab's 3-hour introduction to data visualization with Python. Learn how to create histograms, bar plots, box plots, scatter plots, compound figures, and more, using matplotlib and seaborn.
Stars: ✭ 42 (+110%)
tempo: API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc.), AS OF joins, downsampling, and interpolation
Stars: ✭ 212 (+960%)
wax-ml: A Python library for machine-learning and feedback loops on streaming data
Stars: ✭ 36 (+80%)
xpandas: Universal 1d/2d data containers with Transformers functionality for data analysis.
Stars: ✭ 25 (+25%)
mune: Simple stock price analytics
Stars: ✭ 14 (-30%)
weaverbird: A visual data pipeline builder with various backends
Stars: ✭ 65 (+225%)
pantab: Read/Write pandas DataFrames with Tableau Hyper Extracts
Stars: ✭ 64 (+220%)
online-course-recommendation-system: Built on results fetched from Pluralsight's course API. Works with a model trained with the K-means unsupervised clustering algorithm.
Stars: ✭ 31 (+55%)
pybacen: This library was developed for economic analysis in the Brazilian context (investments, micro and macroeconomic indicators)
Stars: ✭ 40 (+100%)
pandas-workshop: An introductory workshop on pandas with notebooks and exercises for following along.
Stars: ✭ 161 (+705%)
ml-workflow-automation: Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deployment as a RESTful service on Kubernetes.
Stars: ✭ 44 (+120%)
chatstats: 💬📊 Fun data visualizations for Facebook Messenger chats
Stars: ✭ 18 (-10%)
Data-Science-101: Notes and tutorials on how to use Python, pandas, seaborn, NumPy, matplotlib, and SciPy for data science.
Stars: ✭ 19 (-5%)
datar: A Grammar of Data Manipulation in Python
Stars: ✭ 142 (+610%)
datascienv: datascienv is a package that helps you set up your environment in a single line of code with all dependencies. It also includes pyforest, which imports all required ML libraries in a single line.
Stars: ✭ 53 (+165%)
vulkn: Love your Data. Love the Environment. Love VULKИ.
Stars: ✭ 43 (+115%)
Datscan: DatScan is an initiative to build an open-source CMS capable of solving any problem using data analysis, with the help of various modules and a vast standardized module library.
Stars: ✭ 13 (-35%)
pytd: Treasure Data Driver for Python
Stars: ✭ 15 (-25%)
gw2raidar: A log parsing website for Guild Wars 2 combat logs
Stars: ✭ 19 (-5%)
Engezny: Engezny is a Python package that quickly generates all possible charts from your dataframe and saves them for you. It currently supports only uni-parameter visualization using pie, bar, and barh charts.
Stars: ✭ 25 (+25%)
tutorials: Short programming tutorials pertaining to data analysis.
Stars: ✭ 14 (-30%)
covid-19: Data ETL & Analysis on the global and Mexican datasets of the COVID-19 pandemic.
Stars: ✭ 14 (-30%)
whyqd: data wrangling simplicity, complete audit transparency, and at speed
Stars: ✭ 16 (-20%)
DataProfiler: What's in your data? Extract schema, statistics and entities from datasets
Stars: ✭ 843 (+4115%)
PracticalMachineLearning: A collection of ML-related material, including notebooks, code, and a curated list of useful resources such as books and software. Almost everything mentioned here is free (as in speech, not as in free food) or open source.
Stars: ✭ 60 (+200%)
jupyter-django: Using Jupyter Notebook with Django: a presentation
Stars: ✭ 42 (+110%)
cognipy: In-memory Graph Database and Knowledge Graph with Natural Language Interface, compatible with Pandas
Stars: ✭ 31 (+55%)
bcpandas: High-level wrapper around BCP for high-performance data transfers between pandas and SQL Server. No knowledge of BCP required!
Stars: ✭ 69 (+245%)
tsa-tutorial: Material for the tutorial "Time series analysis with pandas" at T-Academy
Stars: ✭ 21 (+5%)
machine-learning-capstone-project: This is the final project for the Udacity Machine Learning Nanodegree: Predicting article retweets and likes based on the title using Machine Learning
Stars: ✭ 28 (+40%)
DS-Cookbook101: A Jupyter notebook with the most frequently used code snippets for daily data science operations
Stars: ✭ 59 (+195%)
onelinerhub: 2.5k code solutions with clear explanations @ onelinerhub.com
Stars: ✭ 645 (+3125%)