Data Analysis主要是爬虫与数据分析项目总结,外加建模与机器学习,模型的评估。
Stars: ✭ 142 (+238.1%)
pandas-workshopAn introductory workshop on pandas with notebooks and exercises for following along.
Stars: ✭ 161 (+283.33%)
Data-Wrangling-with-PythonSimplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices
Stars: ✭ 90 (+114.29%)
DatscanDatScan is an initiative to build an open-source CMS that will have the capability to solve any problem using data Analysis just with the help of various modules and a vast standardized module library
Stars: ✭ 13 (-69.05%)
Lambda PacksPrecompiled packages for AWS Lambda
Stars: ✭ 997 (+2273.81%)
Awkward 1.0Manipulate JSON-like data with NumPy-like idioms.
Stars: ✭ 203 (+383.33%)
Pandas ProfilingCreate HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+19730.95%)
Pyda 2e Zh📖 [译] 利用 Python 进行数据分析 · 第 2 版
Stars: ✭ 866 (+1961.9%)
visionsType System for Data Analysis in Python
Stars: ✭ 136 (+223.81%)
100 Pandas Puzzles100 data puzzles for pandas, ranging from short and simple to super tricky (60% complete)
Stars: ✭ 1,382 (+3190.48%)
Ml CheatsheetA constantly updated python machine learning cheatsheet
Stars: ✭ 136 (+223.81%)
Ai Learn人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
Stars: ✭ 4,387 (+10345.24%)
Data Forge JsJavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 139 (+230.95%)
ml-workflow-automationPython Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deployment as a RESTful service on Kubernetes.
Stars: ✭ 44 (+4.76%)
whyqddata wrangling simplicity, complete audit transparency, and at speed
Stars: ✭ 16 (-61.9%)
Data Science HacksData Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (+550%)
Seaborn TutorialThis repository is my attempt to help Data Science aspirants gain necessary Data Visualization skills required to progress in their career. It includes all the types of plot offered by Seaborn, applied on random datasets.
Stars: ✭ 114 (+171.43%)
Mlcourse.aiOpen Machine Learning Course
Stars: ✭ 7,963 (+18859.52%)
fairlensIdentify bias and measure fairness of your data
Stars: ✭ 51 (+21.43%)
Stats Maths With PythonGeneral statistics, mathematical programming, and numerical/scientific computing scripts and notebooks in Python
Stars: ✭ 381 (+807.14%)
Algorithmic-TradingI have been deeply interested in algorithmic trading and systematic trading algorithms. This Repository contains the code of what I have learnt on the way. It starts form some basic simple statistics and will lead up to complex machine learning algorithms.
Stars: ✭ 47 (+11.9%)
Data Forge TsThe JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 967 (+2202.38%)
Data-Science-101Notes and tutorials on how to use python, pandas, seaborn, numpy, matplotlib, scipy for data science.
Stars: ✭ 19 (-54.76%)
Tensorflow Ml Nlp텐서플로우와 머신러닝으로 시작하는 자연어처리(로지스틱회귀부터 트랜스포머 챗봇까지)
Stars: ✭ 176 (+319.05%)
SweetvizVisualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (+4307.14%)
Machine Learning With PythonPractice and tutorial-style notebooks covering wide variety of machine learning techniques
Stars: ✭ 2,197 (+5130.95%)
tempoAPI for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation
Stars: ✭ 212 (+404.76%)
covid-19Data ETL & Analysis on the global and Mexican datasets of the COVID-19 pandemic.
Stars: ✭ 14 (-66.67%)
Dominando-PandasEste repositório está destinado ao processo de aprendizagem da biblioteca Pandas.
Stars: ✭ 22 (-47.62%)
CC33ZCurso de Ciência da Computação
Stars: ✭ 50 (+19.05%)
EngeznyEngezny is a python package that quickly generates all possible charts from your dataframe and saves them for you, and engezny is only supporting now uni-parameter visualization using the pie, bar and barh visualizations.
Stars: ✭ 25 (-40.48%)
Python-campNo description or website provided.
Stars: ✭ 34 (-19.05%)
datascienvdatascienv is package that helps you to setup your environment in single line of code with all dependency and it is also include pyforest that provide single line of import all required ml libraries
Stars: ✭ 53 (+26.19%)
Machine-LearningThis repository contains notebooks that will help you in understanding basic ML algorithms as well as basic numpy excercise. 💥 🌈 🌈
Stars: ✭ 15 (-64.29%)
hamiltonA scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+1357.14%)
onelinerhub2.5k code solutions with clear explanation @ onelinerhub.com
Stars: ✭ 645 (+1435.71%)
xpandasUniversal 1d/2d data containers with Transformers functionality for data analysis.
Stars: ✭ 25 (-40.48%)
DataProfilerWhat's in your data? Extract schema, statistics and entities from datasets
Stars: ✭ 843 (+1907.14%)
tutorialsShort programming tutorials pertaining to data analysis.
Stars: ✭ 14 (-66.67%)
saddleSADDLE: Scala Data Library
Stars: ✭ 23 (-45.24%)
datasets🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+32923.81%)
Chapter-2Code examples for Chapter 2 of Data Wrangling with JavaScript
Stars: ✭ 16 (-61.9%)
neworderA dynamic microsimulation framework for python
Stars: ✭ 15 (-64.29%)
PracticalMachineLearningA collection of ML related stuff including notebooks, codes and a curated list of various useful resources such as books and softwares. Almost everything mentioned here is free (as speech not free food) or open-source.
Stars: ✭ 60 (+42.86%)
online-course-recommendation-systemBuilt on data from Pluralsight's course API fetched results. Works with model trained with K-means unsupervised clustering algorithm.
Stars: ✭ 31 (-26.19%)
pantabRead/Write pandas DataFrames with Tableau Hyper Extracts
Stars: ✭ 64 (+52.38%)
valinvestA value investing tool based on Warren Buffett, Joseph Piotroski and Benjamin Graham thoughts
Stars: ✭ 84 (+100%)
DataSciPyData Science with Python
Stars: ✭ 15 (-64.29%)