shenxiangzhuang / Pythondataanalysis

The data and code used in my book.



Introduction to Python Data Analysis: From Data Acquisition to Visualization (Python数据分析入门:从数据获取到可视化)

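Overview

This repository contains all the source code, data, and other files used in the book. Any news about the book will also be announced here first. I hope the book is helpful to you.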

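Submitting Issues

If you have questions or suggestions, you can open an issue in this project (recommended) or email me ([email protected]). I check regularly and reply as soon as I can. Some readers submit errata through the publisher, which is also fine, but I suggest reporting only typos there. For anything involving code, opening an issue on GitHub is more convenient.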

Errata

Corrected:

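Page | Error | Correction
201 | In the first shaded box at the top (training set data), change "bumpy" to "orange" in the last two rows of the "种类" (type) column | Corrected in the second printing
202 | Line 3: change "是橙子还是水果" ("an orange or a fruit") to "是橙子还是苹果" ("an orange or an apple") | Corrected in the second printing
99 | Swap the last two lines of the code box (because the worker threads empty urls) | Corrected in the sixth printing
115 | The output shown below "运行输出如下。" ("The output is as follows.") on line 3 of the body text is wrong; the data shown there has to be created manually | Corrected in the sixth printing
245 | Code box: add import random as rnd at the very top | Corrected in the sixth printing
247, 248 | The objective functions of the two LP problems are missing; see the blog for the correction | Corrected in the sixth printing
71-73 | The Douban simulated login raises an error | Corrected in the sixth printing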
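The page 99 erratum comes down to statement order: once worker threads consume a shared list, anything read from that list afterwards sees it empty. The sketch below is not the book's code. The `urls` contents, the `worker` function, and the `fetched` list are hypothetical stand-ins, used only to illustrate why a statement that depends on `urls` should run before the threads do.

```python
import threading

# Hypothetical stand-in for the book's URL list (the real code is not reproduced here).
urls = [f"https://example.com/page/{i}" for i in range(5)]
lock = threading.Lock()
fetched = []


def worker():
    # Each thread pops URLs off the shared list until none remain.
    while True:
        with lock:
            if not urls:
                return
            url = urls.pop()
            fetched.append(url)  # placeholder for the actual download


# Read anything that depends on urls BEFORE the threads consume it.
total = len(urls)

threads = [threading.Thread(target=worker) for _ in range(3)]
for t in threads:
    t.start()
for t in threads:
    t.join()

# After the threads finish, the shared list has been emptied.
print(total, len(urls), len(fetched))
```

If `total = len(urls)` were moved below the `join()` loop, it would report 0 instead of the original count, which is the kind of ordering problem the erratum describes.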

Pending correction:

Page | Error | Correction

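Request for Feedback

In my view, publishing a book is by no means the end; it is the start of a new round. My original motivation for writing this book was that, at the time, few books in China covered data scraping, data processing, analysis, and visualization together. I thought that was worth attempting, and this book is the result.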

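In the year or so since publication, the book has been continuously improved based on feedback from many sources. Over the same period I have also become aware of its problems. The core one is the tension between depth and breadth: the book aims for breadth, so it is somewhat lacking in depth. Going forward I will consider trimming the content so that it still covers a broad range while highlighting the key topics (algorithms such as statistical methods and machine learning methods).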

In addition, if I get the chance to write a second edition, I will present the core content as Jupyter notebooks to illustrate the points more clearly.

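As mentioned above, there has been some feedback, but not much. I hope that you, as readers, will send me your suggestions after finishing the book, so that I can better decide the direction of future revisions.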
