All Projects → sengkchu → dataquest-guided-projects-solutions

sengkchu / dataquest-guided-projects-solutions

Licence: other
My dataquest project solutions

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to dataquest-guided-projects-solutions

Mlcourse.ai
Open Machine Learning Course
Stars: ✭ 7,963 (+22651.43%)
Mutual labels:  scikit-learn, pandas, data-analysis, matplotlib
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+4231.43%)
Mutual labels:  scikit-learn, pandas, data-analysis
PracticalMachineLearning
A collection of ML related stuff including notebooks, codes and a curated list of various useful resources such as books and softwares. Almost everything mentioned here is free (as speech not free food) or open-source.
Stars: ✭ 60 (+71.43%)
Mutual labels:  scikit-learn, pandas, data-analysis
PandasVersusExcel
Python数据分析入门,数据分析师入门
Stars: ✭ 120 (+242.86%)
Mutual labels:  pandas, data-analysis, matplotlib
Machine Learning Alpine
Alpine Container for Machine Learning
Stars: ✭ 30 (-14.29%)
Mutual labels:  scikit-learn, pandas, matplotlib
Ds and ml projects
Data Science & Machine Learning projects and tutorials in python from beginner to advanced level.
Stars: ✭ 56 (+60%)
Mutual labels:  scikit-learn, pandas, matplotlib
Udacity-Data-Analyst-Nanodegree
Repository for the projects needed to complete the Data Analyst Nanodegree.
Stars: ✭ 31 (-11.43%)
Mutual labels:  pandas, data-analysis, matplotlib
Zat
Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (+765.71%)
Mutual labels:  scikit-learn, pandas, data-analysis
Kagglestruggle
Kaggle Struggle
Stars: ✭ 228 (+551.43%)
Mutual labels:  scikit-learn, pandas, matplotlib
Eland
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (+571.43%)
Mutual labels:  scikit-learn, pandas, data-analysis
Artificial Intelligence Deep Learning Machine Learning Tutorials
A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep Learning / Machine Vision / NLP and industry specific areas such as Climate / Energy, Automotives, Retail, Pharma, Medicine, Healthcare, Policy, Ethics and more.
Stars: ✭ 2,966 (+8374.29%)
Mutual labels:  scikit-learn, pandas, matplotlib
Pythondatasciencehandbook
The book was written and tested with Python 3.5, though other Python versions (including Python 2.7) should work in nearly all cases.
Stars: ✭ 31,995 (+91314.29%)
Mutual labels:  scikit-learn, pandas, matplotlib
Crime Analysis
Association Rule Mining from Spatial Data for Crime Analysis
Stars: ✭ 20 (-42.86%)
Mutual labels:  scikit-learn, pandas, matplotlib
Data Science Ipython Notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+62894.29%)
Mutual labels:  scikit-learn, pandas, matplotlib
datascienv
datascienv is package that helps you to setup your environment in single line of code with all dependency and it is also include pyforest that provide single line of import all required ml libraries
Stars: ✭ 53 (+51.43%)
Mutual labels:  scikit-learn, pandas, matplotlib
Pbpython
Code, Notebooks and Examples from Practical Business Python
Stars: ✭ 1,724 (+4825.71%)
Mutual labels:  scikit-learn, pandas, data-analysis
Edaviz
edaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab
Stars: ✭ 220 (+528.57%)
Mutual labels:  pandas, data-analysis, matplotlib
machine-learning-capstone-project
This is the final project for the Udacity Machine Learning Nanodegree: Predicting article retweets and likes based on the title using Machine Learning
Stars: ✭ 28 (-20%)
Mutual labels:  scikit-learn, pandas, matplotlib
Machine Learning With Python
Practice and tutorial-style notebooks covering wide variety of machine learning techniques
Stars: ✭ 2,197 (+6177.14%)
Mutual labels:  scikit-learn, pandas, matplotlib
Algorithmic-Trading
Algorithmic trading using machine learning.
Stars: ✭ 102 (+191.43%)
Mutual labels:  scikit-learn, pandas, matplotlib

Dataquest Guided Projects

This repository is a collection of my projects from Dataquest.io.

The projects below will serve as reference notes for myself and anyone else who is interested. These projects contain lots of comments showing my thought process and what I personally learned from each project.

Project #1: Exploring US births

Concepts explored: lists, dictionaries, functions, for loops

Functions, methods, and properties used: .read(), open(), .split(), .append(), int()

Project #2: Exploring Gun Deaths in the US

Concepts explored: list comprehension, datetime module, csv module

Functions, methods, and properties used: csv.reader(), .items(), list(), datetime.datetime()

Project #3: Analyzing Thanksgiving Dinner

Concepts explored: pandas, functions, boolean filtering

Functions, methods, and properties used: .read_csv(), .pivot_table(), .replace(), .describe(), .apply(), .isnull(), .columns, .shape, .head()

Project #4: Visualizing Earnings Based On College Majors

Concepts explored: pandas, matplotlib, histograms, bar charts, scatterplots, scatter matrices

Functions, methods, and properties used: .plot(), scatter_matrix(), hist(), iloc[], .head(), .tail(), .describe()

Project #5: Visualizing The Gender Gap In College Degrees

Concepts explored: pandas, matplotlib, histograms, line plots, chart graphics

Functions, methods, and properties used: .savefig(), .text(), .axhline(), .set_yticks(), .tick_params(), .set_title(), .set_ylim(), .set_xlim(), .spines(), .tick_params()

Project #6: Analyzing NYC High School Data

Concepts explored: pandas, matplotlib.pyplot, correlations, regex, basemap, data analysis, string manipulation

Functions, methods, and properties used: .scatter(), info(), .tolist(), .groupby(), .agg(), .concat(), .apply(), .strip, .merge(), .fillna(), .corr()

Project #7: Star Wars Survey

Concepts explored: pandas, matplotlib.pyplot, data cleaning, string manipulation, bar plots

Functions, methods, and properties used: .read_csv(), .columns, notnull, map(), .dtypes, .rename, astype(), .mean(), .sum(), .xlabel(), .ylabel()

Project #8: Working with Data Downloads

Concepts explored: pandas, manipulating files with command line

Project #9: Transforming Data with Python

Concepts explored: pandas, manipulating files with command line, working with multiple python scripts, dateutil.parser

Project #10: Analyzing CIA Factbook

Python/SQL concepts explored: python+sqlite3, pandas, SQL queries, SQL subqueries, matplotlib.plyplot, seaborn, histograms

Functions, methods, and properties used: .cursor(), .read_sql_query(), .set_xlabel(), .set_xlim(), .add_subplot(), .figure()

SQL statements used: SELECT, WHERE, FROM, MIN(), MAX(), ORDER BY, AND

Project #11: Preparing data for SQLite

Python/SQL concepts explored: python+sqlite3, pandas, data cleaning, columns manipulation

Functions, methods, and properties used: .str.rstrip(), .str.split(), .connect(), .cursor(), .drop(), .str[], .map(), .value_counts()

SQL statements used: SELECT, FROM, PRAGMA

Project #12: Creating Relations in SQLite

Python/SQL concepts explored: python+sqlite3, pandas, multiple tables, foreign keys, subqueries, populating new tables

Functions, methods, and properties used: .cursor(), .connect(), .execute(), .fetchall(), .executemany()

SQL statements used: PRAGMA, LIMIT, FROM, SELECT, INNER JOIN, DROP, ALTER, VALUES

Project #13: Analyzing Movie Reviews

Concepts explored: pandas, descriptive statistics, numpy, matplotlib, scipy, correlations

Functions and methods used: .sort_values(), sci.linregress(), .hist(), .absolute(), .mean(), .median(), .absolute()

Project #14: Winning Jeopardy

Concepts explored: pandas, matplotlib, data cleaning, string manipulation, chi squared test, regex, try/except

Functions, methods, and properties used: .columns, .lower(), .sub(), .apply(), sum(), .array(), .split(), .shape, .mean(), .iterrows(), .remove(), .add(), .append()

Project #15: Predicting Car Prices

Concepts explored: Concepts explored: pandas, matplotlib, data cleaning, features engineering, k-nearest neighbors, hyperparameter tuning, RMSE

Functions and methods used: .read_csv(), .replace(), .drop(), .astype(), isnull().sum(), .min(), .max(), .mean(), .permutation(), .reindex(), .iloc[], .fit(), .predict(), mean_squared_error(), .Series(), .sort_values(), .plot(), .legend()

Project #16: Predicting House Sale Prices

Concepts explored: pandas, data cleaning, features engineering, linear regression, hyperparameter tuning, RMSE, KFold validation

Functions, methods, and properties used: .dtypes, .value_counts(), .drop, .isnull(), sum(), .fillna(), .sort_values(), . corr(), .index, .append(), .get_dummies(), .astype(), predict(), .fit(), KFold(), mean_squared_error()

Project #17: Predicting the Stock Market

Concepts explored: linear regression, mean squared error, categorical features, datetime

Functions, methods, and properties used: .read_csv(), .to_datetime(), .sort_values(), .rolling(), .apply(), .concat(), .get_dummies(), .shift(), datetime(), .fit(), .predict(), mean_squared_error()

Project #18: Predicting Bike Rentals

Concepts explored: pandas, matplotlib, features engineering, linear regression, decision trees, random forests, MSE

Functions, methods, and properties used:.hist(), .apply(), .corr(), .columns, .drop(), .sample(), .index, .floor(),.fit() .predict(), .mean_squared_error(), .append()

Project #19: Investigating Airplane Accidents

Concepts explored: Big O notation, strings, dictionaries, data parsing, try/except

Functions, methods, and properties used: range(), .append(), .split(), .values(), Counter()

Project #20: Working with Spark in Jupyter Notebook

Concepts explored: Spark

PySpark methods used: .map(), .flatMap(), .filter(), .count(), .collect(), .take()

Project #21: Working with Spark Dataframes and Spark SQL in Jupyter

Concepts explored: Spark SQL, Spark Dataframes, combining data from multiple files

Methods and functions used: .SQLContext(), .head(), .toPandas(), .show(), .select(), .hist(), .registerTempTable()

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].