
AnshuTrivedi / Data-Scientist-In-Python

Licence: other
This repository contains notes and projects from the Data Scientist in Python track of the Dataquest course work.




I have completed the full Data Scientist in Python track from Dataquest, with 28 real-world projects. This repository contains all the projects, the datasets used in the courses, and my notes. Step_1 to Step_8 follow the order in which the course track is completed.

Tools Used

I worked in Jupyter Notebook and used the Notepad app for course notes on a Windows laptop.
You can install Jupyter Notebook from here
The Notepad app is already available in Windows, or you can use any other app for making notes.


The repo has a separate folder for projects, where I have saved the projects according to course and step.
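To check how many notebooks sit under each step in a local clone, a small script along these lines might help. It is only a sketch: the folder name `Projects` and the assumption that each step is a top-level subfolder are hypothetical, so adjust them to the actual layout of the repo.

```python
from collections import Counter
from pathlib import Path


def count_notebooks_per_step(projects_dir):
    """Count .ipynb files under each top-level folder of projects_dir."""
    root = Path(projects_dir)
    counts = Counter()
    for notebook in root.rglob("*.ipynb"):
        # Skip the autosave copies that Jupyter keeps in .ipynb_checkpoints.
        if ".ipynb_checkpoints" in notebook.parts:
            continue
        relative = notebook.relative_to(root)
        # Assumption: the first path component is the step folder.
        step = relative.parts[0] if len(relative.parts) > 1 else "(top level)"
        counts[step] += 1
    return dict(sorted(counts.items()))


if __name__ == "__main__":
    # "Projects" is a hypothetical path; point it at the projects folder of your clone.
    for step, total in count_notebooks_per_step("Projects").items():
        print(f"{step}: {total}")
```

A step folder may hold more than one notebook per project, so the counts need not match the project totals listed in this README.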

Projects completed in Step_1

Project_1: Profitable app profiles for the App Store and Google Play markets
Project_2: Learn and install Jupyter Notebook
Project_3: Exploring Hacker News posts

Total Projects:3

Projects completed in Step_2

Project_4: Exploring eBay car sales data
Project_5: Visualizing earnings based on college majors
Project_6: Visualizing the gender gap in college degrees
Project_7: Clean and analyze employee exit survey
Project_8: Analyze high school data
Project_9: Star Wars survey

Total Projects: 6

Projects in Step_3

Step_3 has no projects.

Projects completed in Step_4

Project_10: Analyze facebook data using SQL
Project_11: Answering business questions using SQL
Project_12: API and web scraping with the Reddit API
Project_13: API and web scraping with the Reddit API
Project_14: Popular data science questions

Total Projects: 5

Projects completed in Step_5

Project_15: Investigating Fandango movie ratings
Project_16: Finding the best market to advertise in
Project_17: Mobile app for lottery addiction
Project_18: Building a spam filter with Naive Bayes

Total Projects: 5

Projects completed in Step_6

Project_20: Predicting car prices
Project_21: Predicting house sale prices
Project_22: Predicting bike rentals

Total projects: 4

Projects completed in Step_7

Project_24: Digits classification
Project_25: Credit modeling
Project_26: Getting started with Titanic survival prediction

Total projects: 3

Projects completed in Step_8

Project_27: Spark installation and Jupyter Notebook integration

Total projects: 1


This folder contains the datasets used in the courses for data analysis practice.

Datasets used in Step_1
Datasets used in Step_2
Datasets in Step_3
Step_3 has no datasets to download.
Datasets used in Step_4
Datasets used in Step_5
Datasets used in Step_6
Datasets used in Step_7
Datasets used in Step_8


Notes for all courses are available in either text or Jupyter Notebook format.
Takeaway files are in PDF format and contain very short, concise notes.


This course is more than enough for absolute beginners and is also good for intermediate data analytics practitioners.

Happy to Help:

If you have any issue understanding the notes or are struggling to grasp any topic, I am ready to offer help.
