All Projects → CICIFLY → Data-Analytics-Projects

CICIFLY / Data-Analytics-Projects

Licence: other
This repository contains the projects related to data collecting, assessing,cleaning,visualizations and analyzing

Programming Languages

Jupyter Notebook
11667 projects
HTML
75241 projects

Projects that are alternatives of or similar to Data-Analytics-Projects

Mlcourse.ai
Open Machine Learning Course
Stars: ✭ 7,963 (+4668.26%)
Mutual labels:  seaborn, matplotlib
covid-19
Data ETL & Analysis on the global and Mexican datasets of the COVID-19 pandemic.
Stars: ✭ 14 (-91.62%)
Mutual labels:  seaborn, matplotlib
heatmaps
Better heatmaps in Python
Stars: ✭ 117 (-29.94%)
Mutual labels:  seaborn, matplotlib
Udacity-Data-Analyst-Nanodegree
Repository for the projects needed to complete the Data Analyst Nanodegree.
Stars: ✭ 31 (-81.44%)
Mutual labels:  seaborn, matplotlib
Exploratory Data Analysis Visualization Python
Data analysis and visualization with PyData ecosystem: Pandas, Matplotlib Numpy, and Seaborn
Stars: ✭ 78 (-53.29%)
Mutual labels:  seaborn, matplotlib
Github-Stars-Predictor
It's a github repo star predictor that tries to predict the stars of any github repository having greater than 100 stars.
Stars: ✭ 34 (-79.64%)
Mutual labels:  seaborn, matplotlib
Ai Learn
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
Stars: ✭ 4,387 (+2526.95%)
Mutual labels:  seaborn, matplotlib
Machine-Learning-with-Iris-Dataset
Data Visualization and Machine Learning with Iris Dataset.
Stars: ✭ 65 (-61.08%)
Mutual labels:  seaborn, matplotlib
astetik
Astetik takes away the pain from telling visual stories with data on Python
Stars: ✭ 15 (-91.02%)
Mutual labels:  seaborn, matplotlib
Python-Data-Visualization
D-Lab's 3 hour introduction to data visualization with Python. Learn how to create histograms, bar plots, box plots, scatter plots, compound figures, and more, using matplotlib and seaborn.
Stars: ✭ 42 (-74.85%)
Mutual labels:  seaborn, matplotlib
Python-Course
Python Basics, Machine Learning and Deep Learning
Stars: ✭ 50 (-70.06%)
Mutual labels:  seaborn, matplotlib
data-analysis-using-python
Data Analysis Using Python: A Beginner’s Guide Featuring NYC Open Data
Stars: ✭ 81 (-51.5%)
Mutual labels:  seaborn, matplotlib
datascienv
datascienv is package that helps you to setup your environment in single line of code with all dependency and it is also include pyforest that provide single line of import all required ml libraries
Stars: ✭ 53 (-68.26%)
Mutual labels:  seaborn, matplotlib
alvito
Alvito - An Algorithm Visualization Tool for Python
Stars: ✭ 52 (-68.86%)
Mutual labels:  seaborn, matplotlib
The-Data-Visualization-Workshop
A New, Interactive Approach to Learning Data Visualization
Stars: ✭ 59 (-64.67%)
Mutual labels:  seaborn, matplotlib
pyplot-fortran
For generating plots from Fortran using Python's matplotlib.pyplot 📈
Stars: ✭ 158 (-5.39%)
Mutual labels:  matplotlib
ml-book
Codice sorgente ed Errata Corrige del mio libro "A tu per tu col Machine Learning"
Stars: ✭ 16 (-90.42%)
Mutual labels:  matplotlib
matplotnim
A Nim wrapper for matplotlib
Stars: ✭ 22 (-86.83%)
Mutual labels:  matplotlib
tsp-essay
A fun study of some heuristics for the Travelling Salesman Problem.
Stars: ✭ 15 (-91.02%)
Mutual labels:  matplotlib
publib
Produce publication-level quality images on top of Matplotlib
Stars: ✭ 34 (-79.64%)
Mutual labels:  matplotlib

Data-Analytics-Projects:

Certificate : https://graduation.udacity.com/confirm/KUM3F4AJ

This repository is mainly for projects I have done under Udacity-Data-Analysis-Nanodegree.

Udacity online data analyst program prepares me for a career as a data analyst by helping me learn to clean and organize data, uncover patterns and insights, draw meaningful conclusions, and clearly communicate critical findings. I am developing proficiency in Python and its data analysis libraries (Numpy, pandas, Matplotlib) and SQL as I build a portfolio of projects .

Tips: For data science projects with python, I would recomend you to install numpy , pandas , scipy , scikit learn , matplotlib , seaborn thest basic libraries.

Part 1 - Intro to Data Analysis

Subjects Covered:

  • Anaconda: Learn to use Anaconda to manage packages and environments for use with Python
  • Jupyter Notebook: Learn to use this open-source web application
  • Data Analysis Process
  • NumPy for 1 and 2D Data
  • Pandas Series and Dataframes

Project 1: Explore Weather Trends with weather forecast data

In this project, I choose one of Udacity's curated datasets and investigate it using NumPy and pandas. I complete the entire data analysis process, starting by posing a question and finishing by sharing the findings. ( It may be better to place this section inside the readme of the project 1)

Project 2: Investigate a dataset called TMDb movie data.

I was provided a dataset reflecting data collected from an experiment. I used statistical techniques to answer questions about the data and report my conclusions and recommendations in a report.

Part 2 -Practical Statistics

Subjects Covered:

  • Probability
  • Conditional Probability
  • Binominal Distribution
  • Sampling Distribution and Central Limit Theorem
  • Descriptive Statistics
  • Inferential Statistics
  • Confidence Levels and Intervals
  • Hypothesis Testing
  • T-tests and A/B test
  • Regression
  • Multiple Linear Regression
  • Logistic Regression

Project 3: Analyze A/B Test Results with company ab_data.csv

Using Python, I gathered data from a variety of sources, assess its quality and tidiness, then clean it. I documented the wrangling efforts in a Jupyter Notebook, plus showcase them through analyses and visualizations using Python and SQL.By using AB Testing and regression methods to decide if the company should launch a new webpage or keep the old one.

Part 3 - Data Extraction and Wrangling

Subjects Covered:

  • GATHERING DATA:
    • Gather data from multiple sources, including gathering files, programmatically downloading files, web-scraping data, and accessing data from APIs
    • Import data of various file formats into pandas, including flat files (e.g. TSV), HTML files, TXT files, and JSON files
    • Store gathered data in a PostgreSQL database
  • ASSESSING DATA
    • Assess data visually and programmatically using pandas
    • Distinguish between dirty data (content or “quality” issues) and messy data (structural or “tidiness” issues)
    • Identify data quality issues and categorize them using metrics: validity, accuracy, completeness, consistency, and uniformity
  • CLEANING DATA
    • Identify each step of the data cleaning process (defining, coding,and testing)
    • Clean data using Python and pandas
    • Test cleaning code visually and programmatically using Python

Project 4 : Data Wrangle and Analyze with Tweet WeRateDogs data

Collect data from different sources and assess data visually and programmatically , clean data for visulizing data and finding insights later.

Part 4 - Data Visualisation

Subjects Covered:

  • Univariate exploration of data ( histogram , bar charts , Use axis limits and different scales )
  • Bivariate exploration of data ( scatter plots , clustered bar charts , violin and bar charts , faceting )
  • Multivariate exploration of data ( encodings , plot matrices , feature enginnering )
  • Explanatory Visulizations ( story telling with data , polish plots , create slide deck )

Project 5: Data Visulization with Diamond Data

Data visualization to a dataset involving the characteristics of diamonds and their prices.

Project 6: Communicate data finding with Ford Go Bike Sharing Data

In this project, I used Python’s data visualization tools to systematically explore the bike dataset for its properties and relationships between variables. Then, I created a presentation that communicates the findings to others.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].