All Projects β†’ dinhanhthi β†’ data-science-learning

dinhanhthi / data-science-learning

Licence: other
πŸ“Š All of courses, assignments, exercises, mini-projects and books that I've done so far in the process of learning by myself Machine Learning and Data Science.

Programming Languages

Jupyter Notebook
11667 projects
HTML
75241 projects

Projects that are alternatives of or similar to data-science-learning

PracticalMachineLearning
A collection of ML related stuff including notebooks, codes and a curated list of various useful resources such as books and softwares. Almost everything mentioned here is free (as speech not free food) or open-source.
Stars: ✭ 60 (+87.5%)
Mutual labels:  scikit-learn, kaggle, self-learning
Data Science Ipython Notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+68800%)
Mutual labels:  scikit-learn, kaggle
Machinejs
[UNMAINTAINED] Automated machine learning- just give it a data file! Check out the production-ready version of this project at ClimbsRocks/auto_ml
Stars: ✭ 412 (+1187.5%)
Mutual labels:  scikit-learn, kaggle
Ailearning
AiLearning: ζœΊε™¨ε­¦δΉ  - MachineLearning - ML、深度学习 - DeepLearning - DL、θ‡ͺ焢语言倄理 NLP
Stars: ✭ 32,316 (+100887.5%)
Mutual labels:  scikit-learn, pca
MachineLearning
Implementations of machine learning algorithm by Python 3
Stars: ✭ 16 (-50%)
Mutual labels:  scikit-learn, pca
How-to-score-0.8134-in-Titanic-Kaggle-Challenge
Solution of the Titanic Kaggle competition
Stars: ✭ 114 (+256.25%)
Mutual labels:  scikit-learn, kaggle
Hungabunga
HungaBunga: Brute-Force all sklearn models with all parameters using .fit .predict!
Stars: ✭ 614 (+1818.75%)
Mutual labels:  scikit-learn, kaggle
NLP-Specialization
NLP Specialization (Natural Language Processing) made by deeplearning.ai
Stars: ✭ 44 (+37.5%)
Mutual labels:  coursera, deeplearning-ai
Fraud Detection
Credit Card Fraud Detection using ML: IEEE style paper + Jupyter Notebook
Stars: ✭ 58 (+81.25%)
Mutual labels:  scikit-learn, kaggle
Ml code
A repository for recording the machine learning code
Stars: ✭ 75 (+134.38%)
Mutual labels:  scikit-learn, pca
Machine Learning And Reinforcement Learning In Finance
Machine Learning and Reinforcement Learning in Finance New York University Tandon School of Engineering
Stars: ✭ 173 (+440.63%)
Mutual labels:  scikit-learn, coursera
coursera-gan-specialization
Programming assignments and quizzes from all courses within the GANs specialization offered by deeplearning.ai
Stars: ✭ 277 (+765.63%)
Mutual labels:  coursera, deeplearning-ai
Robotics Coursework
πŸ€– Places where you can learn robotics (and stuff like that) online πŸ€–
Stars: ✭ 1,810 (+5556.25%)
Mutual labels:  coursera, self-learning
IBM-final-project-Machine-Learning
Final project of IBM's course https://www.coursera.org/learn/machine-learning-with-python on coursera
Stars: ✭ 33 (+3.13%)
Mutual labels:  scikit-learn, coursera
coursera-ai-for-medicine-specialization
Programming assignments, labs and quizzes from all courses in the Coursera AI for Medicine Specialization offered by deeplearning.ai
Stars: ✭ 80 (+150%)
Mutual labels:  coursera, deeplearning-ai
Prince
πŸ‘‘ Python factor analysis library (PCA, CA, MCA, MFA, FAMD)
Stars: ✭ 591 (+1746.88%)
Mutual labels:  scikit-learn, pca
coursera-machinelearning
Stanford University - Machine Learning by Andrew Ng
Stars: ✭ 82 (+156.25%)
Mutual labels:  coursera, deeplearning-ai
Machinelearningcourse
A collection of notebooks of my Machine Learning class written in python 3
Stars: ✭ 35 (+9.38%)
Mutual labels:  scikit-learn, kaggle
Artificial Intelligence Deep Learning Machine Learning Tutorials
A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep Learning / Machine Vision / NLP and industry specific areas such as Climate / Energy, Automotives, Retail, Pharma, Medicine, Healthcare, Policy, Ethics and more.
Stars: ✭ 2,966 (+9168.75%)
Mutual labels:  scikit-learn, kaggle
kaggledatasets
Collection of Kaggle Datasets ready to use for Everyone (Looking for contributors)
Stars: ✭ 44 (+37.5%)
Mutual labels:  scikit-learn, kaggle

πŸ“Š data-science-learning

The list of things I've finished so far on the way of learning by myself Machine Learning and Data Science.

πŸ”₯ Projects

  • Setting up a cafΓ© in Ho Chi Minh City β€” find a best place to setting up a new business β€” article β€” source.
  • Titanic: Machine Learning from Disaster (from Kaggle) β€” predicts which passengers survived the Titanic shipwreck β€” source.

I also do some mini-projects for understanding the concepts. You can find the html files (exported from the corresponding Jupyter Notebook files) and "Open in Colab" files for below mini projects here.

🎲 Tasks

  • Anomaly Detection. β€” my note
  • Data Aggregation β€” my note
  • Data Overview. β€” my note
  • Data Visualization.
  • Model evaluation.
  • Preprocessing (texts, images, dates & times, structured data). β€” my note
  • Testing. β€” my note
  • Web Scraping.

🐍 Programming Languages

  • GraphQL β€” an open-source data query and manipulation language for APIs, and a runtime for fulfilling queries with existing data.
  • Python β€” an interpreted, high-level, general-purpose programming language β€” my note.
  • R β€” a programming language and free software environment for statistical computing and graphics supported by the R Foundation for Statistical Computing.
  • Scala β€” a general-purpose programming language providing support for functional programming and a strong static type system.
  • SQL β€” a domain-specific language used in programming and designed for managing data held in a relational database management system, or for stream processing in a relational data stream management system.

βš™οΈ Frameworks & Platforms

  • Apache Airflow β€” my note
  • Docker β€” a set of platform as a service products that use OS-level virtualization to deliver software in packages called containers β€” my note
  • Google Colab β€” a free cloud service, based on Jupyter Notebooks for machine-learning education and research β€” my note.
  • Google Kubernetes
  • Hadoop β€” a collection of open-source software utilities that facilitate using a network of many computers to solve problems involving massive amounts of data and computation.
  • Kaggle β€” an online community of data scientists and machine learners, owned by Google.
  • PostgreSQL (Postgres) β€” a free and open-source relational database management system emphasizing extensibility and technical standards compliance.
  • Spark β€” an open-source distributed general-purpose cluster-computing framework.

βš’οΈ Tools

  • Bash β€” my note
  • Git β€” a distributed version-control system for tracking changes in source code during software development β€” my note.
  • Markdown β€” a lightweight markup language with plain text formatting syntax β€” my note.
  • Jupyter Notebook β€” an open-source web application that allows you to create and share documents that contain live code, equations, visualizations and narrative text β€” my note.
  • Trello β€” a web-based Kanban-style list-making application.

πŸ“š Libraries & Frameworks

The "ticked" libraries don't mean that I've known/understand whole of them (but I can easily use them with their documentation)!

  • D3js β€” a JavaScript library for producing dynamic, interactive data visualizations in web browsers.
  • Keras β€” an open-source neural-network library written in Python.
  • Matplotlib β€” a plotting library for the Python programming language and its numerical mathematics extension NumPy. β€” my note
  • Numpy β€” a library for the Python programming language, adding support for large, multi-dimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays. β€” my note
  • OpenCV β€” a library of programming functions mainly aimed at real-time computer vision.
  • Pandas β€” a software library written for the Python programming language for data manipulation and analysis. -- my note
  • Plotly -- the front-end for ML and data science models.
  • PyTorch -- my note
  • Seaborn β€” a Python data visualization library based on matplotlib.
  • Scikit-learn β€” a free software machine learning library for the Python programming language.
  • TensorFlow β€” a free and open-source software library for dataflow and differentiable programming across a range of tasks.

πŸ‘¨β€πŸ« Courses

The "non-checked" courses are under the way to be finished!

πŸ“– Books

The "non-checked" books are under the way to be finished!

πŸ€– Github's repositories

🌏 Other resources


The descriptions of terms in this site are borrowed from Wikipedia.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].