All Projects → carpentries-incubator → machine-learning-novice-sklearn

carpentries-incubator / machine-learning-novice-sklearn

Licence: other
A Carpentry style lesson on machine learning with Python and scikit-learn.

Programming Languages

python
139335 projects - #7 most used programming language
HTML
75241 projects
SCSS
7915 projects
Makefile
30231 projects
r
7636 projects
CSS
56736 projects

Projects that are alternatives of or similar to machine-learning-novice-sklearn

introduction-to-conda-for-data-scientists
Introduction to Conda for (Data) Scientists
Stars: ✭ 35 (+59.09%)
Mutual labels:  english, lesson, alpha, carpentries-incubator
hpc-intro
Lesson materials for an Introduction to High Performance Computing in the tradition of Software Carpentry
Stars: ✭ 123 (+459.09%)
Mutual labels:  english, lesson, alpha, carpentries-incubator
python-packaging-publishing
Packaging and Publishing with Python
Stars: ✭ 24 (+9.09%)
Mutual labels:  english, lesson, carpentries-incubator
docker-introduction
Reproducible Computational Environments using Containers
Stars: ✭ 34 (+54.55%)
Mutual labels:  english, lesson, carpentries-incubator
geospatial-python
Introduction to Geospatial Raster and Vector Data with Python
Stars: ✭ 107 (+386.36%)
Mutual labels:  english, lesson, carpentries-incubator
deep-learning-intro
Learn Deep Learning with Python
Stars: ✭ 23 (+4.55%)
Mutual labels:  english, lesson, carpentries-incubator
Python Aos Lesson
Python for Atmosphere and Ocean Scientists
Stars: ✭ 49 (+122.73%)
Mutual labels:  english, lesson, alpha
shell-extras
Extra Unix Shell Material
Stars: ✭ 22 (+0%)
Mutual labels:  english, lesson, carpentries-incubator
sql-ecology-lesson
Data Management with SQL for Ecologists
Stars: ✭ 37 (+68.18%)
Mutual labels:  english, lesson
r-novice-inflammation
Programming with R
Stars: ✭ 142 (+545.45%)
Mutual labels:  english, lesson
sql-novice-survey
Databases and SQL
Stars: ✭ 59 (+168.18%)
Mutual labels:  english, lesson
rr-organization1
The Organization lesson for the Reproducible Science Curriculum
Stars: ✭ 36 (+63.64%)
Mutual labels:  english, lesson
Shell Novice
The Unix Shell
Stars: ✭ 234 (+963.64%)
Mutual labels:  english, lesson
matlab-novice-inflammation
Programming with MATLAB
Stars: ✭ 26 (+18.18%)
Mutual labels:  english, lesson
shell-genomics
Introduction to the Command Line for Genomics
Stars: ✭ 54 (+145.45%)
Mutual labels:  english, lesson
OpenRefine-ecology-lesson
Data Cleaning with OpenRefine for Ecologists
Stars: ✭ 20 (-9.09%)
Mutual labels:  english, lesson
lc-wikidata
Library Carpentry Wikidata
Stars: ✭ 17 (-22.73%)
Mutual labels:  english, lesson
lc-data-intro
Library Carpentry: Introduction to Working with Data (Regular Expressions)
Stars: ✭ 16 (-27.27%)
Mutual labels:  english, lesson
wrangling-genomics
Data Wrangling and Processing for Genomics
Stars: ✭ 49 (+122.73%)
Mutual labels:  english, lesson
python-aos-lesson
Python for Atmosphere and Ocean Scientists
Stars: ✭ 78 (+254.55%)
Mutual labels:  english, lesson

Introduction to Machine Learning with Scikit Learn and Python

Create a Slack Account with us

This repository generates the corresponding lesson website from The Carpentries repertoire of lessons.

Contributing

We welcome all contributions to improve the lesson! Maintainers will do their best to help you if you have any questions, concerns, or experience any difficulties along the way.

We'd like to ask you to familiarize yourself with our Contribution Guide and have a look at the more detailed guidelines on proper formatting, ways to render the lesson locally, and even how to write new episodes.

Please see the current list of issues for ideas for contributing to this repository. For making your contribution, we use the GitHub flow, which is nicely explained in the chapter Contributing to a Project in Pro Git by Scott Chacon. Look for the tag good_first_issue. This indicates that the mantainers will welcome a pull request fixing this issue.

Maintainer(s)

Current maintainers of this lesson are:

Outline

As determined by the attendees of CarpentryConnect Manchester 2019, the proposed outline of this lesson is as follows:

Unsupervised Learning

I. Clustering

1. Kmeans

II. Dimesionality Reduction

1. PCA
2. TSNE

Supervised Learning

All models, objectives:

  • What it is;
  • when to use it and on what type of data;
  • how to evaluate the fit, over/underfitting;
  • computational complexity

I. Regression

1. Linear
2. Polynomial
  • Overfitting/underfitting
  • Test sets (how and why)

II. Classification

1. Logistic regression
  • Over/underfitting can happen in regression too
  • Accuracy
  • Confusion Matrix
  • Precision
  • Recall
2. Random Forest
3. Neural Networks
  • Evaluation
  • Cross Validation

Ethics

Authors

A list of contributors to the lesson can be found in AUTHORS

Citation

To cite this lesson, please consult with CITATION

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].