All Projects → gabrieltseng → Datascience Projects

gabrieltseng / Datascience Projects

A collection of personal data science projects

Projects that are alternatives of or similar to Datascience Projects

Machine Learning From Scratch
Succinct Machine Learning algorithm implementations from scratch in Python, solving real-world problems (Notebooks and Book). Examples of Logistic Regression, Linear Regression, Decision Trees, K-means clustering, Sentiment Analysis, Recommender Systems, Neural Networks and Reinforcement Learning.
Stars: ✭ 42 (-26.32%)
Mutual labels:  jupyter-notebook, data-science
Presentations
Talks & Workshops by the CODAIT team
Stars: ✭ 50 (-12.28%)
Mutual labels:  jupyter-notebook, data-science
Computervision Recipes
Best Practices, code samples, and documentation for Computer Vision.
Stars: ✭ 8,214 (+14310.53%)
Mutual labels:  jupyter-notebook, data-science
Optimus
🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+1629.82%)
Mutual labels:  jupyter-notebook, data-science
Data Privacy For Data Scientists
A workshop on data privacy methods for data scientists.
Stars: ✭ 53 (-7.02%)
Mutual labels:  jupyter-notebook, data-science
Pixiedust
Python Helper library for Jupyter Notebooks
Stars: ✭ 998 (+1650.88%)
Mutual labels:  jupyter-notebook, data-science
Mckinsey Smartcities Traffic Prediction
Adventure into using multi attention recurrent neural networks for time-series (city traffic) for the 2017-11-18 McKinsey IronMan (24h non-stop) prediction challenge
Stars: ✭ 49 (-14.04%)
Mutual labels:  jupyter-notebook, data-science
Python Training
Python training for business analysts and traders
Stars: ✭ 972 (+1605.26%)
Mutual labels:  jupyter-notebook, data-science
25daysinmachinelearning
I will update this repository to learn Machine learning with python with statistics content and materials
Stars: ✭ 53 (-7.02%)
Mutual labels:  jupyter-notebook, data-science
Ppd599
USC urban data science course series with Python and Jupyter
Stars: ✭ 1,062 (+1763.16%)
Mutual labels:  jupyter-notebook, data-science
Minerva Training Materials
Learn advanced data science on real-life, curated problems
Stars: ✭ 37 (-35.09%)
Mutual labels:  jupyter-notebook, data-science
Mri Analysis Pytorch
MRI analysis using PyTorch and MedicalTorch
Stars: ✭ 55 (-3.51%)
Mutual labels:  jupyter-notebook, data-science
Mlj.jl
A Julia machine learning framework
Stars: ✭ 982 (+1622.81%)
Mutual labels:  jupyter-notebook, data-science
Ds Take Home
My solution to the book A Collection of Data Science Take-Home Challenges
Stars: ✭ 1,004 (+1661.4%)
Mutual labels:  jupyter-notebook, data-science
Machinelearningcourse
A collection of notebooks of my Machine Learning class written in python 3
Stars: ✭ 35 (-38.6%)
Mutual labels:  jupyter-notebook, data-science
Data Science Lunch And Learn
Resources for weekly Data Science Lunch & Learns
Stars: ✭ 49 (-14.04%)
Mutual labels:  jupyter-notebook, data-science
Python for ml
brief introduction to Python for machine learning
Stars: ✭ 29 (-49.12%)
Mutual labels:  jupyter-notebook, data-science
Docker Iocaml Datascience
Dockerfile of Jupyter (IPython notebook) and IOCaml (OCaml kernel) with libraries for data science and machine learning
Stars: ✭ 30 (-47.37%)
Mutual labels:  jupyter-notebook, data-science
Numerical Linear Algebra
Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course
Stars: ✭ 8,263 (+14396.49%)
Mutual labels:  jupyter-notebook, data-science
Pythondataanalysis
The data and code that used in my book.
Stars: ✭ 54 (-5.26%)
Mutual labels:  jupyter-notebook, data-science

Learning Data Science

In this repository, I'll keep the code I write as I learn about Data Science.

I write about what I am learning here: https://medium.com/@gabrieltseng/

For all notebooks which require a GPU I use an AWS P2 instance.

I approached the projects in the following order (latest to earliest):

Table of Contents

MNIST WGAN

A WGAN used to generate MNIST digits.

Language model

I experiment with two language models, one based of the weight dropped LSTM, and one based off temporal convolutional networks. Both are trained on the Wikitext 2 dataset.

Object Detection

I build a single and multi image object detector, which can label and locate objects in an image. Both models are trained on the Pascal VOC 2007 challenge dataset.

Stanford Databases

I work through the Stanford Databases by Jennifer Widom. The folder contains my solutions to the exercises. In the case where there were no exercises (i.e. there was only a quiz), I only added the statement of accomplishment.

Think Bayes

I work through Think Bayes by Allen Downey. The folder contains my solutions to the exercises.

I took the exercises from both the book, and the GitHub repository

Twitter Disasters

I build a tweet summarizer (COWTS), with the goal of providing a useful summary of tweets to a rescue team in a disaster scenario. This involves experimenting with Integer Linear Programming, term frequency - inverse document frequency scores and word graphs.

Detecting Bullies

I train machine learning algorithms on a smaller dataset (~3000 datapoints) to recognize bullying in online discussions, as part of Kaggle's Detecting Insults in Social Commentary competition. By implementing word embeddings, I significantly improve the competition's best result.

Style Neural Network

I experiment with generative neural networks by building a style neural network, which takes as input two images, and outputs an image with the content of the first image and the style of the second image. I improve the original neural style network (A Neural Network of Artistic Style) by implementing two additional papers (Incorporating Long Range Consistency in CNN based Texture Generation and Stable and Controllable Neural Texture Synthesis and Style Transfer Using Histogram Losses).

Quora

I build a recurrant neural network based on the GloVe word embeddings to recognize the intent of questions posted on Quora as part of Kaggle's Quora Question Pairs competition.

Recommender System

In this project, I use the Movie Lens dataset to explore a variety of data science tools, including dimensionality reduction and word embeddings. I build a recommender system using a recurrant neural network, and implement Google's Wide and Deep recommender neural network.

Image Recognition

In this project, I finetune and ensemble a variety of pretrained convolutional neural networks in Keras to identify invasive plant species in images, as part of Kaggle's Invasive Species Monitoring competition.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].