brianspiering / Word2vec Workshop

Licence: apache-2.0
word2vec workshop - a conceptual introduction and practical application

Word2vec Algorithm: Made as simple as possible, but no simpler


Description

A Pythonic introduction to the word2vec algorithm. Word2vec translates words (strings) into vectors (lists of floats). It is a relatively new algorithm that has proven very useful for making sense of text data. You should walk out with a conceptual understanding of the algorithm and feel empowered to try it on your favorite collection of text data.

“You shall know a word by the company it keeps” is a common refrain in Natural Language Processing (NLP). word2vec acts on that idea by training a neural network to learn which words tend to co-occur and embedding the words in a meaningful vector space. From these "word embeddings", it is possible to compare words with distance measures, add and subtract words to explore relationships between concepts, and cluster words to find semantically related groups. In fact, word2vec is a general-purpose algorithm that allows any sequential data to be encoded into meaningful vectors, including emojis!
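
To make these ideas concrete, here is a minimal sketch in plain Python. It is not the workshop's code and it does not train a network. The helper names (`skipgram_pairs`, `cosine`, `analogy`) are hypothetical, and the vectors in `toy_vectors` are hand-made for illustration, not the output of a trained model. The first function shows one common way to turn "the company a word keeps" into (center, context) training pairs; the others show comparing words by cosine similarity and adding/subtracting word vectors.

```python
import math


def skipgram_pairs(tokens, window=2):
    """Return (center, context) pairs for words within `window` positions."""
    pairs = []
    for i, center in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                pairs.append((center, tokens[j]))
    return pairs


def cosine(u, v):
    """Cosine similarity between two equal-length lists of floats."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# ASSUMPTION: hand-made toy vectors, NOT trained embeddings.
# The three dimensions loosely stand for (royalty, male, female).
toy_vectors = {
    "king":  [0.9, 0.8, 0.1],
    "queen": [0.9, 0.1, 0.8],
    "man":   [0.1, 0.9, 0.1],
    "woman": [0.1, 0.1, 0.9],
    "apple": [0.0, 0.2, 0.2],
}


def analogy(a, b, c, vectors):
    """Return the word closest to vec(a) - vec(b) + vec(c), excluding the inputs."""
    target = [x - y + z for x, y, z in zip(vectors[a], vectors[b], vectors[c])]
    candidates = [w for w in vectors if w not in (a, b, c)]
    return max(candidates, key=lambda w: cosine(target, vectors[w]))


if __name__ == "__main__":
    print(skipgram_pairs(["you", "shall", "know"], window=1))
    print(cosine(toy_vectors["king"], toy_vectors["queen"]))
    print(analogy("king", "man", "woman", toy_vectors))
```

With real embeddings the vectors would come from a trained model (for example, a library such as gensim), and they would typically have a few hundred dimensions rather than three.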


Bio

Dr. Brian Spiering is a faculty member at GalvanizeU, which offers a Master of Science in Data Science. His passions are Natural Language Processing (NLP), deep learning, and building data products. He is active in the San Francisco Data Science community through volunteering and mentoring.

Drop him a line [email protected]


Disclaimer: These are interactive notebooks that are meant to be run. There might be elements not rendered correctly on static GitHub pages.
