All Projects → bonzanini → Nlp Tutorial

bonzanini / Nlp Tutorial

Tutorial: Natural Language Processing in Python

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Nlp Tutorial

Aind Nlp
Coding exercises for the Natural Language Processing concentration, part of Udacity's AIND program.
Stars: ✭ 202 (-26.28%)
Mutual labels:  jupyter-notebook, natural-language-processing
Text summarization with tensorflow
Implementation of a seq2seq model for summarization of textual data. Demonstrated on amazon reviews, github issues and news articles.
Stars: ✭ 226 (-17.52%)
Mutual labels:  jupyter-notebook, natural-language-processing
Graph Convolution Nlp
Graph Convolution Network for NLP
Stars: ✭ 208 (-24.09%)
Mutual labels:  jupyter-notebook, natural-language-processing
Notebooks
Jupyter Notebooks with Deep Learning Tutorials
Stars: ✭ 188 (-31.39%)
Mutual labels:  jupyter-notebook, natural-language-processing
Malaya
Natural Language Toolkit for bahasa Malaysia, https://malaya.readthedocs.io/
Stars: ✭ 239 (-12.77%)
Mutual labels:  jupyter-notebook, natural-language-processing
Germanwordembeddings
Toolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets
Stars: ✭ 189 (-31.02%)
Mutual labels:  jupyter-notebook, natural-language-processing
Machine Learning Notebooks
Machine Learning notebooks for refreshing concepts.
Stars: ✭ 222 (-18.98%)
Mutual labels:  jupyter-notebook, natural-language-processing
Nel
Entity linking framework
Stars: ✭ 176 (-35.77%)
Mutual labels:  jupyter-notebook, natural-language-processing
Pytorch Bert Crf Ner
KoBERT와 CRF로 만든 한국어 개체명인식기 (BERT+CRF based Named Entity Recognition model for Korean)
Stars: ✭ 236 (-13.87%)
Mutual labels:  jupyter-notebook, natural-language-processing
Deepnlp Models Pytorch
Pytorch implementations of various Deep NLP models in cs-224n(Stanford Univ)
Stars: ✭ 2,760 (+907.3%)
Mutual labels:  jupyter-notebook, natural-language-processing
Bert Sklearn
a sklearn wrapper for Google's BERT model
Stars: ✭ 182 (-33.58%)
Mutual labels:  jupyter-notebook, natural-language-processing
Bertviz
Tool for visualizing attention in the Transformer model (BERT, GPT-2, Albert, XLNet, RoBERTa, CTRL, etc.)
Stars: ✭ 3,443 (+1156.57%)
Mutual labels:  jupyter-notebook, natural-language-processing
Deeptoxic
top 1% solution to toxic comment classification challenge on Kaggle.
Stars: ✭ 180 (-34.31%)
Mutual labels:  jupyter-notebook, natural-language-processing
Pytorch graph Rel
A PyTorch implementation of GraphRel
Stars: ✭ 204 (-25.55%)
Mutual labels:  jupyter-notebook, natural-language-processing
Nlp profiler
A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Stars: ✭ 181 (-33.94%)
Mutual labels:  jupyter-notebook, natural-language-processing
Practical 1
Oxford Deep NLP 2017 course - Practical 1: word2vec
Stars: ✭ 220 (-19.71%)
Mutual labels:  jupyter-notebook, natural-language-processing
Deep Math Machine Learning.ai
A blog which talks about machine learning, deep learning algorithms and the Math. and Machine learning algorithms written from scratch.
Stars: ✭ 173 (-36.86%)
Mutual labels:  jupyter-notebook, natural-language-processing
Web Database Analytics
Web scrapping and related analytics using Python tools
Stars: ✭ 175 (-36.13%)
Mutual labels:  jupyter-notebook, natural-language-processing
Pytorch Transformers Classification
Based on the Pytorch-Transformers library by HuggingFace. To be used as a starting point for employing Transformer models in text classification tasks. Contains code to easily train BERT, XLNet, RoBERTa, and XLM models for text classification.
Stars: ✭ 229 (-16.42%)
Mutual labels:  jupyter-notebook, natural-language-processing
Pytorch Sentiment Analysis
Tutorials on getting started with PyTorch and TorchText for sentiment analysis.
Stars: ✭ 3,209 (+1071.17%)
Mutual labels:  jupyter-notebook, natural-language-processing

Tutorial: Natural Language Processing in Python

This repo contains material for a workshop on Natural Language Processing with Python.

Audience

The target audience of this workshop are students, researchers, developers, hobbyists and anyone interested in knowing more about Natural Language Processing and Text Analytics.

Some very basic knowledge of Python is assumed (e.g. if you have seen some Python script before, you're good to go), but no previous NLP knowledge is required.

Presentations

Different versions of this workshop have been delivered at different events:

  • PyCon UK 2016: 3h session (slides presentations/2016-pyconuk-slides.pdf)
  • PyCon Ireland 2016: 1.5h session (slides presentations/2016-pyconie-slides.pdf)
  • PyCon Italy 2017: 3.5h session (slides presentations/2017-pyconitaly-slides.pdf)

Environment Set up

The code has been tested with Python 3.4 and 3.5. Support for Python 2.7 is best-effort, if you find an issue please report it.

This paragraph describes how to set up your environment locally.

Step 1 - clone this repo::

git clone https://github.com/bonzanini/nlp-tutorial
cd nlp-tutorial

Step 2 - create and activate a Python virtual environment::

virtualenv nlp-venv
source nlp-venv/bin/activate

Step 2 (alternative) - create a Conda environment::

conda create --name nlp-venv python=3.5
source activate nlp-venv

Step 3 - install libraries::

pip install -r requirements.txt

This will download and install NLTK, scikit-learn and jupyter (plus dependencies).

NLTK requires some data to be installed separately (more details on the NLTK website <http://www.nltk.org/data.html>_).

From the command line, you can download the required packages::

python -m nltk.downloader punkt stopwords reuters

Alternatively, from a Python interactive shell::

>>> import nltk
>>> nltk.download()

Then use the GUI to select the requires packages (punkt, stopwords, reuters).

Tip: even if you can use "all" as package name to install all the NLTK data, it's not a great thing to do over a flakey conference wi-fi. This will download approx. 2Gb and if we all do it at the same time we'll kill the conference wi-fi :)

Finally - run Jupyter::

jupyter notebook

In order to test that your environment is correctly set. Please open the notebook "00 Environment Test" and follow the instructions.

matplotlib backend issues

There might be a few issues related to matplotlib backends as described on their documentation <http://matplotlib.org/faq/virtualenv_faq.html>_, especially on macOS.

By editing/creating the file ~/.matplotlib/matplotlibrc with the following line::

backend: TkAgg

the issue should be fixed. If not, please refer to the matplotlib docs <http://matplotlib.org/faq/virtualenv_faq.html>_

Authors

Main authors:

  • Marco Bonzanini (@MarcoBonzanini <http://www.twitter.com/marcobonzanini>_)
  • Miguel Martinez-Alvarez (@MiguelMAlvarez <http://www.twitter.com/miguelmalvarez>_)

License

Code (mainly in notebooks folder) under MIT license.

Documentation and slides under CC-BY license.

Data

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].