raffg / Harry_potter_nlp

Harry Potter and the Allocation of Dirichlet

NLP on the Books of Harry Potter

This repo demonstrates a collection of NLP tasks that all use the Harry Potter books as source documents. Each task is described separately:

  1. Topic modeling with Latent Dirichlet Allocation
  2. Regular Expression case study
  3. Extractive text summarization
  4. Sentiment analysis

[Figure: Emotional Sentiment of the Harry Potter series]

Instructions for the BasicNLP class (basic_nlp.py)

The class provides topic modeling with LDA, document summarization, and sentiment analysis.

Initialize the class with a list of documents and an optional list of document titles, for example:

    from basic_nlp import BasicNLP

    texts = ['this is the first document', 'this is the second document', 'this is the third document']
    titles = ['doc1', 'doc2', 'doc3']

    nlp = BasicNLP(texts, titles)

Then call the methods for the task you need:

  1. LDA:

    1. Create an elbow plot and print the coherence score for each candidate number of topics, with the range of candidates set by start, stop, and step:
      nlp.compute_coherence(start=5, stop=20, step=3)
      
    2. Set the number of topics to use in the model with:
      nlp.set_number_of_topics(10)
      
    3. View the topic clusters (only available in a Jupyter notebook):
      import pyLDAvis
      pyLDAvis.enable_notebook()
      vis = nlp.view_clusters()
      pyLDAvis.display(vis)
      
    4. Get the vocabulary for each topic in the LDA model (topics can be 'all', a list of integers, or a single integer) with:
      nlp.get_topic_vocabulary(topics='all', num_words=10)
      
    5. Get the documents most highly associated with the given topics with:
      nlp.get_representative_documents(topics='all', num_docs=1)
      
    6. Get the sentence summaries of the documents most highly associated with the given topics with:
      nlp.get_representative_sentences(topics='all', num_sentences=3)
      
    7. Provide a name for an LDA topic (if preferred over the numbering system) with:
      nlp.name_topic(topic_number=1, topic_name='My topic')
      
  2. Document summarization:

    Get the sentence summaries of the requested documents with:

    nlp.get_document_summaries(documents='all', num_sent=5)
    
  3. Sentiment analysis:

    Get the sentiment scores (compound, positive, neutral, negative) for the requested documents with:

    nlp.get_sentiment(documents='all')
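
The four sentiment scores listed above (compound, positive, neutral, negative) match the output format of the VADER sentiment analyzer. Whether BasicNLP uses VADER is an assumption here, and the dictionary keys, the example values, and the label_sentiment helper below are hypothetical and do not come from basic_nlp.py. Under that assumption, a compound score in [-1, 1] can be reduced to a coarse label using the 0.05 cutoffs that the VADER documentation suggests. A minimal sketch:

```python
def label_sentiment(scores, threshold=0.05):
    """Map a score dict with a 'compound' value in [-1, 1] to a coarse label.

    The +/-0.05 cutoffs follow the convention suggested for VADER; that the
    scores come from VADER is an assumption, not something the README states.
    """
    compound = scores["compound"]
    if compound >= threshold:
        return "positive"
    if compound <= -threshold:
        return "negative"
    return "neutral"


# Hypothetical score dicts, made up for illustration only.
example_scores = {
    "doc1": {"compound": 0.62, "positive": 0.31, "neutral": 0.64, "negative": 0.05},
    "doc2": {"compound": -0.48, "positive": 0.04, "neutral": 0.70, "negative": 0.26},
    "doc3": {"compound": 0.01, "positive": 0.10, "neutral": 0.80, "negative": 0.10},
}

# One coarse label per document title.
labels = {title: label_sentiment(s) for title, s in example_scores.items()}
```

If the real return value of nlp.get_sentiment uses different keys or a different structure, only the lookup of the compound value needs to change.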
    
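
To give a feel for what the extractive summaries in the document summarization step involve, here is a small word-frequency summarizer that uses only the standard library. It is an illustrative sketch and not the method used in basic_nlp.py: the summarize function, the stop-word list, and the scoring rule are all assumptions. Only the num_sent parameter name is borrowed from get_document_summaries.

```python
import re
from collections import Counter

# Tiny illustrative stop-word list (an assumption; real pipelines use a fuller one).
STOP_WORDS = {
    "the", "a", "an", "and", "of", "to", "in", "is", "it", "was", "that",
    "this", "on", "for", "with", "as", "at", "by", "he", "she", "his", "her",
}


def _tokens(text):
    """Lowercase word tokens with stop words removed."""
    return [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOP_WORDS]


def summarize(text, num_sent=2):
    """Return the num_sent highest-scoring sentences, in their original order."""
    # Naive sentence split: break after '.', '!' or '?' followed by whitespace.
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    freq = Counter(_tokens(text))

    def score(sentence):
        tokens = _tokens(sentence)
        # Average word frequency, so long sentences are not favored automatically.
        return sum(freq[t] for t in tokens) / len(tokens) if tokens else 0.0

    # Rank sentence indices by score, keep the best, then restore document order.
    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]), reverse=True)
    return [sentences[i] for i in sorted(ranked[:num_sent])]
```

Extractive methods select existing sentences rather than generating new text, which is why the result is a list of sentences copied from the input.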