
Ankushr785 / Emotion-recognition-from-tweets

Licence: other
A comprehensive approach to recognizing emotion (sentiment) from a given tweet, using supervised machine learning.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Emotion-recognition-from-tweets

Handwritten-Digits-Classification-Using-KNN-Multiclass Perceptron-SVM
🏆 A comparative study of handwritten digit recognition using classifiers such as K-Nearest Neighbours (K-NN), Multiclass Perceptron/Artificial Neural Network (ANN) and Support Vector Machine (SVM), discussing the pros and cons of each algorithm and comparing their accuracy and efficiency.
Stars: ✭ 42 (+147.06%)
Mutual labels:  support-vector-machines, sigmoid-function
img classification deep learning
No description or website provided.
Stars: ✭ 19 (+11.76%)
Mutual labels:  word2vec, ann
Text-Analysis
Explaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.
Stars: ✭ 48 (+182.35%)
Mutual labels:  word2vec, text-processing
asm2vec
An unofficial implementation of asm2vec as a standalone python package
Stars: ✭ 127 (+647.06%)
Mutual labels:  word2vec
FacialEmotionRecognition
Using Extended Cohn-Kanade AU-Coded Facial Expression Database to classify basic human facial emotion expressions using ann
Stars: ✭ 28 (+64.71%)
Mutual labels:  ann
dif
'dif' is a Linux preprocessing front end to gvimdiff/meld/kompare
Stars: ✭ 18 (+5.88%)
Mutual labels:  text-processing
text2video
Text to Video Generation Problem
Stars: ✭ 28 (+64.71%)
Mutual labels:  text-processing
zeyrek
Python morphological analyzer for Turkish language. Partial port of ZemberekNLP.
Stars: ✭ 36 (+111.76%)
Mutual labels:  lemmatization
receiptdID
Receipt.ID is a multi-label, multi-class, hierarchical classification system implemented in a two layer feed forward network.
Stars: ✭ 22 (+29.41%)
Mutual labels:  word2vec
Machine-Learning-Models
In This repository I made some simple to complex methods in machine learning. Here I try to build template style code.
Stars: ✭ 30 (+76.47%)
Mutual labels:  ann
corpusexplorer2.0
Corpus linguistics has never been so easy...
Stars: ✭ 16 (-5.88%)
Mutual labels:  text-processing
doc2vec-api
document embedding and machine learning script for beginners
Stars: ✭ 92 (+441.18%)
Mutual labels:  word2vec
stackoverflow-semantic-search
Word2Vec encodings based search engine for Stackoverflow questions
Stars: ✭ 23 (+35.29%)
Mutual labels:  word2vec
syn
syn - the thesaurus
Stars: ✭ 45 (+164.71%)
Mutual labels:  text-processing
wordmap
Visualize large text collections with WebGL
Stars: ✭ 23 (+35.29%)
Mutual labels:  word2vec
word2vec-movies
Bag of words meets bags of popcorn in Python 3 (Chinese tutorial)
Stars: ✭ 54 (+217.65%)
Mutual labels:  word2vec
acl2017 document clustering
code for "Determining Gains Acquired from Word Embedding Quantitatively Using Discrete Distribution Clustering" ACL 2017
Stars: ✭ 21 (+23.53%)
Mutual labels:  word2vec
SuperCombinators
[Deprecated] A Swift parser combinator framework
Stars: ✭ 19 (+11.76%)
Mutual labels:  text-processing
Word-Embeddings-and-Document-Vectors
An evaluation of word-embeddings for classification
Stars: ✭ 32 (+88.24%)
Mutual labels:  word2vec
TweebankNLP
[LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweebank-NER dataset
Stars: ✭ 84 (+394.12%)
Mutual labels:  lemmatization

Emotion-recognition-from-tweets

A comprehensive approach to recognizing emotion (sentiment) from a given tweet, using supervised machine learning. The 'Untitled.ipynb' file is a Jupyter notebook containing test code. The remaining .py files contain the code that their names describe.

A complete explanation of the approach is available here: https://medium.com/@ankushraut/artificial-neural-network-for-text-classification-b7aa5994d985

Problem Statement

-> Given a dataset mapping tweets to the associated emotions, an emotion recognizer algorithm needs to be created.

-> Data source - https://l.facebook.com/l.php?u=https%3A%2F%2Fwww.crowdflower.com%2Fwp-content%2Fuploads%2F2016%2F07%2Ftext_emotion.csv&h=ATMfMjsluVtxjiGHelTFi-X_nKckyiEYJHwiGN9u-1LriOzSoQ4oFbjCBZItdFCummCAEnLlD_m7bUwMZs4LrZnDgDt4txt3laalwYWZnETPZBqChySF_gURrBRyFNGFNb_y&s=1&enc=AZP8Ci-JshpS1aE5ggdBCNoVTeDCTGhQTic6bXTQJ_M6PcwGzRMJfqiPYywc62pbkKEwvWl-M_9_OcEzpf1lKZ38b8xGHiCDtUf50JN-W6vA_Q

-> Libraries - Natural Language Toolkit (NLTK) and scikit-learn

Pre-processing

-> Removal of symbols and other unwanted character patterns with regular expressions, using the 're' library

-> Lemmatization (lexicon normalization): each word is reduced to its lemma using WordNetLemmatizer from NLTK

-> Removal of multi-letter ambiguities, e.g. 'noooo' gets converted to 'no'

-> (Optional) Removal of stop-words - this caused a decrease in both the F1-score and the overall accuracy
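
The pre-processing steps above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the function name `clean_tweet`, the exact regular expressions, and the stripping of URLs and @-mentions are all assumptions. Lemmatization is passed in as an optional callable so the sketch runs without NLTK data; with NLTK and its WordNet corpus installed, `WordNetLemmatizer().lemmatize` could be supplied for that argument.

```python
import re

def clean_tweet(text, lemmatize=None):
    """Hypothetical helper: normalize one tweet into space-separated tokens."""
    text = text.lower()
    # Assumption: URLs and @-mentions carry no emotional signal, so drop them.
    text = re.sub(r"https?://\S+|@\w+", " ", text)
    # Replace every symbol, digit and punctuation mark with a space.
    text = re.sub(r"[^a-z\s]", " ", text)
    # Collapse runs of three or more identical letters: 'noooo' -> 'no'.
    text = re.sub(r"([a-z])\1{2,}", r"\1", text)
    tokens = text.split()
    if lemmatize is not None:
        # e.g. lemmatize=WordNetLemmatizer().lemmatize (needs NLTK's WordNet data)
        tokens = [lemmatize(token) for token in tokens]
    return " ".join(tokens)

cleaned = clean_tweet("@user Noooo!!! I can't wait http://t.co/abc")
print(cleaned)  # no i can t wait
```

Collapsing every long run to a single letter matches the 'noooo' example, but it would also shorten a word such as 'coool' to 'col', so a real pipeline may want a gentler rule.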

A look at the data before and after pre-processing

[Images: 'before' and 'whole_before' show the data before pre-processing; 'after' and 'whole_after' show the data after pre-processing.]

Vectorization

-> Term frequency-inverse document frequency (TF-IDF, via TfidfVectorizer) used to convert the text to vectors (for SVM and Naive Bayes)

-> Bag-of-words representation used as the input for the sigmoid-layer neural network model
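
As a rough illustration of the two representations, the sketch below runs scikit-learn's `TfidfVectorizer` and `CountVectorizer` on three made-up tweets. The sample sentences are assumptions for demonstration only, and both vectorizers are used with their default settings, which may differ from the settings used in the repository.

```python
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer

# Toy, already pre-processed tweets (hypothetical data).
tweets = ["i am so happy today", "i am so sad today", "happy happy day"]

# TF-IDF: down-weights words that occur in many tweets; with the default
# settings each row is L2-normalized.
tfidf = TfidfVectorizer()
X_tfidf = tfidf.fit_transform(tweets)

# Bag of words: raw counts per vocabulary term.
bow = CountVectorizer()
X_bow = bow.fit_transform(tweets)

# The default tokenizer drops one-letter tokens such as 'i'.
print(sorted(bow.vocabulary_))
print(X_bow.toarray())
```

Both calls return sparse matrices with one row per tweet and one column per vocabulary term, so either one can be fed to a classifier.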

Model - 1

-> Support Vector Machine with a linear kernel - constructs hyperplanes that separate the classes.
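
A linear-kernel SVM on TF-IDF features might look like the sketch below. The training tweets, the labels and the variable names are made up for illustration, and using `SVC(kernel="linear")` inside a pipeline is an assumption about how the model could be wired up, not a copy of the repository's code.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import SVC

# Hypothetical toy training set: pre-processed tweets and their emotions.
train_texts = [
    "so happy with my new phone",
    "what a lovely happy morning",
    "feeling sad and lonely tonight",
    "so sad my team lost",
    "worried about the exam tomorrow",
    "really worried about my job",
]
train_labels = ["happiness", "happiness", "sadness", "sadness", "worry", "worry"]

# TF-IDF features feeding a linear-kernel SVM. For more than two classes,
# SVC trains one-vs-one classifiers internally.
svm_model = make_pipeline(TfidfVectorizer(), SVC(kernel="linear"))
svm_model.fit(train_texts, train_labels)

svm_predictions = svm_model.predict(["happy happy day", "i am worried"])
print(svm_predictions)
```

For larger datasets, `LinearSVC` is a commonly used alternative that also fits a linear decision boundary, though it is a different estimator with different defaults.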

Model - 2

-> Naive Bayes classifier - naively assumes that the words of a sentence are independent of one another, given the emotion class.
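
The Naive Bayes model can be sketched in the same way. The choice of `MultinomialNB` is an assumption (the text above only says "Naive Bayes classifier"), and the tweets and labels are again hypothetical toy data.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Hypothetical toy training set.
nb_texts = [
    "so happy with my new phone",
    "what a lovely happy morning",
    "feeling sad and lonely tonight",
    "so sad my team lost",
]
nb_labels = ["happiness", "happiness", "sadness", "sadness"]

# Each word contributes to the class score independently of the others,
# which is the "naive" assumption.
nb_model = make_pipeline(TfidfVectorizer(), MultinomialNB())
nb_model.fit(nb_texts, nb_labels)

# Class probabilities for an unseen tweet; each row sums to 1.
nb_probs = nb_model.predict_proba(["happy morning"])
nb_classes = nb_model.named_steps["multinomialnb"].classes_
print(dict(zip(nb_classes, nb_probs[0])))
```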

Model - 3

-> Artificial Neural Network - 3 layer neural network with sigmoid activation and gradient descent optimization
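
To make the idea concrete, here is a minimal NumPy sketch of such a network. It assumes that "3 layer" means an input layer, one hidden layer and an output layer, and it omits bias terms for brevity. The hidden size, learning rate, epoch count, squared-error loss and the tiny bag-of-words dataset are all illustrative assumptions rather than the repository's actual settings.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train(X, Y, hidden=8, lr=0.5, epochs=3000, seed=0):
    """Full-batch gradient descent on squared error (hypothetical settings)."""
    rng = np.random.default_rng(seed)
    W1 = rng.normal(0.0, 0.5, size=(X.shape[1], hidden))   # input -> hidden
    W2 = rng.normal(0.0, 0.5, size=(hidden, Y.shape[1]))   # hidden -> output
    losses = []
    for _ in range(epochs):
        H = sigmoid(X @ W1)   # hidden-layer activations
        P = sigmoid(H @ W2)   # output-layer activations
        err = P - Y
        losses.append(float(np.mean(err ** 2)))
        # Backpropagation, using sigmoid'(z) = s * (1 - s);
        # constant factors of the loss are folded into lr.
        dP = err * P * (1.0 - P)
        dH = (dP @ W2.T) * H * (1.0 - H)
        W2 -= lr * (H.T @ dP)
        W1 -= lr * (X.T @ dH)
    return W1, W2, losses

def predict(X, W1, W2):
    return sigmoid(sigmoid(X @ W1) @ W2).argmax(axis=1)

# Toy bag-of-words rows over the vocabulary [happy, love, sad, hate].
X = np.array([[1, 1, 0, 0], [1, 0, 0, 0], [0, 0, 1, 1], [0, 0, 1, 0]], dtype=float)
# One-hot targets: column 0 = happiness, column 1 = sadness.
Y = np.array([[1, 0], [1, 0], [0, 1], [0, 1]], dtype=float)

W1, W2, losses = train(X, Y)
print(losses[0], losses[-1])
print(predict(X, W1, W2))
```

The training loss is recorded at every epoch, so comparing the first and last values gives a quick check that gradient descent is actually reducing the error.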
