All Projects → rameshjes → Semantic-Textual-Similarity

rameshjes / Semantic-Textual-Similarity

Licence: other
Natural Language Processing using NLTK and Spacy

Programming Languages

Roff
2310 projects
Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Semantic-Textual-Similarity

contextualSpellCheck
✔️Contextual word checker for better suggestions
Stars: ✭ 274 (+813.33%)
Mutual labels:  spacy, spelling-correction, spellchecker
spacy hunspell
✏️ Hunspell extension for spaCy 2.0.
Stars: ✭ 94 (+213.33%)
Mutual labels:  spacy, spelling-correction, spellchecker
tweets-preprocessor
Repo containing the Twitter preprocessor module, developed by the AUTH OSWinds team
Stars: ✭ 26 (-13.33%)
Mutual labels:  spacy, nltk, spacy-nlp
nlp-cheat-sheet-python
NLP Cheat Sheet, Python, spacy, LexNPL, NLTK, tokenization, stemming, sentence detection, named entity recognition
Stars: ✭ 69 (+130%)
Mutual labels:  spacy, nltk, spacy-nlp
topic modelling financial news
Topic modelling on financial news with Natural Language Processing
Stars: ✭ 51 (+70%)
Mutual labels:  spacy, nltk
customized-symspell
Java port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm
Stars: ✭ 51 (+70%)
Mutual labels:  spelling-correction, spellchecker
NLP Quickbook
NLP in Python with Deep Learning
Stars: ✭ 516 (+1620%)
Mutual labels:  spacy, spacy-nlp
nlp workshop odsc europe20
Extensive tutorials for the Advanced NLP Workshop in Open Data Science Conference Europe 2020. We will leverage machine learning, deep learning and deep transfer learning to learn and solve popular tasks using NLP including NER, Classification, Recommendation \ Information Retrieval, Summarization, Classification, Language Translation, Q&A and T…
Stars: ✭ 127 (+323.33%)
Mutual labels:  spacy, nltk
bert-tensorflow-pytorch-spacy-conversion
Instructions for how to convert a BERT Tensorflow model to work with HuggingFace's pytorch-transformers, and spaCy. This walk-through uses DeepPavlov's RuBERT as example.
Stars: ✭ 26 (-13.33%)
Mutual labels:  spacy, spacy-nlp
augmenty
Augmenty is an augmentation library based on spaCy for augmenting texts.
Stars: ✭ 101 (+236.67%)
Mutual labels:  spacy, spacy-nlp
Cltk
The Classical Language Toolkit
Stars: ✭ 650 (+2066.67%)
Mutual labels:  spacy, nltk
spellchecker-wasm
SpellcheckerWasm is an extrememly fast spellchecker for WebAssembly based on SymSpell
Stars: ✭ 46 (+53.33%)
Mutual labels:  spelling-correction, spellchecker
WordSegmentationDP
Word Segmentation with Dynamic Programming
Stars: ✭ 18 (-40%)
Mutual labels:  spelling-correction, spellchecker
Python nlp tutorial
This repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)
Stars: ✭ 72 (+140%)
Mutual labels:  spacy, nltk
turing
✨ 🧬 Turing AI - Semantic Navigation, Chatbot using Search Engine and Many NLP Vendors.
Stars: ✭ 30 (+0%)
Mutual labels:  spacy, spacy-nlp
Text Analytics With Python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Stars: ✭ 1,132 (+3673.33%)
Mutual labels:  spacy, nltk
Practical Machine Learning With Python
Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
Stars: ✭ 1,868 (+6126.67%)
Mutual labels:  spacy, nltk
spacy-sentence-bert
Sentence transformers models for SpaCy
Stars: ✭ 88 (+193.33%)
Mutual labels:  spacy
shell-extras
Extra Unix Shell Material
Stars: ✭ 22 (-26.67%)
Mutual labels:  english
autumn
A Java parser combinator library written with an unmatched feature set.
Stars: ✭ 112 (+273.33%)
Mutual labels:  parse-trees

Semantic Textual Similarity using Natural Language Processing(NLP)

Abstract

Semantic textual similarity computes the equivalence of two sentences on the basis of its conceptual similarity. It is widely used in natural languages processing tasks such as essay scoring, machine translation, text classification, information extraction, and question answering. This project focuses on one of the applications of semantic textual similarity known as automatic short answer grading (ASAG). It assigns a grade to a response provided by a student by comparing with one or more model answers. In particular, we selected one of the state-of-the-art short answer grading approaches that use Stanford CoreNLP library, and we used the same approach with the help of two open source libraries; Natural Language ToolKit (NLTK) and Spacy. For evaluation, Texas dataset and an in-house benchmarking ASAG dataset based on Mathematics for Robotics and Control (MRC) course were considered. Performances among all three libraries were evaluated using Pearson correlation coefficient, root mean square error (RMSE), and the runtime. Results based on Texas dataset showed that Stanford CoreNLP library has better Pearson correlation coefficient(0.66) and lowest RMSE(0.85) than NLTK and Spacy libraries. While using MRC dataset, all 3 libraries showed the comparative results on evaluated metrics.

Contents of Repository

This repository contains:

    Exercises related to textual similarity using NLTK and SPACY libraries that can help for short answer grading
    
    Comparison of spell corrector approaches using:
                    - Spell corrector using Ngrams,Jaccard coefficient and Minimum edit distance
                    - Spell corrector using Minimum Edit Distance(MED)

    Create jupyter notebooks for each student from Mohler data set for short questions and answers

    Create instructor version of assignments using nbgrader

    Create student version of assignments using nbgrader
    
    Wiki contains theoretically concepts: https://github.com/rameshjesswani/Semantic-Textual-Similarity/wiki
    
    Word Aligner using NLTK and Spacy libraries
    
    ASAG based Sultan et al. (2016) approach using NLTK And Spacy libraries

Guidelines for Monolingual Word Aligner

It can used as individual module. For more usage, check here: Word Aligner using NLTK and Spacy

Install nltk library(procedure given below)
Setup Stanford Parser, NER, PosTagger(link to setup in nltk given below)

Guidelines for ASAG

Details about Asag can be found here: ASAG

Installation

NLTK requires Python versions 2.7, 3.4, or 3.5

Install NLTK library

sudo pip install -U nltk

Install packages of NLTK

import nltk
nltk.download()

SPACY is compatible with 64-bit CPYTHON 2.6+/3.3+ and runs on Unix/Linux, macOS/OS X and WINDOWS

Install SPACY(code works with version 2.0.12) library

pip install -U spacy

After spacy installation you need to download a Language model

python -m spacy download en

Nbgrader Installation

pip install nbgrader

if you are using Anaconda:

conda install jupyter
conda install -c conda-forge nbgrader

To install nbgrader extensions:

jupyter nbextension install --user-prefix --py nbgrader --overwrite
jupyter nbextension enable --user-prefix --py nbgrader
jupyter serverextension enable --user-prefix --py nbgrader

For more docs about nbgrader:

http://nbgrader.readthedocs.io/en/stable/user_guide/installation.html

To use Stanford Parser, NER, PosTagger in NLTK check files:

https://github.com/rameshjesswani/Semantic-Textual-Similarity/blob/master/monolingualWordAligner/stanfordParser_setup.txt

https://github.com/rameshjesswani/Semantic-Textual-Similarity/blob/master/monolingualWordAligner/stanfordNERTagger_setup.txt

https://github.com/rameshjesswani/Semantic-Textual-Similarity/blob/master/monolingualWordAligner/stanfordPOSTagger_setup.txt

MindMap

Mind map

General NLP Pipeline

General Nlp pipeline

Bibtex

@unpublished{[RnD]Kumar,
	Authors = {Ramesh Kumar},
	Month = {January},
	Note = {WS17
	H-BRS - Evaluation of Semantic Textual Similarity Approaches for Automatic Short Answer Grading
Ploeger, Nair supervising},
	Title = {Evaluation of Semantic Textual Similarity Approaches for Automatic Short Answer Grading},
	Year = {2017/18}}
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].