All Projects β†’ susantabiswas β†’ Word-Prediction-Ngram

susantabiswas / Word-Prediction-Ngram

Licence: MIT license
Next Word Prediction using n-gram Probabilistic Model with various Smoothing Techniques

Programming Languages

Jupyter Notebook
11667 projects

Projects that are alternatives of or similar to Word-Prediction-Ngram

gpt-j-api
API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend
Stars: ✭ 248 (+892%)
Mutual labels:  language-model
SGP.NET
C# SGP4 Satellite Prediction Library. Load satellites from TLEs, convert between coordinate systems and reference frames, observe satellites from ground stations, and more.
Stars: ✭ 15 (-40%)
Mutual labels:  prediction
Facial-Recognition-Tool
➰ A Facial Recognition tool based on Open-CV.
Stars: ✭ 44 (+76%)
Mutual labels:  prediction
blockchain-predictor
Deep leraning cryptocurrency prediction with blockchain-based dataset
Stars: ✭ 34 (+36%)
Mutual labels:  prediction
mongolian-nlp
Useful resources for Mongolian NLP
Stars: ✭ 119 (+376%)
Mutual labels:  language-model
Black-Box-Tuning
ICML'2022: Black-Box Tuning for Language-Model-as-a-Service
Stars: ✭ 99 (+296%)
Mutual labels:  language-model
LM-CNLC
Chinese Natural Language Correction via Language Model
Stars: ✭ 15 (-40%)
Mutual labels:  language-model
backoff
PHP library providing retry functionality with multiple backoff strategies and jitter support
Stars: ✭ 132 (+428%)
Mutual labels:  backoff
open clip
An open source implementation of CLIP.
Stars: ✭ 1,534 (+6036%)
Mutual labels:  language-model
chainer-notebooks
Jupyter notebooks for Chainer hands-on
Stars: ✭ 23 (-8%)
Mutual labels:  language-model
CoLAKE
COLING'2020: CoLAKE: Contextualized Language and Knowledge Embedding
Stars: ✭ 86 (+244%)
Mutual labels:  language-model
The-Purchase-and-Redemption-Forecast-Challenge-baseline
ε€©ζ± β€œθ΅„ι‡‘ζ΅ε…₯ζ΅ε‡Ίι’„ζ΅‹β€”β€”ζŒ‘ζˆ˜baselineβ€ηš„θ§£ε†³ζ–Ήζ‘ˆοΌŒηΊΏδΈŠζ•ˆζžœ143.5
Stars: ✭ 78 (+212%)
Mutual labels:  prediction
bert-movie-reviews-sentiment-classifier
Build a Movie Reviews Sentiment Classifier with Google's BERT Language Model
Stars: ✭ 12 (-52%)
Mutual labels:  language-model
poesy
Poetry generation via natural language markov models
Stars: ✭ 56 (+124%)
Mutual labels:  ngrams
ANTsR
Advanced Normalization Tools in R
Stars: ✭ 101 (+304%)
Mutual labels:  prediction
clarktech-ncaab-predictor
A machine learning project to predict NCAA Men's Basketball outcomes
Stars: ✭ 24 (-4%)
Mutual labels:  prediction
Deep-NLP-Resources
Curated list of all NLP Resources
Stars: ✭ 65 (+160%)
Mutual labels:  language-model
FNet-pytorch
Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms
Stars: ✭ 204 (+716%)
Mutual labels:  language-model
language-planner
Official Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"
Stars: ✭ 84 (+236%)
Mutual labels:  language-model
sg-food-ml
This script is used to scrap images from the Internet to classify 5 common noodle "mee" dishes in Singapore. Wanton Mee, Bak Chor Mee, Lor Mee, Prawn Mee and Mee Siam.
Stars: ✭ 18 (-28%)
Mutual labels:  prediction

HitCount

Word-Prediction-Ngram

Next Word Prediction using n-gram Probabilistic Model.

Various jupyter notebooks are there using different Language Models for next word Prediction.

Input :

The users Enters a text sentence

Output :

Predicts a word which can follow the input sentence

Various Smoothing Techniques have been used in different Language Models along with combination of Interpolation and Backoff in these different Language Models.

Smoothing Techniques Used:

  1. Add 1 
  2. Good Turing 
  3. Simple Knesser Ney
  4. Interpolated Knesser Ney

How the code works:

  1. Cleaning of training corpus ( Removing Punctuations etc)
  2. Creation of Language Model:
    i) Formation of n-grams (Unigram, Bigram, Trigram, Quadgram)
    ii) Probability Dictionary Creation with provision of various Smoothing Mechanism
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].