Lingvo
Stars: ✭ 2,361 (+6645.71%)
lm-scorer: 📃 Language Model based sentence scoring library
Stars: ✭ 264 (+654.29%)
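The idea behind LM-based sentence scoring can be sketched with a toy unigram model. The vocabulary and probabilities below are illustrative only, not lm-scorer's actual API (which wraps pretrained transformer models):

```python
import math

# Toy unigram language model: hand-picked illustrative probabilities.
unigram_logprob = {
    "the": math.log(0.07),
    "cat": math.log(0.01),
    "sat": math.log(0.005),
    "<unk>": math.log(0.0001),  # fallback for out-of-vocabulary tokens
}

def score_sentence(tokens):
    """Sum per-token log-probabilities; a higher score means more plausible."""
    return sum(unigram_logprob.get(t, unigram_logprob["<unk>"]) for t in tokens)

likely = score_sentence(["the", "cat", "sat"])
unlikely = score_sentence(["the", "zzz", "sat"])
assert likely > unlikely  # the in-vocabulary sentence scores higher
```

Real scorers do the same summation over conditional token log-probabilities from a neural LM rather than unigram counts.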
LM-CNLC: Chinese Natural Language Correction via Language Model
Stars: ✭ 15 (-57.14%)
FNet-pytorch: Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms
Stars: ✭ 204 (+482.86%)
PCPM: Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-40%)
CoLAKE: Contextualized Language and Knowledge Embedding (COLING 2020)
Stars: ✭ 86 (+145.71%)
gpt-j: A GPT-J API to use with python3 to generate text, blogs, code, and more
Stars: ✭ 101 (+188.57%)
frog: Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
Stars: ✭ 70 (+100%)
php-ntlm: Message encoder/decoder and password hasher for the NTLM authentication protocol
Stars: ✭ 14 (-60%)
ml: machine learning
Stars: ✭ 29 (-17.14%)
nytwit: New York Times Word Innovation Types dataset
Stars: ✭ 21 (-40%)
mongolian-nlp: Useful resources for Mongolian NLP
Stars: ✭ 119 (+240%)
tying-wv-and-wc: Implementation of "Tying Word Vectors and Word Classifiers: A Loss Framework for Language Modeling"
Stars: ✭ 39 (+11.43%)
datalinguist: Stanford CoreNLP in idiomatic Clojure
Stars: ✭ 93 (+165.71%)
folia: FoLiA (Format for Linguistic Annotation) is a rich XML-based annotation format for representing language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for proces…
Stars: ✭ 56 (+60%)
gdc: Code for the ICLR 2021 paper "A Distributional Approach to Controlled Text Generation"
Stars: ✭ 94 (+168.57%)
eflm: Efficient fitting of linear and generalized linear models using just base R. The speed gains over lm and glm come from reducing the N×P model matrix to a P×P matrix; the best computational performance is obtained when R is linked against OpenBLAS, Intel MKL, or another optimized BLAS library.
Stars: ✭ 14 (-60%)
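The P×P reduction that eflm describes is the classic normal-equations trick: rather than factorizing the full N×P design matrix, accumulate the small Gram matrix XᵀX and vector Xᵀy, then solve the P×P system. A pure-Python sketch for two predictors (eflm itself is R; this only illustrates the idea):

```python
# Least squares via normal equations: reduce the NxP problem to PxP.
def fit_via_normal_equations(X, y):
    """Solve (X'X) b = X'y for a 2-predictor model using Cramer's rule."""
    # Accumulate the 2x2 Gram matrix X'X and 2-vector X'y in one pass,
    # so the N-row design matrix never has to be factorized whole.
    s00 = s01 = s11 = t0 = t1 = 0.0
    for (x0, x1), yi in zip(X, y):
        s00 += x0 * x0; s01 += x0 * x1; s11 += x1 * x1
        t0 += x0 * yi;  t1 += x1 * yi
    det = s00 * s11 - s01 * s01
    b0 = (t0 * s11 - s01 * t1) / det
    b1 = (s00 * t1 - t0 * s01) / det
    return b0, b1

# y = 2*x0 + 3*x1 exactly, so the fit should recover (2, 3).
X = [(1.0, 2.0), (2.0, 1.0), (3.0, 4.0), (4.0, 3.0)]
y = [2 * a + 3 * b for a, b in X]
b0, b1 = fit_via_normal_equations(X, y)
assert abs(b0 - 2.0) < 1e-9 and abs(b1 - 3.0) < 1e-9
```

Note this trades the numerical stability of a QR factorization (what R's lm uses) for speed, which is exactly why the BLAS backend matters.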
ucto: Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation and splits sentences. It also offers basic preprocessing steps such as case changing, to make text suitable for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules …
Stars: ✭ 58 (+65.71%)
wikipron: Massively multilingual pronunciation mining
Stars: ✭ 167 (+377.14%)
subword-lstm-lm: LSTM Language Model with Subword Units Input Representations
Stars: ✭ 45 (+28.57%)
gravity: R package that provides estimation methods for gravity models
Stars: ✭ 24 (-31.43%)
mystem-scala: Morphological analyzer `mystem` (Russian language) wrapper for JVM languages
Stars: ✭ 21 (-40%)
dasher-web: Dasher text entry in HTML, CSS, JavaScript, and SVG
Stars: ✭ 34 (-2.86%)
open clip: An open source implementation of CLIP
Stars: ✭ 1,534 (+4282.86%)
cscg: Code Generation as a Dual Task of Code Summarization
Stars: ✭ 28 (-20%)
minicons: Utility for analyzing Transformer-based representations of language
Stars: ✭ 28 (-20%)
citation-function: Measuring the Evolution of a Scientific Field through Citation Frames
Stars: ✭ 40 (+14.29%)
CodeT5: Code for CodeT5, a code-aware pre-trained encoder-decoder model
Stars: ✭ 390 (+1014.29%)
gpt-j-api: API for the GPT-J language model 🦜, including a FastAPI backend and a Streamlit frontend
Stars: ✭ 248 (+608.57%)
bangla-bert: Bangla-Bert is a pretrained BERT model for the Bengali language
Stars: ✭ 41 (+17.14%)
embeddings: State-of-the-art text representations for Natural Language Processing tasks; the initial version of the library focuses on the Polish language
Stars: ✭ 27 (-22.86%)
word2vec-tsne: Google News and Leo Tolstoy: visualizing Word2Vec word embeddings using t-SNE
Stars: ✭ 59 (+68.57%)
minGPT-TF: A minimal TF2 re-implementation of OpenAI's GPT training
Stars: ✭ 36 (+2.86%)
Word-Prediction-Ngram: Next-word prediction using an n-gram probabilistic model with various smoothing techniques
Stars: ✭ 25 (-28.57%)
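The n-gram approach above can be sketched in a few lines: count bigrams, smooth the conditional probabilities, and predict the argmax next word. The corpus below is a made-up toy example, and add-one (Laplace) smoothing stands in for the project's "various smoothing techniques":

```python
from collections import Counter, defaultdict

# Bigram next-word prediction with add-one (Laplace) smoothing.
corpus = "i like tea . i like coffee . you like tea .".split()

bigrams = defaultdict(Counter)
for w1, w2 in zip(corpus, corpus[1:]):
    bigrams[w1][w2] += 1

vocab = sorted(set(corpus))

def next_word_prob(w1, w2):
    """P(w2 | w1) with add-one smoothing over the vocabulary."""
    counts = bigrams[w1]
    return (counts[w2] + 1) / (sum(counts.values()) + len(vocab))

def predict(w1):
    """Most probable next word after w1 under the smoothed model."""
    return max(vocab, key=lambda w: next_word_prob(w1, w))

assert predict("i") == "like"        # "i like" occurs twice in the corpus
assert predict("like") == "tea"      # "like tea" (x2) beats "like coffee" (x1)
```

Smoothing is what keeps unseen bigrams from getting probability zero; fancier schemes (Good-Turing, Kneser-Ney) redistribute the mass less crudely than add-one.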
wechsel: Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models
Stars: ✭ 39 (+11.43%)
CISTEM: Stemmer for German
Stars: ✭ 33 (-5.71%)
language-planner: Official code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"
Stars: ✭ 84 (+140%)
KoLM: Korean text normalization and language preparation package for LMs in Kaldi-based ASR systems
Stars: ✭ 46 (+31.43%)
SentimentAnalysis: Sentiment analysis with a deep Bi-LSTM + attention model
Stars: ✭ 32 (-8.57%)
backprop: Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models
Stars: ✭ 229 (+554.29%)
lxa5: Linguistica 5: Unsupervised Learning of Linguistic Structure
Stars: ✭ 27 (-22.86%)
mlp-gpt-jax: A GPT made only of MLPs, in JAX
Stars: ✭ 53 (+51.43%)
Black-Box-Tuning: Black-Box Tuning for Language-Model-as-a-Service (ICML 2022)
Stars: ✭ 99 (+182.86%)
pyVHDLParser: Streaming-based VHDL parser
Stars: ✭ 51 (+45.71%)
datastories-semeval2017-task6: Deep-learning model presented in "DataStories at SemEval-2017 Task 6: Siamese LSTM with Attention for Humorous Text Comparison"
Stars: ✭ 20 (-42.86%)
foliapy: An extensive Python library for working with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic annotation used in Natural Language Processing (NLP). This library was formerly part of PyNLPl.
Stars: ✭ 13 (-62.86%)
sembei: 🍘 Word embeddings without going through word segmentation 🍘
Stars: ✭ 14 (-60%)
SDLM-pytorch: Code accompanying the EMNLP 2018 paper "Language Modeling with Sparse Product of Sememe Experts"
Stars: ✭ 27 (-22.86%)