Open Korean TextOpen Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+236.92%)
Cogcomp NlpyCogComp's light-weight Python NLP annotators
Stars: ✭ 115 (-11.54%)
PynlplPyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
Stars: ✭ 426 (+227.69%)
FastnlpfastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Stars: ✭ 2,441 (+1777.69%)
NlprePython library for Natural Language Preprocessing (NLPre)
Stars: ✭ 158 (+21.54%)
Stanza OldStanford NLP group's shared Python tools.
Stars: ✭ 142 (+9.23%)
PykakasiNLP: Convert Japanese Kana-kanji sentences into Kana-Roman in simple algorithm.
Stars: ✭ 238 (+83.08%)
Awesome Bert Japanese📝 A list of pre-trained BERT models for Japanese with word/subword tokenization + vocabulary construction algorithm information
Stars: ✭ 76 (-41.54%)
Japanese.jsUtil collection for Japanese text processing. Hiraganize, Katakanize, and Romanize.
Stars: ✭ 150 (+15.38%)
PrenlpPreprocessing Library for Natural Language Processing
Stars: ✭ 130 (+0%)
StringiTHE String Processing Package for R (with ICU)
Stars: ✭ 204 (+56.92%)
TextvecText vectorization tool to outperform TFIDF for classification tasks
Stars: ✭ 167 (+28.46%)
Lingua FrancaMycroft's multilingual text parsing and formatting library
Stars: ✭ 51 (-60.77%)
ToiroA comparison tool of Japanese tokenizers
Stars: ✭ 95 (-26.92%)
NonautoreggenprogressTracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (-9.23%)
Fnc 1 BaselineA baseline implementation for FNC-1
Stars: ✭ 123 (-5.38%)
FlairA very simple framework for state-of-the-art Natural Language Processing (NLP)
Stars: ✭ 11,065 (+8411.54%)
Dat8General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+1066.15%)
FugashiA Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
Stars: ✭ 125 (-3.85%)
ClicrMachine reading comprehension on clinical case reports
Stars: ✭ 123 (-5.38%)
Textcluster短文本聚类预处理模块 Short text cluster
Stars: ✭ 115 (-11.54%)
RbertImplementation of BERT in R
Stars: ✭ 114 (-12.31%)
Files2rougeCalculating ROUGE score between two files (line-by-line)
Stars: ✭ 120 (-7.69%)
DeclutrThe corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!
Stars: ✭ 111 (-14.62%)
PytextrankPython implementation of TextRank for phrase extraction and summarization of text documents
Stars: ✭ 1,675 (+1188.46%)
Nadesiko3Japanese Programming Language Nadesiko v3 (JavaScript)
Stars: ✭ 125 (-3.85%)
Stanford Tensorflow TutorialsThis repository contains code examples for the Stanford's course: TensorFlow for Deep Learning Research.
Stars: ✭ 10,098 (+7667.69%)
Deep LyricsLyrics Generator aka Character-level Language Modeling with Multi-layer LSTM Recurrent Neural Network
Stars: ✭ 127 (-2.31%)
KeitaMy personal toolkit for PyTorch development.
Stars: ✭ 124 (-4.62%)
GseGo efficient multilingual NLP and text segmentation; support english, chinese, japanese and other. Go 高性能多语言 NLP 和分词
Stars: ✭ 1,695 (+1203.85%)
Unified SummarizationOfficial codes for the paper: A Unified Model for Extractive and Abstractive Summarization using Inconsistency Loss.
Stars: ✭ 114 (-12.31%)
Spacy Js🎀 JavaScript API for spaCy with Python REST API
Stars: ✭ 123 (-5.38%)
Tensorflow NlpNLP and Text Generation Experiments in TensorFlow 2.x / 1.x
Stars: ✭ 1,487 (+1043.85%)
Lingopackage lingo provides the data structures and algorithms required for natural language processing
Stars: ✭ 113 (-13.08%)
Nlp Pretrained ModelA collection of Natural language processing pre-trained models.
Stars: ✭ 122 (-6.15%)
Colibri CoreColibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.
Stars: ✭ 112 (-13.85%)
LibasciidocA Golang library for processing Asciidoc files.
Stars: ✭ 129 (-0.77%)
Commonsense RcCode for Yuanfudao at SemEval-2018 Task 11: Three-way Attention and Relational Knowledge for Commonsense Machine Comprehension
Stars: ✭ 112 (-13.85%)
Opus MtOpen neural machine translation models and web services
Stars: ✭ 111 (-14.62%)
Nlp PapersPapers and Book to look at when starting NLP 📚
Stars: ✭ 111 (-14.62%)
PadatiousA neural network intent parser
Stars: ✭ 124 (-4.62%)
Cs230 Code ExamplesCode examples in pyTorch and Tensorflow for CS230
Stars: ✭ 1,701 (+1208.46%)
DanlpDaNLP is a repository for Natural Language Processing resources for the Danish Language.
Stars: ✭ 111 (-14.62%)
DialoglueDialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue
Stars: ✭ 120 (-7.69%)
Detecting Scientific ClaimExtracting scientific claims from biomedical abstracts (powered by AllenNLP), demo:
Stars: ✭ 109 (-16.15%)
Neuraldialog LarlPyTorch implementation of latent space reinforcement learning for E2E dialog published at NAACL 2019. It is released by Tiancheng Zhao (Tony) from Dialog Research Center, LTI, CMU
Stars: ✭ 127 (-2.31%)
Dan Jurafsky Chris Manning NlpMy solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.
Stars: ✭ 124 (-4.62%)
IchiranLinguistic tools for texts in Japanese language
Stars: ✭ 120 (-7.69%)
Posuto🏣📮〠 Japanese postal code data.
Stars: ✭ 109 (-16.15%)