Textvec
Text vectorization tool to outperform TFIDF for classification tasks
Stars: ✭ 167 (+255.32%)
Pyresparser
A simple resume parser used for extracting information from resumes
Stars: ✭ 297 (+531.91%)
Link Grammar
The CMU Link Grammar natural language parser
Stars: ✭ 286 (+508.51%)
Py Nltools
A collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (-2.13%)
Nlp
Selected Machine Learning algorithms for natural language processing and semantic analysis in Golang
Stars: ✭ 304 (+546.81%)
Postagga
A Library to parse natural language in pure Clojure and ClojureScript
Stars: ✭ 152 (+223.4%)
Kadot
Kadot, the unsupervised natural language processing library.
Stars: ✭ 108 (+129.79%)
Thot
Thot toolkit for statistical machine translation
Stars: ✭ 53 (+12.77%)
Udpipe
R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Stars: ✭ 160 (+240.43%)
Lfuzzer
Fuzzing Parsers with Tokens
Stars: ✭ 28 (-40.43%)
Nlp In Practice
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Stars: ✭ 790 (+1580.85%)
Open Korean Text
Open Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+831.91%)
Works For Me
Collection of developer toolkits
Stars: ✭ 131 (+178.72%)
Php Parser
🌿 NodeJS PHP Parser - extract AST or tokens (PHP5 and PHP7)
Stars: ✭ 400 (+751.06%)
Vntk
Vietnamese NLP Toolkit for Node
Stars: ✭ 170 (+261.7%)
Query Translator
Query Translator is a search query translator with AST representation
Stars: ✭ 165 (+251.06%)
String Calc
PHP calculator library for mathematical terms (expressions) passed as strings
Stars: ✭ 60 (+27.66%)
Pynlp
A pythonic wrapper for Stanford CoreNLP.
Stars: ✭ 103 (+119.15%)
Tokenizer
Fast and customizable text tokenization library with BPE and SentencePiece support
Stars: ✭ 132 (+180.85%)
text2text
Text2Text: Cross-lingual natural language processing and generation toolkit
Stars: ✭ 188 (+300%)
Sharpmath
A small .NET math library.
Stars: ✭ 36 (-23.4%)
Gsoc2018 3gm
💫 Automated codification of Greek Legislation with NLP
Stars: ✭ 36 (-23.4%)
Textblob
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
Stars: ✭ 7,991 (+16902.13%)
Biomedicus
Code for the old version of BioMedICUS, for the new version see the biomedicus3 repository.
Stars: ✭ 45 (-4.26%)
Vale
📝 A syntax-aware linter for prose built with speed and extensibility in mind.
Stars: ✭ 978 (+1980.85%)
Rebiber
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
Stars: ✭ 1,005 (+2038.3%)
Nhazm
A C# version of Hazm (Python library for digesting Persian text)
Stars: ✭ 35 (-25.53%)
Textrank
TextRank implementation for Python 3.
Stars: ✭ 1,008 (+2044.68%)
Tidytext
Text mining using tidy tools ✨📄✨
Stars: ✭ 975 (+1974.47%)
Freeml
A List of Data Science/Machine Learning Resources (Mostly Free)
Stars: ✭ 974 (+1972.34%)
Exemplar
An open relation extraction system
Stars: ✭ 46 (-2.13%)
Configparser
Config ini file parser in Go
Stars: ✭ 40 (-14.89%)
Deepnlp
A natural language processing library based on deep learning
Stars: ✭ 34 (-27.66%)
Logos
Create ridiculously fast Lexers
Stars: ✭ 1,001 (+2029.79%)
Budou
Budou is an automatic organizer tool for beautiful line breaking in CJK (Chinese, Japanese, and Korean).
Stars: ✭ 971 (+1965.96%)
Substitution Schedule Parser
Java library for parsing schools' substitution schedules. Supports multiple systems used mainly in German-speaking countries, including Untis, svPlan, and DAVINCI
Stars: ✭ 33 (-29.79%)
Fast Xml Parser
Validate XML, Parse XML to JS/JSON and vice versa, or parse XML to Nimn rapidly without C/C++ based libraries and no callback
Stars: ✭ 1,021 (+2072.34%)
Rocket
NetDisk in command line.
Stars: ✭ 40 (-14.89%)
Metasra Pipeline
MetaSRA: normalized sample-specific metadata for the Sequence Read Archive
Stars: ✭ 33 (-29.79%)
Blocks
Blocks World -- Simulator, Code, and Models (Misra et al. EMNLP 2017)
Stars: ✭ 39 (-17.02%)
Pqg Pytorch
Paraphrase Generation model using pair-wise discriminator loss
Stars: ✭ 33 (-29.79%)
Nlp Js Tools French
POS Tagger, lemmatizer and stemmer for the French language in JavaScript
Stars: ✭ 32 (-31.91%)
Inicpp
C++ parser of INI files with schema validation.
Stars: ✭ 47 (+0%)
Ical Rs
Rust parser for ics (rfc5545) and vcard (rfc6350)
Stars: ✭ 46 (-2.13%)
Edn Data
EDN parser and generator that works with plain JS data, with support for TS and node streams
Stars: ✭ 44 (-6.38%)
Goawk
A POSIX-compliant AWK interpreter written in Go
Stars: ✭ 995 (+2017.02%)
Wikisql
A large annotated semantic parsing corpus for developing natural language interfaces.
Stars: ✭ 965 (+1953.19%)
Parson
Lightweight JSON library written in C.
Stars: ✭ 965 (+1953.19%)
Typing Assistant
Typing Assistant autocompletes words and suggests predictions for the next word, which makes typing faster and reduces effort.
Stars: ✭ 32 (-31.91%)