OpencorporaA web-based engine for creating and annotating textual corpora
Stars: ✭ 204 (+920%)
politic-botsTools and algorithms to analyze Paraguayan Tweets in times of elections
Stars: ✭ 26 (+30%)
Repo 2016R, Python and Mathematica Codes in Machine Learning, Deep Learning, Artificial Intelligence, NLP and Geolocation
Stars: ✭ 103 (+415%)
Rime CantoneseRime Cantonese input schema | 粵語拼音輸入方案
Stars: ✭ 173 (+765%)
use-cases-of-bertUse-cases of Hugging Face's BERT (e.g. paraphrase generation, unsupervised extractive summarization).
Stars: ✭ 18 (-10%)
koikaA core language for rule-based hardware design 🦑
Stars: ✭ 103 (+415%)
phd-resourcesInternet Delivered Treatment using Adaptive Technology
Stars: ✭ 37 (+85%)
TossiChooses correct Korean particle morphs for arbitrary words.
Stars: ✭ 160 (+700%)
meowDaily Bruin's homemade social media manager
Stars: ✭ 42 (+110%)
PycantoneseCantonese Linguistics and NLP in Python
Stars: ✭ 147 (+635%)
urdu-characters📄 Complete collection of Urdu language characters & unicode code points.
Stars: ✭ 24 (+20%)
score-zeroshotSemantically consistent regularizer for zero-shot learning
Stars: ✭ 65 (+225%)
MLH-QuizzetThis is a smart Quiz Generator that generates a dynamic quiz from any uploaded text/PDF document using NLP. This can be used for self-analysis, question paper generation, and evaluation, thus reducing human effort.
Stars: ✭ 23 (+15%)
Nuts自然语言处理常见任务(主要包括文本分类,序列标注,自动问答等)解决方案试验田
Stars: ✭ 21 (+5%)
RiksdagskollenRepository for development of Riksdagskollen
Stars: ✭ 27 (+35%)
ibm-ai-dayPresentation for IBM Community Day AI
Stars: ✭ 13 (-35%)
Colibri CoreColibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.
Stars: ✭ 112 (+460%)
coreWIP - A personal life helper providing solutions and happiness
Stars: ✭ 17 (-15%)
CjworkbenchThe data journalism platform with built in training
Stars: ✭ 244 (+1120%)
Elpis🙊 WIP software for creating speech recognition models.
Stars: ✭ 101 (+405%)
OpenPromptAn Open-Source Framework for Prompt-Learning.
Stars: ✭ 1,769 (+8745%)
AmfvA Mind Forever Voyaging, by Steve Meretzky (Infocom)
Stars: ✭ 81 (+305%)
phrase-at-scaleDetect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages other than English
Stars: ✭ 115 (+475%)
FlatFoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.github.io/folia), a rich XML-based format for linguistic annotation. Flat allows users to view annotated FoLiA documents and enrich these documents with new annotations, a wide variety of linguistic annotation types is supported through the FoLiA paradigm.
Stars: ✭ 93 (+365%)
ake-datasetsLarge, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.
Stars: ✭ 125 (+525%)
CivilThe Main Monorepo and entry-point of all things Civil
Stars: ✭ 181 (+805%)
Quora question pairs NLP KaggleQuora Kaggle Competition : Natural Language Processing using word2vec embeddings, scikit-learn and xgboost for training
Stars: ✭ 17 (-15%)
BetaAn open source reimplementation of Benny Brodda's BETA in Python
Stars: ✭ 65 (+225%)
alter-nluNatural language understanding library for chatbots with intent recognition and entity extraction.
Stars: ✭ 45 (+125%)
votacidade-appCalculadora de afinidade para o Vota Cidade 2020
Stars: ✭ 12 (-40%)
arabic-taggerAQMAR Arabic Tagger: Sequence tagger with cost-augmented structured perceptron training
Stars: ✭ 38 (+90%)
TarbellA Flask-based static site authoring tool.
Stars: ✭ 159 (+695%)
pytorch-translmAn implementation of transformer-based language model for sentence rewriting tasks such as summarization, simplification, and grammatical error correction.
Stars: ✭ 22 (+10%)
PhonemesJason Riggle's chart of phonological features in JSON format + extras
Stars: ✭ 33 (+65%)
lidtkLanguage Identification Toolkit
Stars: ✭ 17 (-15%)
kexKex is a python library for unsupervised keyword extraction from a document, providing an easy interface and benchmarks on 15 public datasets.
Stars: ✭ 46 (+130%)
contact-officialsForm definitions powering Resistbot's electronic deliveries to elected officials in the United States.
Stars: ✭ 29 (+45%)
rsyntaxtreeSyntax tree generator made with Ruby and RMagic
Stars: ✭ 62 (+210%)
WonderfulPolishLanguageThis is a repository created for the list of resources for learning and exploring Wonderful Polish language.
Stars: ✭ 31 (+55%)
Paribhashaparibhasha.herokuapp.com/
Stars: ✭ 21 (+5%)
Call My CongressDEPRECATED. Simple app that displays contact information for US Congress representatives by district.
Stars: ✭ 125 (+525%)
5callsFrontend for the 5calls.org site
Stars: ✭ 369 (+1745%)
frames🖼 A minimalistic take on responsive iframes in the spirit of Pym.js.
Stars: ✭ 19 (-5%)
Mrc book《机器阅读理解:算法与实践》代码
Stars: ✭ 102 (+410%)