lingtypologyR package for linguistic cartography and typological databases search
Stars: ✭ 47 (+88%)
33 Js Concepts📜 33 JavaScript concepts every developer should know.
Stars: ✭ 45,558 (+182132%)
HangulizeHangulize transcribes non-Korean words into Hangul
Stars: ✭ 152 (+508%)
programming-with-cpp20Companion source code for "Programming with C++20 - Concepts, Coroutines, Ranges, and more"
Stars: ✭ 142 (+468%)
pylangacqLanguage Acquisition Research Tools
Stars: ✭ 33 (+32%)
TextDatasetCleaner🔬 Очистка датасетов от мусора (нормализация, препроцессинг)
Stars: ✭ 27 (+8%)
HangulizeKorean Alphabet Transcription
Stars: ✭ 184 (+636%)
expletivesExpletives vomiting library...
Stars: ✭ 12 (-52%)
devPHOIBLE data and development.
Stars: ✭ 90 (+260%)
IchiranLinguistic tools for texts in Japanese language
Stars: ✭ 120 (+380%)
data-science-learning📊 All of courses, assignments, exercises, mini-projects and books that I've done so far in the process of learning by myself Machine Learning and Data Science.
Stars: ✭ 32 (+28%)
lambda-notebookLambda Notebook: Formal Semantics in Jupyter
Stars: ✭ 16 (-36%)
nyt-first-saidTweets when words are published for the first time in the NYT
Stars: ✭ 222 (+788%)
eliza-rsA rust implementation of ELIZA - a natural language processing program developed by Joseph Weizenbaum in 1966.
Stars: ✭ 48 (+92%)
pfootprintPolitical Discourse Analysis Using Pre-Trained Word Vectors.
Stars: ✭ 20 (-20%)
mystemCGo bindings to Yandex.Mystem
Stars: ✭ 28 (+12%)
Awesome LinguisticsA curated list of anything remotely related to linguistics
Stars: ✭ 207 (+728%)
NamingThingsContent on tips, tricks, advice, practices for naming things in in software/technology
Stars: ✭ 31 (+24%)
ProsodicProsodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.
Stars: ✭ 162 (+548%)
thread poolThread pool using std::* primitives from C++17, with optional priority queue/greenthreading for POSIX.
Stars: ✭ 74 (+196%)
Ipa DictMonolingual wordlists with pronunciation information in IPA
Stars: ✭ 139 (+456%)
languaA suite of language tools
Stars: ✭ 29 (+16%)
copycatModern port of Melanie Mitchell's and Douglas Hofstadter's Copycat
Stars: ✭ 84 (+236%)
Colibri CoreColibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.
Stars: ✭ 112 (+348%)
lingvo--Ner-ruNamed entity recognition (NER) in Russian texts / Определение именованных сущностей (NER) в тексте на русском языке
Stars: ✭ 38 (+52%)
foliaFoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for proces…
Stars: ✭ 56 (+124%)
OnsetA language evolution simulator, using realistic phonetic changes.
Stars: ✭ 30 (+20%)
notes📓 Notes related to Computer Science stuff.
Stars: ✭ 15 (-40%)
OpenGNTOpen Greek New Testament Project; NA28 / NA27 Equivalent Text & Resources
Stars: ✭ 55 (+120%)
proiel-treebankOfficial releases of the PROIEL treebank of ancient Indo-European languages
Stars: ✭ 30 (+20%)
NatLangNatLang is an English parser with an extensible grammar
Stars: ✭ 20 (-20%)
GNOME-ConceptsConcepts and ideas for the GNOME desktop
Stars: ✭ 13 (-48%)
poesyPoetic processing, for Python.
Stars: ✭ 28 (+12%)
LangPadA word processor/dictionary/generally useful tool for linguistics.
Stars: ✭ 20 (-20%)
WonderfulPolishLanguageThis is a repository created for the list of resources for learning and exploring Wonderful Polish language.
Stars: ✭ 31 (+24%)
TextGridToolsRead, write, and manipulate Praat TextGrid files with Python
Stars: ✭ 84 (+236%)
mlconjug3A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.
Stars: ✭ 47 (+88%)
OpencorporaA web-based engine for creating and annotating textual corpora
Stars: ✭ 204 (+716%)
Rime CantoneseRime Cantonese input schema | 粵語拼音輸入方案
Stars: ✭ 173 (+592%)
libpalasoPalaso Library: A set of .Net libraries useful for developers of Language Software.
Stars: ✭ 36 (+44%)
TossiChooses correct Korean particle morphs for arbitrary words.
Stars: ✭ 160 (+540%)
cefal(Concepts-enabled) Functional Abstraction Layer for C++
Stars: ✭ 52 (+108%)
PycantoneseCantonese Linguistics and NLP in Python
Stars: ✭ 147 (+488%)
verbeccComplete Conjugation of any Verb using Machine Learning for French, Spanish, Portuguese, Italian and Romanian
Stars: ✭ 45 (+80%)
linguisticsdownEasy Linguistics Document Writing with R Markdown
Stars: ✭ 24 (-4%)
KoParadigmKoParadigm: Korean Inflectional Paradigm Generator
Stars: ✭ 48 (+92%)
Movie Trailers SwiftUIA simple app which shows the lastest movies trailers based on different genres developed using SwiftUI.
Stars: ✭ 51 (+104%)
wikipronMassively multilingual pronunciation mining
Stars: ✭ 167 (+568%)
dureeDurée: the longest book ever written.
Stars: ✭ 67 (+168%)
lametaThe Metadata Editor for Transparent Archiving of language document materials
Stars: ✭ 18 (-28%)
ngramrR package to query the Google Ngram Viewer
Stars: ✭ 46 (+84%)