exams-qa
A Multi-subject High School Examinations Dataset for Cross-lingual and Multilingual Question Answering
Stars: ✭ 25 (-50.98%)
Position-Aware-Tagging-for-ASTE
Code and models for the paper "Position-Aware Tagging for Aspect Sentiment Triplet Extraction", EMNLP 2020.
Stars: ✭ 70 (+37.25%)
auto-movie-tagger
A Python script that auto-tags mkv or mp4 movie files and adds a poster to them.
Stars: ✭ 49 (-3.92%)
TweebankNLP
[LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweebank-NER dataset
Stars: ✭ 84 (+64.71%)
microalg
Language and environments dedicated to algorithmics.
Stars: ✭ 12 (-76.47%)
French-Dictionary
CSV files containing all French adjectives, adverbs, conjunctions, determiners, nouns, prepositions, pronouns, and verbs, with their gender, types, and conjugations
Stars: ✭ 74 (+45.1%)
gogtags
GNU global compatible source code tagging for golang
Stars: ✭ 42 (-17.65%)
farasapy
A Python implementation of the Farasa toolkit
Stars: ✭ 69 (+35.29%)
lex
Lex is an implementation of the lex tool in Ruby.
Stars: ✭ 49 (-3.92%)
Translatr
💬 Translate to multiple languages at once
Stars: ✭ 145 (+184.31%)
sketch-crowdin
Connect your Sketch and Crowdin projects together
Stars: ✭ 35 (-31.37%)
awesome-made-by-germans
🇩🇪 The best open source projects that were made and mainly contributed to by German developers
Stars: ✭ 170 (+233.33%)
tagify
Tagify produces a set of tags from a given source. The source can be an HTML page, a Markdown document, or plain text. Supports English, Russian, Chinese, Hindi, Spanish, Arabic, Japanese, German, Hebrew, French and Korean.
Stars: ✭ 24 (-52.94%)
Laserembeddings
LASER multilingual sentence embeddings as a pip package
Stars: ✭ 125 (+145.1%)
BARIS
Use the French Open Data Portal API features from R
Stars: ✭ 21 (-58.82%)
HistoryOfMe
Your own personal diary.
Stars: ✭ 50 (-1.96%)
Text-Classification-LSTMs-PyTorch
This repository shows a baseline model for text classification by implementing an LSTM-based model in PyTorch. To make the model easier to understand, it is applied to a Tweets dataset provided by Kaggle.
Stars: ✭ 45 (-11.76%)
hunspell
High-Performance Stemmer, Tokenizer, and Spell Checker for R
Stars: ✭ 101 (+98.04%)
wiktionary-de-parser
Extract data from German Wiktionary XML files. Allows you to add your own extraction methods 🚀
Stars: ✭ 22 (-56.86%)
PhantomBotDE
PhantomBotDE is an actively developed, interactive open source Twitch bot with a lively community. It provides entertainment and moderation for your channel, so you can concentrate on what really matters: your game and your viewers.
Stars: ✭ 24 (-52.94%)
lexertk
C++ Lexer Toolkit Library (LexerTk) https://www.partow.net/programming/lexertk/index.html
Stars: ✭ 26 (-49.02%)
nlp-cheat-sheet-python
NLP Cheat Sheet, Python, spaCy, LexNLP, NLTK, tokenization, stemming, sentence detection, named entity recognition
Stars: ✭ 69 (+35.29%)
etiquette
WIP tag-based file organizer & search
Stars: ✭ 27 (-47.06%)
textbox
Text collections made available by the CLiGS group.
Stars: ✭ 19 (-62.75%)
mok-project
Multilingual Onscreen Keyboard Project
Stars: ✭ 27 (-47.06%)
urteile-gesetze-web
Web frontend of the legal information system urteile-gesetze.de
Stars: ✭ 16 (-68.63%)
SwiLex
A universal lexer library in Swift.
Stars: ✭ 29 (-43.14%)
DAnki
DAnki: Automate deck creation for Anki to learn German
Stars: ✭ 16 (-68.63%)
ingredients
Extract recipe ingredients from any recipe website on the internet.
Stars: ✭ 96 (+88.24%)
Roy VnTokenizer
Vietnamese tokenizer (Maximum Matching and CRF)
Stars: ✭ 49 (-3.92%)
springcrm
An open-source CRM.
Stars: ✭ 14 (-72.55%)
tkseem
Arabic Tokenization Library. It provides many tokenization algorithms.
Stars: ✭ 45 (-11.76%)
additional tags
Redmine Plugin for adding tags functionality to issues and wiki pages.
Stars: ✭ 25 (-50.98%)
meta-audio
A PHP library to read and write metadata tags to audio files (MP3, ID3, APE, etc)
Stars: ✭ 32 (-37.25%)
gd-tokenizer
A small Godot project with a tokenizer written in GDScript.
Stars: ✭ 34 (-33.33%)
greeb
Greeb is a simple Unicode-aware regexp-based tokenizer.
Stars: ✭ 16 (-68.63%)
geomm
Geometry-aware Multilingual Embeddings
Stars: ✭ 23 (-54.9%)
Core
🧿 Bolt 4 core
Stars: ✭ 243 (+376.47%)
python-mecab
A repository to bind mecab for Python 3.5+. Uses neither swig nor pybind. (No longer maintained)
Stars: ✭ 27 (-47.06%)
next-multilingual
An opinionated end-to-end solution for Next.js applications that require multiple languages.
Stars: ✭ 135 (+164.71%)
Elefant
Elefant, the refreshingly simple PHP CMS and web framework.
Stars: ✭ 188 (+268.63%)
LangageLinotte
Official source code of the Linotte programming language. Linotte is a simple French-language programming language created so that children and people without in-depth knowledge of computing can learn programming easily.
Stars: ✭ 29 (-43.14%)
Vosk
VOSK Speech Recognition Toolkit
Stars: ✭ 182 (+256.86%)
pehchaan
Devanagari Character Recognition
Stars: ✭ 28 (-45.1%)
Mimick
Code for Mimicking Word Embeddings using Subword RNNs (EMNLP 2017)
Stars: ✭ 152 (+198.04%)
SSCTaglistView
Customizable iOS tag list view, in Swift.
Stars: ✭ 54 (+5.88%)
Localization
🌐 Localization package for Laravel
Stars: ✭ 142 (+178.43%)
spacy-server
🦜 Containerized HTTP API for industrial-strength NLP via spaCy and sense2vec
Stars: ✭ 58 (+13.73%)
wink-nlp
Developer friendly Natural Language Processing ✨
Stars: ✭ 312 (+511.76%)
checks-out
Checks-Out pull request approval system
Stars: ✭ 79 (+54.9%)
htr-united
Ground Truth Resources for the HTR of patrimonial documents
Stars: ✭ 23 (-54.9%)
berserker
Berserker - BERt chineSE woRd toKenizER
Stars: ✭ 17 (-66.67%)
jargon
Tokenizers and lemmatizers for Go
Stars: ✭ 98 (+92.16%)