ATKSpythis repository is a python package that supports SOAP interface to communicate with the Microsoft ATKS
Stars: ✭ 27 (-60.87%)
arabic-taggerAQMAR Arabic Tagger: Sequence tagger with cost-augmented structured perceptron training
Stars: ✭ 38 (-44.93%)
tajmeeatonتجميعة من المشاريع، وخصوصا مفتوحة المصدر، للنهوض باللغة العربية والأمة. 👨💻 👨🔬👨🏫🧕
Stars: ✭ 115 (+66.67%)
Camel toolsA suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
Stars: ✭ 124 (+79.71%)
ar-embeddingsSentiment Analysis for Arabic Text (tweets, reviews, and standard Arabic) using word2vec
Stars: ✭ 83 (+20.29%)
BasicArabicOCRA very basic Arabic OCR based on tesseract OCR engine written in Java.
Stars: ✭ 19 (-72.46%)
nmathegA simple strategy for training and finetuning NLP models for Arabic. Specify the parameters and just wait for the results. A simple design that makes use of the different tools in our NLP pipeline.
Stars: ✭ 19 (-72.46%)
namacoCharacter Based Named Entity Recognition.
Stars: ✭ 41 (-40.58%)
nlp-cheat-sheet-pythonNLP Cheat Sheet, Python, spacy, LexNPL, NLTK, tokenization, stemming, sentence detection, named entity recognition
Stars: ✭ 69 (+0%)
arabic-jekyllابدأ بالتدوين باستخدام جيكل بلحضات وبدون لمس سطر الأوامر
Stars: ✭ 36 (-47.83%)
SumrizedAutomatic Text Summarization (English/Arabic).
Stars: ✭ 37 (-46.38%)
ner-tagger-dynetSee http://github.com/onurgu/joint-ner-and-md-tagger This repository is basically a Bi-LSTM based sequence tagger in both Tensorflow and Dynet which can utilize several sources of information about each word unit like word embeddings, character based embeddings and morphological tags from an FST to obtain the representation for that specific wor…
Stars: ✭ 23 (-66.67%)
CrossNERCrossNER: Evaluating Cross-Domain Named Entity Recognition (AAAI-2021)
Stars: ✭ 87 (+26.09%)
snapdragon-lexerConverts a string into an array of tokens, with useful methods for looking ahead and behind, capturing, matching, et cetera.
Stars: ✭ 19 (-72.46%)
anonymization-apiHow to build and deploy an anonymization API with FastAPI
Stars: ✭ 51 (-26.09%)
SynLSTM-for-NERCode and models for the paper titled "Better Feature Integration for Named Entity Recognition", NAACL 2021.
Stars: ✭ 26 (-62.32%)
banglabertThis repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accpeted in Findings of the Annual Conference of the North American Chap…
Stars: ✭ 186 (+169.57%)
comparable-text-minerComparable documents miner: Arabic-English morphological analysis, text processing, n-gram features extraction, POS tagging, dictionary translation, documents alignment, corpus information, text classification, tf-idf computation, text similarity computation, html documents cleaning
Stars: ✭ 31 (-55.07%)
Wisty.js🧚♀️ Chatbot library turning conversations into actions, locally, in the browser.
Stars: ✭ 24 (-65.22%)
deepnlp小时候练手的nlp项目
Stars: ✭ 11 (-84.06%)
SwiLexA universal lexer library in Swift.
Stars: ✭ 29 (-57.97%)
BERTOverflowA Pre-trained BERT on StackOverflow Corpus
Stars: ✭ 40 (-42.03%)
molminerPython library and command-line tool for extracting compounds from scientific literature. Written in Python.
Stars: ✭ 38 (-44.93%)
neural name taggingCode for "Reliability-aware Dynamic Feature Composition for Name Tagging" (ACL2019)
Stars: ✭ 39 (-43.48%)
Text-Classification-LSTMs-PyTorchThe aim of this repository is to show a baseline model for text classification by implementing a LSTM-based model coded in PyTorch. In order to provide a better understanding of the model, it will be used a Tweets dataset provided by Kaggle.
Stars: ✭ 45 (-34.78%)
teanaps자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.
Stars: ✭ 91 (+31.88%)
TwitterNERTwitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html
Stars: ✭ 134 (+94.2%)
xontrib-output-searchGet identifiers, paths, URLs and words from the previous command output and use them for the next command in xonsh shell.
Stars: ✭ 26 (-62.32%)
PersianNERNamed-Entity Recognition in Persian Language
Stars: ✭ 48 (-30.43%)
hunspellHigh-Performance Stemmer, Tokenizer, and Spell Checker for R
Stars: ✭ 101 (+46.38%)
ckipnlpCKIP CoreNLP Toolkits
Stars: ✭ 92 (+33.33%)
scikitcrf NERPython library for custom entity recognition using Sklearn CRF
Stars: ✭ 17 (-75.36%)
tokenizerA simple tokenizer in Ruby for NLP tasks.
Stars: ✭ 44 (-36.23%)
TweebankNLP[LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweebank-NER dataset
Stars: ✭ 84 (+21.74%)
linderaA morphological analysis library.
Stars: ✭ 226 (+227.54%)
qahiriQahiri (قاهري) is a manuscript Kufic typeface
Stars: ✭ 45 (-34.78%)
suikaSuika 🍉 is a Japanese morphological analyzer written in pure Ruby
Stars: ✭ 31 (-55.07%)
CogIECogIE: An Information Extraction Toolkit for Bridging Text and CogNet. ACL 2021
Stars: ✭ 47 (-31.88%)
TokenizerA tokenizer for Icelandic text
Stars: ✭ 27 (-60.87%)
HurufA simple chrome extension to make reading Arabic easier
Stars: ✭ 23 (-66.67%)
psr2r-snifferA PSR-2-R code sniffer and code-style auto-correction-tool - including many useful additions
Stars: ✭ 32 (-53.62%)
Quran-Pro-iOSDescription Qur’an Pro - القرآن الكريم offers all muslims the complete Holy Quran with verse by verse recitation, translation and transcription. The application is written in SWIFT 2.0
Stars: ✭ 21 (-69.57%)
gd-tokenizerA small godot project with a tokenizer written in GDScript.
Stars: ✭ 34 (-50.72%)
PhoNER COVID19COVID-19 Named Entity Recognition for Vietnamese (NAACL 2021)
Stars: ✭ 55 (-20.29%)
lexertkC++ Lexer Toolkit Library (LexerTk) https://www.partow.net/programming/lexertk/index.html
Stars: ✭ 26 (-62.32%)
python-mecabA repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)
Stars: ✭ 27 (-60.87%)
spertPyTorch code for SpERT: Span-based Entity and Relation Transformer
Stars: ✭ 572 (+728.99%)
graspEssential NLP & ML, short & fast pure Python code
Stars: ✭ 58 (-15.94%)
lexLex is an implementation of lex tool in Ruby.
Stars: ✭ 49 (-28.99%)
araع Command line tool that displays Arabic text in terminal.
Stars: ✭ 27 (-60.87%)