All Projects → tkseem → Similar Projects or Alternatives

35 Open source projects that are alternatives of or similar to tkseem

Spacy
💫 Industrial-strength Natural Language Processing (NLP) in Python
Stars: ✭ 21,978 (+48740%)
Mutual labels:  tokenization
charformer-pytorch
Implementation of the GBST block from the Charformer paper, in Pytorch
Stars: ✭ 74 (+64.44%)
Mutual labels:  tokenization
vgs-collect-ios
VGS Collect iOS SDK
Stars: ✭ 17 (-62.22%)
Mutual labels:  tokenization
uax29
A tokenizer based on Unicode text segmentation (UAX 29), for Go
Stars: ✭ 26 (-42.22%)
Mutual labels:  tokenization
youtokentome-ruby
High performance unsupervised text tokenization for Ruby
Stars: ✭ 17 (-62.22%)
Mutual labels:  tokenization
auto-data-tokenize
Identify and tokenize sensitive data automatically using Cloud DLP and Dataflow
Stars: ✭ 21 (-53.33%)
Mutual labels:  tokenization
simplemma
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
Stars: ✭ 32 (-28.89%)
Mutual labels:  tokenization
bert tokenization for java
This is a java version of Chinese tokenization descried in BERT.
Stars: ✭ 39 (-13.33%)
Mutual labels:  tokenization
polycash
The ultimate open source betting protocol. PolyCash is a P2P blockchain platform for wallets, asset issuance, bonds & gaming.
Stars: ✭ 24 (-46.67%)
Mutual labels:  tokenization
spacy russian tokenizer
Custom Russian tokenizer for spaCy
Stars: ✭ 35 (-22.22%)
Mutual labels:  tokenization
wink-tokenizer
Multilingual tokenizer that automatically tags each token with its type
Stars: ✭ 51 (+13.33%)
Mutual labels:  tokenization
spacy-server
🦜 Containerized HTTP API for industrial-strength NLP via spaCy and sense2vec
Stars: ✭ 58 (+28.89%)
Mutual labels:  tokenization
ling
Natural Language Processing Toolkit in Golang
Stars: ✭ 57 (+26.67%)
Mutual labels:  tokenization
nlp-cheat-sheet-python
NLP Cheat Sheet, Python, spacy, LexNPL, NLTK, tokenization, stemming, sentence detection, named entity recognition
Stars: ✭ 69 (+53.33%)
Mutual labels:  tokenization
FAT
Factom Asset Tokens - Open tokenization standards on Factom
Stars: ✭ 17 (-62.22%)
Mutual labels:  tokenization
xontrib-output-search
Get identifiers, paths, URLs and words from the previous command output and use them for the next command in xonsh shell.
Stars: ✭ 26 (-42.22%)
Mutual labels:  tokenization
TweebankNLP
[LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweebank-NER dataset
Stars: ✭ 84 (+86.67%)
Mutual labels:  tokenization
lunasec
LunaSec - Dependency Security Scanner that automatically notifies you about vulnerabilities like Log4Shell or node-ipc in your Pull Requests and Builds. Protect yourself in 30 seconds with the LunaTrace GitHub App: https://github.com/marketplace/lunatrace-by-lunasec/
Stars: ✭ 1,261 (+2702.22%)
Mutual labels:  tokenization
lima
The Libre Multilingual Analyzer, a Natural Language Processing (NLP) C++ toolkit.
Stars: ✭ 75 (+66.67%)
Mutual labels:  tokenization
Vaaku2Vec
Language Modeling and Text Classification in Malayalam Language using ULMFiT
Stars: ✭ 68 (+51.11%)
Mutual labels:  tokenization
BasicArabicOCR
A very basic Arabic OCR based on tesseract OCR engine written in Java.
Stars: ✭ 19 (-57.78%)
Mutual labels:  arabic-nlp
ATKSpy
this repository is a python package that supports SOAP interface to communicate with the Microsoft ATKS
Stars: ✭ 27 (-40%)
Mutual labels:  arabic-nlp
alyahmor
Arabic flexionnal morphology generator
Stars: ✭ 22 (-51.11%)
Mutual labels:  arabic-nlp
arabic-stop-words
Largest list of Arabic stop words on Github. أكبر قائمة لمستبعدات الفهرسة العربية على جيت هاب
Stars: ✭ 193 (+328.89%)
Mutual labels:  arabic-nlp
ArSarcasm
This repository contains the Arabic sarcasm dataset (ArSarcasm)
Stars: ✭ 18 (-60%)
Mutual labels:  arabic-nlp
Arabic-Tashkeela-Model
This is a diacritization model for Arabic language. This model was built/trained using the Tashkeela: the Arabic diacritization corpus on Kaggle
Stars: ✭ 15 (-66.67%)
Mutual labels:  arabic-nlp
masader
The largest public catalogue for Arabic NLP and speech datasets. There are +250 datasets annotated with more than 25 attributes.
Stars: ✭ 66 (+46.67%)
Mutual labels:  arabic-nlp
tajmeeaton
تجميعة من المشاريع، وخصوصا مفتوحة المصدر، للنهوض باللغة العربية والأمة. 👨‍💻 👨‍🔬👨‍🏫🧕
Stars: ✭ 115 (+155.56%)
Mutual labels:  arabic-nlp
farasapy
A Python implementation of Farasa toolkit
Stars: ✭ 69 (+53.33%)
Mutual labels:  arabic-nlp
arabic-sentiment-analysis
Sentiment Analysis in Arabic tweets
Stars: ✭ 64 (+42.22%)
Mutual labels:  arabic-nlp
comparable-text-miner
Comparable documents miner: Arabic-English morphological analysis, text processing, n-gram features extraction, POS tagging, dictionary translation, documents alignment, corpus information, text classification, tf-idf computation, text similarity computation, html documents cleaning
Stars: ✭ 31 (-31.11%)
Mutual labels:  arabic-nlp
arabic-tagger
AQMAR Arabic Tagger: Sequence tagger with cost-augmented structured perceptron training
Stars: ✭ 38 (-15.56%)
Mutual labels:  arabic-nlp
ar-embeddings
Sentiment Analysis for Arabic Text (tweets, reviews, and standard Arabic) using word2vec
Stars: ✭ 83 (+84.44%)
Mutual labels:  arabic-nlp
Sumrized
Automatic Text Summarization (English/Arabic).
Stars: ✭ 37 (-17.78%)
Mutual labels:  arabic-nlp
nmatheg
A simple strategy for training and finetuning NLP models for Arabic. Specify the parameters and just wait for the results. A simple design that makes use of the different tools in our NLP pipeline.
Stars: ✭ 19 (-57.78%)
Mutual labels:  arabic-nlp
1-35 of 35 similar projects