All Projects → ilmulti → Similar Projects or Alternatives

196 Open source projects that are alternatives of or similar to ilmulti

Indian ParallelCorpus
Curated list of publicly available parallel corpus for Indian Languages
Stars: ✭ 23 (+21.05%)
Thot
Thot toolkit for statistical machine translation
Stars: ✭ 53 (+178.95%)
Mutual labels:  tokenizer, machine-translation
Sacremoses
Python port of Moses tokenizer, truecaser and normalizer
Stars: ✭ 293 (+1442.11%)
Mutual labels:  tokenizer, machine-translation
Tokenizer
Fast and customizable text tokenization library with BPE and SentencePiece support
Stars: ✭ 132 (+594.74%)
Mutual labels:  tokenizer, machine-translation
farasapy
A Python implementation of Farasa toolkit
Stars: ✭ 69 (+263.16%)
Mutual labels:  tokenizer
rtg
Reader Translator Generator - NMT toolkit based on pytorch
Stars: ✭ 26 (+36.84%)
Mutual labels:  machine-translation
skt
Sanskrit compound segmentation using seq2seq model
Stars: ✭ 21 (+10.53%)
Mutual labels:  machine-translation
snapdragon-lexer
Converts a string into an array of tokens, with useful methods for looking ahead and behind, capturing, matching, et cetera.
Stars: ✭ 19 (+0%)
Mutual labels:  tokenizer
berserker
Berserker - BERt chineSE woRd toKenizER
Stars: ✭ 17 (-10.53%)
Mutual labels:  tokenizer
psr2r-sniffer
A PSR-2-R code sniffer and code-style auto-correction-tool - including many useful additions
Stars: ✭ 32 (+68.42%)
Mutual labels:  tokenizer
chinese-tokenizer
Tokenizes Chinese texts into words.
Stars: ✭ 72 (+278.95%)
Mutual labels:  tokenizer
SwiLex
A universal lexer library in Swift.
Stars: ✭ 29 (+52.63%)
Mutual labels:  tokenizer
dynmt-py
Neural machine translation implementation using dynet's python bindings
Stars: ✭ 17 (-10.53%)
Mutual labels:  machine-translation
gd-tokenizer
A small godot project with a tokenizer written in GDScript.
Stars: ✭ 34 (+78.95%)
Mutual labels:  tokenizer
NiuTrans.NMT
A Fast Neural Machine Translation System. It is developed in C++ and resorts to NiuTensor for fast tensor APIs.
Stars: ✭ 112 (+489.47%)
Mutual labels:  machine-translation
tai5-uan5 gian5-gi2 kang1-ku7
臺灣言語工具
Stars: ✭ 79 (+315.79%)
Mutual labels:  machine-translation
packetevents
PacketEvents is a powerful packet library. Our packet wrappers are efficient and easy to use. We support many protocol versions. (1.8+)
Stars: ✭ 235 (+1136.84%)
Mutual labels:  wrappers
extreme-adaptation-for-personalized-translation
Code for the paper "Extreme Adaptation for Personalized Neural Machine Translation"
Stars: ✭ 42 (+121.05%)
Mutual labels:  machine-translation
omegat-tencent-plugin
This is a plugin to allow OmegaT to source machine translations from Tencent Cloud.
Stars: ✭ 31 (+63.16%)
Mutual labels:  machine-translation
parallel-corpora-tools
Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.
Stars: ✭ 35 (+84.21%)
Mutual labels:  machine-translation
suika
Suika 🍉 is a Japanese morphological analyzer written in pure Ruby
Stars: ✭ 31 (+63.16%)
Mutual labels:  tokenizer
Text-Classification-LSTMs-PyTorch
The aim of this repository is to show a baseline model for text classification by implementing a LSTM-based model coded in PyTorch. In order to provide a better understanding of the model, it will be used a Tweets dataset provided by Kaggle.
Stars: ✭ 45 (+136.84%)
Mutual labels:  tokenizer
lexertk
C++ Lexer Toolkit Library (LexerTk) https://www.partow.net/programming/lexertk/index.html
Stars: ✭ 26 (+36.84%)
Mutual labels:  tokenizer
jargon
Tokenizers and lemmatizers for Go
Stars: ✭ 98 (+415.79%)
Mutual labels:  tokenizer
hunspell
High-Performance Stemmer, Tokenizer, and Spell Checker for R
Stars: ✭ 101 (+431.58%)
Mutual labels:  tokenizer
grasp
Essential NLP & ML, short & fast pure Python code
Stars: ✭ 58 (+205.26%)
Mutual labels:  tokenizer
lindera
A morphological analysis library.
Stars: ✭ 226 (+1089.47%)
Mutual labels:  tokenizer
elasticsearch-plugins
Some native scoring script plugins for elasticsearch
Stars: ✭ 30 (+57.89%)
Mutual labels:  tokenizer
SequenceToSequence
A seq2seq with attention dialogue/MT model implemented by TensorFlow.
Stars: ✭ 11 (-42.11%)
Mutual labels:  machine-translation
Deep-NLP-Resources
Curated list of all NLP Resources
Stars: ✭ 65 (+242.11%)
Mutual labels:  machine-translation
MetricMT
The official code repository for MetricMT - a reward optimization method for NMT with learned metrics
Stars: ✭ 23 (+21.05%)
Mutual labels:  machine-translation
neural tokenizer
Tokenize English sentences using neural networks.
Stars: ✭ 64 (+236.84%)
Mutual labels:  tokenizer
python-mecab
A repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)
Stars: ✭ 27 (+42.11%)
Mutual labels:  tokenizer
mtdata
A tool that locates, downloads, and extracts machine translation corpora
Stars: ✭ 95 (+400%)
Mutual labels:  machine-translation
instamojo-java
Java wrapper for Instamojo API
Stars: ✭ 15 (-21.05%)
Mutual labels:  wrappers
rustfst
Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (+447.37%)
Mutual labels:  tokenizer
xontrib-output-search
Get identifiers, paths, URLs and words from the previous command output and use them for the next command in xonsh shell.
Stars: ✭ 26 (+36.84%)
Mutual labels:  tokenizer
wink-tokenizer
Multilingual tokenizer that automatically tags each token with its type
Stars: ✭ 51 (+168.42%)
Mutual labels:  tokenizer
Distill-BERT-Textgen
Research code for ACL 2020 paper: "Distilling Knowledge Learned in BERT for Text Generation".
Stars: ✭ 121 (+536.84%)
Mutual labels:  machine-translation
masakhane-web
Masakhane Web is a translation web application for solely African Languages.
Stars: ✭ 27 (+42.11%)
Mutual labels:  machine-translation
OPUS-MT-train
Training open neural machine translation models
Stars: ✭ 166 (+773.68%)
Mutual labels:  machine-translation
vscode-blockman
VSCode extension to highlight nested code blocks
Stars: ✭ 233 (+1126.32%)
Mutual labels:  tokenizer
tvsub
TVsub: DCU-Tencent Chinese-English Dialogue Corpus
Stars: ✭ 40 (+110.53%)
Mutual labels:  machine-translation
lex
Lex is an implementation of lex tool in Ruby.
Stars: ✭ 49 (+157.89%)
Mutual labels:  tokenizer
bergamot-translator
Cross platform C++ library focusing on optimized machine translation on the consumer-grade device.
Stars: ✭ 181 (+852.63%)
Mutual labels:  machine-translation
deepl-rb
A simple ruby gem for the DeepL API
Stars: ✭ 38 (+100%)
Mutual labels:  machine-translation
Tokenizer
A tokenizer for Icelandic text
Stars: ✭ 27 (+42.11%)
Mutual labels:  tokenizer
Machine-Translation-Hindi-to-english-
Machine translation is the task of converting one language to other. Unlike the traditional phrase-based translation system which consists of many small sub-components that are tuned separately, neural machine translation attempts to build and train a single, large neural network that reads a sentence and outputs a correct translation.
Stars: ✭ 19 (+0%)
Mutual labels:  machine-translation
osdg-tool
OSDG is an open-source tool that maps and connects activities to the UN Sustainable Development Goals (SDGs) by identifying SDG-relevant content in any text. The tool is available online at www.osdg.ai. API access available for research purposes.
Stars: ✭ 22 (+15.79%)
Mutual labels:  machine-translation
ReductionWrappers
R wrappers to connect Python dimensional reduction tools and single cell data objects (Seurat, SingleCellExperiment, etc...)
Stars: ✭ 31 (+63.16%)
Mutual labels:  wrappers
BSD
The Business Scene Dialogue corpus
Stars: ✭ 51 (+168.42%)
Mutual labels:  machine-translation
OpenISS
OpenISS -- a unified multimodal motion data delivery framework.
Stars: ✭ 22 (+15.79%)
Mutual labels:  wrappers
Roy VnTokenizer
Vietnamese tokenizer (Maximum Matching and CRF)
Stars: ✭ 49 (+157.89%)
Mutual labels:  tokenizer
sb-nmt
Code for Synchronous Bidirectional Neural Machine Translation (SB-NMT)
Stars: ✭ 66 (+247.37%)
Mutual labels:  machine-translation
Machine-Translation-v2
英中机器文本翻译
Stars: ✭ 48 (+152.63%)
Mutual labels:  machine-translation
urbans
A tool for translating text from source grammar to target grammar (context-free) with corresponding dictionary.
Stars: ✭ 19 (+0%)
Mutual labels:  machine-translation
sinling
A collection of NLP tools for Sinhalese (සිංහල).
Stars: ✭ 38 (+100%)
Mutual labels:  tokenizer
apertium-apy
📦 Apertium HTTP Server in Python
Stars: ✭ 29 (+52.63%)
Mutual labels:  machine-translation
tokenizer
A simple tokenizer in Ruby for NLP tasks.
Stars: ✭ 44 (+131.58%)
Mutual labels:  tokenizer
Video-guided-Machine-Translation
Starter code for the VMT task and challenge
Stars: ✭ 45 (+136.84%)
Mutual labels:  machine-translation
1-60 of 196 similar projects