BSDThe Business Scene Dialogue corpus
Stars: ✭ 51 (+75.86%)
banglanmtThis repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 16 - November 20, 2020.
Stars: ✭ 91 (+213.79%)
bergamot-translatorCross platform C++ library focusing on optimized machine translation on the consumer-grade device.
Stars: ✭ 181 (+524.14%)
Machine-Translation-Hindi-to-english-Machine translation is the task of converting one language to other. Unlike the traditional phrase-based translation system which consists of many small sub-components that are tuned separately, neural machine translation attempts to build and train a single, large neural network that reads a sentence and outputs a correct translation.
Stars: ✭ 19 (-34.48%)
R-Learning-JourneySome of the projects i made when starting to learn R for Data Science at the university
Stars: ✭ 19 (-34.48%)
mtdataA tool that locates, downloads, and extracts machine translation corpora
Stars: ✭ 95 (+227.59%)
OpennmtOpen Source Neural Machine Translation in Torch (deprecated)
Stars: ✭ 2,339 (+7965.52%)
SequenceToSequenceA seq2seq with attention dialogue/MT model implemented by TensorFlow.
Stars: ✭ 11 (-62.07%)
LingvoLingvo
Stars: ✭ 2,361 (+8041.38%)
bumblebee🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
Stars: ✭ 120 (+313.79%)
Cross-Language-DatasetA multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection
Stars: ✭ 60 (+106.9%)
osdg-toolOSDG is an open-source tool that maps and connects activities to the UN Sustainable Development Goals (SDGs) by identifying SDG-relevant content in any text. The tool is available online at www.osdg.ai. API access available for research purposes.
Stars: ✭ 22 (-24.14%)
parallel-corpora-toolsTools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.
Stars: ✭ 35 (+20.69%)
apertium-apy📦 Apertium HTTP Server in Python
Stars: ✭ 29 (+0%)
allie🤖 A machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers).
Stars: ✭ 93 (+220.69%)
ibleuA visual and interactive scoring environment for machine translation systems.
Stars: ✭ 27 (-6.9%)
urbansA tool for translating text from source grammar to target grammar (context-free) with corresponding dictionary.
Stars: ✭ 19 (-34.48%)
Attention MechanismsImplementations for a family of attention mechanisms, suitable for all kinds of natural language processing tasks and compatible with TensorFlow 2.0 and Keras.
Stars: ✭ 203 (+600%)
NpmtTowards Neural Phrase-based Machine Translation
Stars: ✭ 175 (+503.45%)
objectiv-analyticsPowerful product analytics for data teams, with full control over data & models.
Stars: ✭ 399 (+1275.86%)
Mt Reading ListA machine translation reading list maintained by Tsinghua Natural Language Processing Group
Stars: ✭ 2,166 (+7368.97%)
exemplary-ml-pipelineExemplary, annotated machine learning pipeline for any tabular data problem.
Stars: ✭ 23 (-20.69%)
Distill-BERT-TextgenResearch code for ACL 2020 paper: "Distilling Knowledge Learned in BERT for Text Generation".
Stars: ✭ 121 (+317.24%)
ilmultiTooling to play around with multilingual machine translation for Indian Languages.
Stars: ✭ 19 (-34.48%)
OPUS-MT-trainTraining open neural machine translation models
Stars: ✭ 166 (+472.41%)
dynmt-pyNeural machine translation implementation using dynet's python bindings
Stars: ✭ 17 (-41.38%)
tvsubTVsub: DCU-Tencent Chinese-English Dialogue Corpus
Stars: ✭ 40 (+37.93%)
apertium-html-toolsWeb application providing a fully localised interface for text/website/document translation, analysis and generation powered by Apertium.
Stars: ✭ 36 (+24.14%)
Cleaner.jlA toolbox of simple solutions for common data cleaning problems.
Stars: ✭ 21 (-27.59%)
masakhane-webMasakhane Web is a translation web application for solely African Languages.
Stars: ✭ 27 (-6.9%)
FIFA-2019-AnalysisThis is a project based on the FIFA World Cup 2019 and Analyzes the Performance and Efficiency of Teams, Players, Countries and other related things using Data Analysis and Data Visualizations
Stars: ✭ 28 (-3.45%)
sb-nmtCode for Synchronous Bidirectional Neural Machine Translation (SB-NMT)
Stars: ✭ 66 (+127.59%)
errorlocateFind and replace erroneous fields in data using validation rules
Stars: ✭ 19 (-34.48%)
optimus🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+4558.62%)
transformer-pytorchA PyTorch implementation of Transformer in "Attention is All You Need"
Stars: ✭ 77 (+165.52%)
transformerBuild English-Vietnamese machine translation with ProtonX Transformer. :D
Stars: ✭ 41 (+41.38%)
ModernmtNeural Adaptive Machine Translation that adapts to context and learns from corrections.
Stars: ✭ 231 (+696.55%)
omegat-tencent-pluginThis is a plugin to allow OmegaT to source machine translations from Tencent Cloud.
Stars: ✭ 31 (+6.9%)
Hardware Aware Transformers[ACL 2020] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
Stars: ✭ 206 (+610.34%)
inmtInteractive Neural Machine Translation tool
Stars: ✭ 44 (+51.72%)
BleualignMachine-Translation-based sentence alignment tool for parallel text
Stars: ✭ 199 (+586.21%)
foofahFoofah: programming-by-example data transformation program synthesizer
Stars: ✭ 24 (-17.24%)
TexarToolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 2,236 (+7610.34%)
rtgReader Translator Generator - NMT toolkit based on pytorch
Stars: ✭ 26 (-10.34%)
Spark NlpState of the Art Natural Language Processing
Stars: ✭ 2,518 (+8582.76%)
NiuTrans.NMTA Fast Neural Machine Translation System. It is developed in C++ and resorts to NiuTensor for fast tensor APIs.
Stars: ✭ 112 (+286.21%)
OpenkiwiOpen-Source Machine Translation Quality Estimation in PyTorch
Stars: ✭ 157 (+441.38%)
MetricMTThe official code repository for MetricMT - a reward optimization method for NMT with learned metrics
Stars: ✭ 23 (-20.69%)
Natural-Language-ProcessingContains various architectures and novel paper implementations for Natural Language Processing tasks like Sequence Modelling and Neural Machine Translation.
Stars: ✭ 48 (+65.52%)
NLP ToolkitLibrary of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (+217.24%)
deepl-rbA simple ruby gem for the DeepL API
Stars: ✭ 38 (+31.03%)