ThotThot toolkit for statistical machine translation
Stars: ✭ 53 (-59.85%)
StringiTHE String Processing Package for R (with ICU)
Stars: ✭ 204 (+54.55%)
Open Korean TextOpen Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+231.82%)
Hardware Aware Transformers[ACL 2020] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
Stars: ✭ 206 (+56.06%)
NonautoreggenprogressTracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (-10.61%)
TexarToolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 2,236 (+1593.94%)
SacremosesPython port of Moses tokenizer, truecaser and normalizer
Stars: ✭ 293 (+121.97%)
Mtbook《机器翻译:基础与模型》肖桐 朱靖波 著 - Machine Translation: Foundations and Models
Stars: ✭ 2,307 (+1647.73%)
Spark NlpState of the Art Natural Language Processing
Stars: ✭ 2,518 (+1807.58%)
ZhihuThis repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Language Processing and Computer Vision projects, such as text generation, machine translation, deep convolution GAN and other actual combat code.
Stars: ✭ 3,307 (+2405.3%)
Nlg EvalEvaluation code for various unsupervised automated metrics for Natural Language Generation.
Stars: ✭ 822 (+522.73%)
IcuThe new home of the ICU project source code.
Stars: ✭ 1,011 (+665.91%)
GreynirThe greynir.is natural language processing website for Icelandic
Stars: ✭ 47 (-64.39%)
String To Tree NmtSource code and data for the paper "Towards String-to-Tree Neural Machine Translation"
Stars: ✭ 16 (-87.88%)
ilmultiTooling to play around with multilingual machine translation for Indian Languages.
Stars: ✭ 19 (-85.61%)
ICU4NInternational Components for Unicode for .NET
Stars: ✭ 18 (-86.36%)
MtntCode for the collection and analysis of the MTNT dataset
Stars: ✭ 48 (-63.64%)
Opennmt TfNeural machine translation and sequence learning using TensorFlow
Stars: ✭ 1,223 (+826.52%)
Attention MechanismsImplementations for a family of attention mechanisms, suitable for all kinds of natural language processing tasks and compatible with TensorFlow 2.0 and Keras.
Stars: ✭ 203 (+53.79%)
Nlp ProgressRepository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Stars: ✭ 19,518 (+14686.36%)
icu-dotnetC# wrapper for ICU4C
Stars: ✭ 48 (-63.64%)
Icu4xSolving i18n for client-side and resource-constrained environments.
Stars: ✭ 275 (+108.33%)
Texar PytorchIntegrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 636 (+381.82%)
Comet A Neural Framework for MT Evaluation
Stars: ✭ 58 (-56.06%)
Deep Learning DrizzleDrench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Stars: ✭ 9,717 (+7261.36%)
icu-swiftSwift APIs for ICU
Stars: ✭ 23 (-82.58%)
UdpipeR package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Stars: ✭ 160 (+21.21%)
Opus MtOpen neural machine translation models and web services
Stars: ✭ 111 (-15.91%)
stringxDrop-in replacements for base R string functions powered by stringi
Stars: ✭ 14 (-89.39%)
greebGreeb is a simple Unicode-aware regexp-based tokenizer.
Stars: ✭ 16 (-87.88%)
Py NltoolsA collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (-65.15%)
KadotKadot, the unsupervised natural language processing library.
Stars: ✭ 108 (-18.18%)
SyntokText tokenization and sentence segmentation (segtok v2)
Stars: ✭ 123 (-6.82%)
Files2rougeCalculating ROUGE score between two files (line-by-line)
Stars: ✭ 120 (-9.09%)
Nlp Pretrained ModelA collection of Natural language processing pre-trained models.
Stars: ✭ 122 (-7.58%)
PrenlpPreprocessing Library for Natural Language Processing
Stars: ✭ 130 (-1.52%)
Deep LyricsLyrics Generator aka Character-level Language Modeling with Multi-layer LSTM Recurrent Neural Network
Stars: ✭ 127 (-3.79%)
Cs230 Code ExamplesCode examples in pyTorch and Tensorflow for CS230
Stars: ✭ 1,701 (+1188.64%)
Neuraldialog LarlPyTorch implementation of latent space reinforcement learning for E2E dialog published at NAACL 2019. It is released by Tiancheng Zhao (Tony) from Dialog Research Center, LTI, CMU
Stars: ✭ 127 (-3.79%)
DialoglueDialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue
Stars: ✭ 120 (-9.09%)
RatelRAT-el is an open source penetration test tool that allows you to take control of a windows machine. It works on the client-server model, the server sends commands and the client executes the commands and sends the result back to the server. The client is completely undetectable by anti-virus software.
Stars: ✭ 121 (-8.33%)
Neuro🔮 Neuro.js is machine learning library for building AI assistants and chat-bots (WIP).
Stars: ✭ 126 (-4.55%)
ScattertextBeautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+1204.55%)
DiscobertCode for paper "Discourse-Aware Neural Extractive Text Summarization" (ACL20)
Stars: ✭ 120 (-9.09%)
TextacyNLP, before and after spaCy
Stars: ✭ 1,849 (+1300.76%)
FugashiA Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
Stars: ✭ 125 (-5.3%)
TokenizerSource code tokenizer
Stars: ✭ 119 (-9.85%)
PymetamapPython wraper for MetaMap
Stars: ✭ 119 (-9.85%)