All Projects → Indian_ParallelCorpus → Similar Projects or Alternatives

186 Open source projects that are alternatives of or similar to Indian_ParallelCorpus

OpenDialog
An Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统,一键部署微信闲聊机器人)
Stars: ✭ 94 (+308.7%)
Mutual labels:  corpus
Cross-Language-Dataset
A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection
Stars: ✭ 60 (+160.87%)
Mutual labels:  parallel-corpus
pdf-corpus
Python script to quickly create hand-crafted PDF files
Stars: ✭ 17 (-26.09%)
Mutual labels:  corpus
opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-8.7%)
Mutual labels:  corpus
pytorch basic nmt
A simple yet strong implementation of neural machine translation in pytorch
Stars: ✭ 66 (+186.96%)
Chatbot-Training-Corpus
总结了一些可以用作聊天机器人训练实作的文字语聊,包含中英文不同语言
Stars: ✭ 117 (+408.7%)
Mutual labels:  corpus
CBLUE
中文医疗信息处理基准CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark
Stars: ✭ 379 (+1547.83%)
Mutual labels:  corpus
DCGCN
Densely Connected Graph Convolutional Networks for Graph-to-Sequence Learning (authors' MXNet implementation for the TACL19 paper)
Stars: ✭ 73 (+217.39%)
PubMed-PICO-Detection
PubMed PICO Element Detection Dataset
Stars: ✭ 37 (+60.87%)
Mutual labels:  corpus
Speech-Corpus-Collection
A Collection of Speech Corpus for ASR and TTS
Stars: ✭ 113 (+391.3%)
Mutual labels:  corpus
egret-wenda-corpus
A Public Corpus for Machine Learning
Stars: ✭ 41 (+78.26%)
Mutual labels:  corpus
proiel-treebank
Official releases of the PROIEL treebank of ancient Indo-European languages
Stars: ✭ 30 (+30.43%)
Mutual labels:  corpus
fastmorph
Fast corpus search engine originally made for the Corpus of Written Tatar language
Stars: ✭ 14 (-39.13%)
Mutual labels:  corpus
Probabilistic-RNN-DA-Classifier
Probabilistic Dialogue Act Classification for the Switchboard Corpus using an LSTM model
Stars: ✭ 22 (-4.35%)
Mutual labels:  corpus
minimal-nmt
A minimal nmt example to serve as an seq2seq+attention reference.
Stars: ✭ 36 (+56.52%)
german-nouns
A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the data and parse compound words.
Stars: ✭ 101 (+339.13%)
Mutual labels:  corpus
thai-language
computer tools for thai language
Stars: ✭ 20 (-13.04%)
Mutual labels:  corpus
Dialogue-Corpus
No description or website provided.
Stars: ✭ 27 (+17.39%)
Mutual labels:  corpus
NiuTrans.NMT
A Fast Neural Machine Translation System. It is developed in C++ and resorts to NiuTensor for fast tensor APIs.
Stars: ✭ 112 (+386.96%)
Chinese Names Corpus
中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。
Stars: ✭ 3,053 (+13173.91%)
Mutual labels:  corpus
dialogue-datasets
collect the open dialog corpus and some useful data processing utils.
Stars: ✭ 24 (+4.35%)
Mutual labels:  corpus
Weibo terminater
Final Weibo Crawler Scrap Anything From Weibo, comments, weibo contents, followers, anything. The Terminator
Stars: ✭ 2,295 (+9878.26%)
Mutual labels:  corpus
TV4Dialog
No description or website provided.
Stars: ✭ 33 (+43.48%)
Mutual labels:  corpus
Efaqa Corpus Zh
❤️Emotional First Aid Dataset, 心理咨询问答、聊天机器人语料库
Stars: ✭ 170 (+639.13%)
Mutual labels:  corpus
transformer
Neutron: A pytorch based implementation of Transformer and its variants.
Stars: ✭ 60 (+160.87%)
Indonesian Nlp Resources
data resource untuk NLP bahasa indonesia
Stars: ✭ 143 (+521.74%)
Mutual labels:  corpus
LanguageCodes
We present a list of languages with their codes, families, regions and etc. We also present a list of multi-lingual corpora (with urls).
Stars: ✭ 70 (+204.35%)
Mutual labels:  corpus
Lyrics Corpora
An unofficial Python API that allows users to create a corpus of lyrical text from their favorite artists and billboard charts
Stars: ✭ 13 (-43.48%)
Mutual labels:  corpus
xl-sum
This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021.
Stars: ✭ 160 (+595.65%)
Mutual labels:  low-resource-languages
open2ch-dialogue-corpus
おーぷん2ちゃんねるをクロールして作成した対話コーパス
Stars: ✭ 65 (+182.61%)
Mutual labels:  corpus
kanji-frequency
Kanji usage frequency data collected from various sources
Stars: ✭ 92 (+300%)
Mutual labels:  corpus
Awesome Chatbot
Awesome Chatbot Projects,Corpus,Papers,Tutorials.Chinese Chatbot =>:
Stars: ✭ 1,785 (+7660.87%)
Mutual labels:  corpus
cljs-corpus
A greppable archive of ClojureScript code
Stars: ✭ 37 (+60.87%)
Mutual labels:  corpus
Cluedatasetsearch
搜索所有中文NLP数据集,附常用英文NLP数据集
Stars: ✭ 2,112 (+9082.61%)
Mutual labels:  corpus
MT-Preparation
Machine Translation (MT) Preparation Scripts
Stars: ✭ 15 (-34.78%)
Awesome Hungarian Nlp
A curated list of NLP resources for Hungarian
Stars: ✭ 121 (+426.09%)
Mutual labels:  corpus
fuzzing-corpus
My fuzzing corpus
Stars: ✭ 120 (+421.74%)
Mutual labels:  corpus
Colibri Core
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.
Stars: ✭ 112 (+386.96%)
Mutual labels:  corpus
parallel-corpora-tools
Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.
Stars: ✭ 35 (+52.17%)
Ua Gec
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Stars: ✭ 108 (+369.57%)
Mutual labels:  corpus
bible-corpus
A multilingual parallel corpus created from translations of the Bible.
Stars: ✭ 115 (+400%)
Mutual labels:  corpus
Pubmed Rct
PubMed 200k RCT dataset: a large dataset for sequential sentence classification.
Stars: ✭ 101 (+339.13%)
Mutual labels:  corpus
2018-dlsl
UPC Deep Learning for Speech and Language 2018
Stars: ✭ 18 (-21.74%)
Chi Corpus
迟先生语料库
Stars: ✭ 96 (+317.39%)
Mutual labels:  corpus
nepali-translator
Neural Machine Translation on the Nepali-English language pair
Stars: ✭ 29 (+26.09%)
Mutual labels:  parallel-corpus
Dataset List
lists of text corpus and more (mainly Japanese)
Stars: ✭ 84 (+265.22%)
Mutual labels:  corpus
gum
Repository for the Georgetown University Multilayer Corpus (GUM)
Stars: ✭ 71 (+208.7%)
Mutual labels:  corpus
Russian news corpus
Russian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ
Stars: ✭ 76 (+230.43%)
Mutual labels:  corpus
transformer-slt
Sign Language Translation with Transformers (COLING'2020, ECCV'20 SLRTP Workshop)
Stars: ✭ 92 (+300%)
Coarij
Corpus of Annual Reports in Japan
Stars: ✭ 55 (+139.13%)
Mutual labels:  corpus
textbox
Text collections made available by the CLiGS group.
Stars: ✭ 19 (-17.39%)
Mutual labels:  corpus
Chatterbot Corpus
A multilingual dialog corpus
Stars: ✭ 964 (+4091.3%)
Mutual labels:  corpus
Attention-Visualization
Visualization for simple attention and Google's multi-head attention.
Stars: ✭ 54 (+134.78%)
Word-Level-Eng-Mar-NMT
Translating English sentences to Marathi using Neural Machine Translation
Stars: ✭ 37 (+60.87%)
EdgarAllanPoetry
Computer-generated poetry
Stars: ✭ 22 (-4.35%)
Mutual labels:  corpus
wordfish-python
extract relationships from standardized terms from corpus of interest with deep learning 🐟
Stars: ✭ 19 (-17.39%)
Mutual labels:  corpus
Species-Names-Corpus
物种名称语料库。植物名,动物名。
Stars: ✭ 23 (+0%)
Mutual labels:  corpus
DataAugmentationNMT
Data Augmentation for Neural Machine Translation
Stars: ✭ 26 (+13.04%)
Data-Rejuvenation
Implementation of our paper "Data Rejuvenation: Exploiting Inactive Training Examples for Neural Machine Translation" in EMNLP-2020.
Stars: ✭ 18 (-21.74%)
nytwit
New York Times Word Innovation Types dataset
Stars: ✭ 21 (-8.7%)
Mutual labels:  corpus
61-120 of 186 similar projects