All Projects → BSD → Similar Projects or Alternatives

474 Open source projects that are alternatives of or similar to BSD

This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 16 - November 20, 2020.

Stars: ✭ 91 (+78.43%)

Mutual labels: machine-translation, parallel-corpus, parallel-corpora

Indian ParallelCorpus

Curated list of publicly available parallel corpus for Indian Languages

Stars: ✭ 23 (-54.9%)

Mutual labels: corpus, parallel-corpus, parallel-corpora

TALPCo

TUFS Asian Language Parallel Corpus

Stars: ✭ 32 (-37.25%)

Mutual labels: japanese, english, parallel-corpus

tvsub

TVsub: DCU-Tencent Chinese-English Dialogue Corpus

Stars: ✭ 40 (-21.57%)

Mutual labels: machine-translation, corpus

FCH-TTS

A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型，适用于英语、普通话/中文、日语、韩语、俄语和藏语（当前已测试）。

Stars: ✭ 154 (+201.96%)

Mutual labels: japanese, english

KWDLC

Kyoto University Web Document Leads Corpus

Stars: ✭ 64 (+25.49%)

Mutual labels: japanese, corpus

open2ch-dialogue-corpus

おーぷん2ちゃんねるをクロールして作成した対話コーパス

Stars: ✭ 65 (+27.45%)

Mutual labels: japanese, corpus

jiten

jiten - japanese android/cli/web dictionary based on jmdict/kanjidic — 日本語　辞典　和英辞典　漢英字典　和独辞典　和蘭辞典

Stars: ✭ 64 (+25.49%)

Mutual labels: japanese, english

Cluedatasetsearch

搜索所有中文NLP数据集，附常用英文NLP数据集

Stars: ✭ 2,112 (+4041.18%)

Mutual labels: machine-translation, corpus

kanji-frequency

Kanji usage frequency data collected from various sources

Stars: ✭ 92 (+80.39%)

Mutual labels: japanese, corpus

nepali-translator

Neural Machine Translation on the Nepali-English language pair

Stars: ✭ 29 (-43.14%)

Mutual labels: machine-translation, parallel-corpus

Mouse Dictionary

📘A super fast dictionary for Chrome/Firefox

Stars: ✭ 670 (+1213.73%)

Mutual labels: japanese, english

Memorize

🚀 Japanese-English-Mongolian dictionary. It lets you find words, kanji and more quickly and easily

Stars: ✭ 72 (+41.18%)

Mutual labels: japanese, english

Google Ime Dictionary

日英変換・英語略語展開のための IME 追加辞書 📙 日本語から英語への和英変換や英語略語の展開を Google 日本語入力や ATOK などで可能にする IME 拡張辞書です

Stars: ✭ 30 (-41.18%)

Mutual labels: japanese, english

Gse

Go efficient multilingual NLP and text segmentation; support english, chinese, japanese and other. Go 高性能多语言 NLP 和分词

Stars: ✭ 1,695 (+3223.53%)

Mutual labels: japanese, english

Distill-BERT-Textgen

Research code for ACL 2020 paper: "Distilling Knowledge Learned in BERT for Text Generation".

Stars: ✭ 121 (+137.25%)

Mutual labels: machine-translation

MetricMT

The official code repository for MetricMT - a reward optimization method for NMT with learned metrics

Stars: ✭ 23 (-54.9%)

Mutual labels: machine-translation

ap-style-title-case

Convert a value to AP/APA title case

Stars: ✭ 34 (-33.33%)

Mutual labels: english

OPUS-MT-train

Training open neural machine translation models

Stars: ✭ 166 (+225.49%)

Mutual labels: machine-translation

japanese-word-handler

Better Japanese word handling on Visual Studio Code.

Stars: ✭ 32 (-37.25%)

Mutual labels: japanese

nytwit

New York Times Word Innovation Types dataset

Stars: ✭ 21 (-58.82%)

Mutual labels: corpus

sql-novice-survey

Databases and SQL

Stars: ✭ 59 (+15.69%)

Mutual labels: english

OpenConvert

Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA)

Stars: ✭ 20 (-60.78%)

Mutual labels: corpus

jmdict-kindle

Japanese - English dictionary for Kindle based on the JMdict / EDICT database

Stars: ✭ 151 (+196.08%)

Mutual labels: japanese

trafilatura

Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments

Stars: ✭ 711 (+1294.12%)

Mutual labels: corpus

kanjigrid

Fork of the Kanji Grid addon for Anki

Stars: ✭ 21 (-58.82%)

Mutual labels: japanese

introduction-to-conda-for-data-scientists

Introduction to Conda for (Data) Scientists

Stars: ✭ 35 (-31.37%)

Mutual labels: english

gum

Repository for the Georgetown University Multilayer Corpus (GUM)

Stars: ✭ 71 (+39.22%)

Mutual labels: corpus

extreme-adaptation-for-personalized-translation

Code for the paper "Extreme Adaptation for Personalized Neural Machine Translation"

Stars: ✭ 42 (-17.65%)

Mutual labels: machine-translation

folket

Swedish–English dictionary for macOS (December 20, 2020)

Stars: ✭ 31 (-39.22%)

Mutual labels: english

docker-introduction

Reproducible Computational Environments using Containers

Stars: ✭ 34 (-33.33%)

Mutual labels: english

japanese-pretrained-models

Code for producing Japanese pretrained models provided by rinna Co., Ltd.

Stars: ✭ 484 (+849.02%)

Mutual labels: japanese

malay-dataset

Text corpus for Bahasa Malaysia, https://malaya.readthedocs.io/en/latest/Dataset.html

Stars: ✭ 189 (+270.59%)

Mutual labels: corpus

translate english

Java程序员阅读源码必知英语单词

Stars: ✭ 24 (-52.94%)

Mutual labels: english

introcsharpbook

"Fundamentals of Computer Programming with C#" Book

Stars: ✭ 12 (-76.47%)

Mutual labels: english

sample-boot-scala

Spring Boot + Scala + Skinny ORM

Stars: ✭ 14 (-72.55%)

Mutual labels: japanese

almanca

Almanca dilbilgisi ve gramer notlari / Lesson notes I have taken to learn the German language beginning from A1.

Stars: ✭ 15 (-70.59%)

Mutual labels: english

Cross-Language-Dataset

A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection

Stars: ✭ 60 (+17.65%)

Mutual labels: parallel-corpus

shell-genomics

Introduction to the Command Line for Genomics

Stars: ✭ 54 (+5.88%)

Mutual labels: english

opensource-voice-tools

A repo listing known open source voice tools, ordered by where they sit in the voice stack

Stars: ✭ 21 (-58.82%)

Mutual labels: corpus

rhymes

Give me an English word and I’ll give you a list of rhymes

Stars: ✭ 34 (-33.33%)

Mutual labels: english

dic-nico-intersection-pixiv

ニコニコ大百科とピクシブ百科事典の共通部分のIME辞書

Stars: ✭ 49 (-3.92%)

Mutual labels: japanese

skt

Sanskrit compound segmentation using seq2seq model

Stars: ✭ 21 (-58.82%)

Mutual labels: machine-translation

Chatbot-Training-Corpus

总结了一些可以用作聊天机器人训练实作的文字语聊，包含中英文不同语言

Stars: ✭ 117 (+129.41%)

Mutual labels: corpus

urbans

A tool for translating text from source grammar to target grammar (context-free) with corresponding dictionary.

Stars: ✭ 19 (-62.75%)

Mutual labels: machine-translation

ocr2text

Convert a PDF via OCR to a TXT file in UTF-8 encoding

Stars: ✭ 90 (+76.47%)

Mutual labels: corpus

frostpunk mod

Frostpunk / Mod Tools / 非公式日本語化MODツール

Stars: ✭ 17 (-66.67%)

Mutual labels: japanese

sample-ui-vue-pages

Bootstrap + Vue.js [ Scss / Babel ] (Multi-Page/SSR Model)

Stars: ✭ 20 (-60.78%)

Mutual labels: japanese

SequenceToSequence

A seq2seq with attention dialogue/MT model implemented by TensorFlow.

Stars: ✭ 11 (-78.43%)

Mutual labels: machine-translation

matlab-novice-inflammation

Programming with MATLAB

Stars: ✭ 26 (-49.02%)

Mutual labels: english

python-packaging-publishing

Packaging and Publishing with Python

Stars: ✭ 24 (-52.94%)

Mutual labels: english

bergamot-translator

Cross platform C++ library focusing on optimized machine translation on the consumer-grade device.

Stars: ✭ 181 (+254.9%)

Mutual labels: machine-translation

nextword

Predict next English words.

Stars: ✭ 65 (+27.45%)

Mutual labels: english

deep-learning-intro

Learn Deep Learning with Python

Stars: ✭ 23 (-54.9%)

Mutual labels: english

sql-ecology-lesson

Data Management with SQL for Ecologists

Stars: ✭ 37 (-27.45%)

Mutual labels: english

inmt

Interactive Neural Machine Translation tool

Stars: ✭ 44 (-13.73%)

Mutual labels: machine-translation

rtg

Reader Translator Generator - NMT toolkit based on pytorch

Stars: ✭ 26 (-49.02%)

Mutual labels: machine-translation

tai5-uan5 gian5-gi2 kang1-ku7

臺灣言語工具

Stars: ✭ 79 (+54.9%)

Mutual labels: machine-translation

Speech-Corpus-Collection

A Collection of Speech Corpus for ASR and TTS

Stars: ✭ 113 (+121.57%)

Mutual labels: corpus

Kawazu

A C# library for converting Japanese sentence to Hiragana, Katakana or Romaji with furigana and okurigana modes supported. Inspired by project Kuroshiro.

Stars: ✭ 33 (-35.29%)

Mutual labels: japanese

1-60 of 474 similar projects

›

next*5