OValery16 / Language-Translation-with-deep-learning-

License: MPL-2.0
No description or website provided.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Language-Translation-with-deep-learning-

appsync-lambda-ai
Demo of using a GraphQL resolver to hit a lambda function, then hit a few AI services, and return the response.
Stars: ✭ 47 (+95.83%)
Mutual labels:  translation
inlang
Open Source Localization Solution for Software.
Stars: ✭ 160 (+566.67%)
Mutual labels:  translation
legesher
Because language shouldn't be a barrier to code
Stars: ✭ 29 (+20.83%)
Mutual labels:  translation
i18n
internationalize projects to Arabic
Stars: ✭ 67 (+179.17%)
Mutual labels:  translation
arboles
Urban tree map (Mapa de Arbolado Urbano)
Stars: ✭ 13 (-45.83%)
Mutual labels:  translation
DidacticalEnigma
An integrated translator environment for translating text from Japanese to English
Stars: ✭ 29 (+20.83%)
Mutual labels:  translation
bible-corpus
A multilingual parallel corpus created from translations of the Bible.
Stars: ✭ 115 (+379.17%)
Mutual labels:  translation
translation
👅 Translations (symfony/translation) to Nette Framework (@nette)
Stars: ✭ 55 (+129.17%)
Mutual labels:  translation
SQUAD2.Q-Augmented-Dataset
Augmented version of SQUAD 2.0 for Questions
Stars: ✭ 31 (+29.17%)
Mutual labels:  translation
gettext-extractor
A flexible and powerful Gettext message extractor with support for JavaScript, TypeScript, JSX and HTML.
Stars: ✭ 82 (+241.67%)
Mutual labels:  translation
tarjama
This package allows you to translate your model fields. The `2.0` version will be continued here: https://github.com/fevrok/laravel-translatable
Stars: ✭ 2 (-91.67%)
Mutual labels:  translation
pydantic-i18n
pydantic-i18n is an extension to support an i18n for the pydantic error messages.
Stars: ✭ 32 (+33.33%)
Mutual labels:  translation
Translate helper
A Java application to help to translate the localisation files of the Paradox games.
Stars: ✭ 23 (-4.17%)
Mutual labels:  translation
genshin-wishes-i18n
A contribution-driven project to translate Genshin Wishes in many languages.
Stars: ✭ 25 (+4.17%)
Mutual labels:  translation
RestApiTutorial.ru
https://restapitutorial.ru/
Stars: ✭ 72 (+200%)
Mutual labels:  translation
Deep-Learning-with-PyTorch-A-60-Minute-Blitz-cn
PyTorch 1.0 deep learning: a 60-minute introduction and hands-on tutorial (Chinese translation and study of Deep Learning with PyTorch: A 60 Minute Blitz)
Stars: ✭ 127 (+429.17%)
Mutual labels:  translation
Xiaomi.eu-MIUIv10-XML-Compare
MIUI 10 XML Daily Compare for Xiaomi.eu builds
Stars: ✭ 28 (+16.67%)
Mutual labels:  translation
tmdb-web-translations
No description or website provided.
Stars: ✭ 12 (-50%)
Mutual labels:  translation
qstardict
A read-only mirror
Stars: ✭ 55 (+129.17%)
Mutual labels:  translation
Functional-Light-JS-Zh
Chinese translation of Functional-Light-JS
Stars: ✭ 14 (-41.67%)
Mutual labels:  translation

Language Translation with deep learning

Project purpose

For this project, we build an RNN sequence-to-sequence (seq2seq) model in Keras to translate from a language A to a language B.

Language and Dataset

Since I am French, I chose to translate English to French. However, our system is fairly general and accepts any other language pair. By default, we use the Anki dataset, which can easily be downloaded there.

What is Sequence-to-sequence learning?

Sequence-to-sequence learning (Seq2Seq) is about training models to convert sequences from one domain to sequences in another domain. It works as follows:

  1. We start with input sequences from a domain (e.g. English sentences) and corresponding target sequences from another domain (e.g. French sentences).

  2. An encoder LSTM turns the input sequences into two state vectors (we keep the final LSTM states and discard the outputs).

  3. A decoder LSTM is trained to turn the target sequences into the same sequences but offset by one timestep in the future, a training process called "teacher forcing" in this context. It uses the state vectors from the encoder as its initial state. Effectively, the decoder learns to generate targets[t+1...] given targets[...t], conditioned on the input sequence. (A Keras sketch of this training setup follows the list.)

  4. In inference mode, when we want to decode unknown input sequences, we:

    • Encode the input sequence into state vectors
    • Start with a target sequence of size 1 (just the start-of-sequence character)
    • Feed the state vectors and 1-char target sequence to the decoder to produce predictions for the next character
    • Sample the next character using these predictions (we simply use argmax).
    • Append the sampled character to the target sequence
    • Repeat until we generate the end-of-sequence character or we hit the character limit (see the decoding-loop sketch below).
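
Below is a minimal Keras sketch of the training model described in steps 1–3. The dimensions and variable names (latent_dim, num_encoder_tokens, num_decoder_tokens) are illustrative assumptions, not the repo's exact code:

```python
from keras.models import Model
from keras.layers import Input, LSTM, Dense

latent_dim = 256          # dimensionality of the LSTM state vectors (assumed)
num_encoder_tokens = 71   # input character vocabulary size (assumed)
num_decoder_tokens = 94   # target character vocabulary size (assumed)

# Encoder: keep only the final states (h, c) and discard the outputs.
encoder_inputs = Input(shape=(None, num_encoder_tokens))
_, state_h, state_c = LSTM(latent_dim, return_state=True)(encoder_inputs)
encoder_states = [state_h, state_c]

# Decoder: teacher forcing. It reads the target sequence and is trained to
# predict the same sequence shifted one timestep into the future, starting
# from the encoder's states.
decoder_inputs = Input(shape=(None, num_decoder_tokens))
decoder_lstm = LSTM(latent_dim, return_sequences=True, return_state=True)
decoder_outputs, _, _ = decoder_lstm(decoder_inputs, initial_state=encoder_states)
decoder_dense = Dense(num_decoder_tokens, activation='softmax')
decoder_outputs = decoder_dense(decoder_outputs)

model = Model([encoder_inputs, decoder_inputs], decoder_outputs)
model.compile(optimizer='rmsprop', loss='categorical_crossentropy')
```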

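The decoding loop from step 4 can be sketched as follows, reusing the layers from the training sketch. The start/end markers ('\t' and '\n'), the lookup tables target_token_index and reverse_target_char_index, and max_decoder_seq_length are assumptions in the spirit of the standard Keras character-level example:

```python
import numpy as np
from keras.models import Model
from keras.layers import Input

# Rebuild standalone inference models from the trained layers above.
encoder_model = Model(encoder_inputs, encoder_states)

dec_h_in = Input(shape=(latent_dim,))
dec_c_in = Input(shape=(latent_dim,))
dec_out, dec_h, dec_c = decoder_lstm(decoder_inputs,
                                     initial_state=[dec_h_in, dec_c_in])
dec_out = decoder_dense(dec_out)
decoder_model = Model([decoder_inputs, dec_h_in, dec_c_in],
                      [dec_out, dec_h, dec_c])

def decode_sequence(input_seq):
    """Greedy character-by-character decoding (the steps of item 4 above)."""
    # Encode the input sequence into the two state vectors.
    h, c = encoder_model.predict(input_seq)

    # Target sequence of size 1: just the start-of-sequence character
    # (we assume '\t' marks start-of-sequence and '\n' end-of-sequence).
    target_seq = np.zeros((1, 1, num_decoder_tokens))
    target_seq[0, 0, target_token_index['\t']] = 1.0

    decoded_sentence = ''
    while True:
        # Predict the next character from the current states + last character.
        output_tokens, h, c = decoder_model.predict([target_seq, h, c])

        # Sample the next character (simple argmax).
        sampled_index = np.argmax(output_tokens[0, -1, :])
        sampled_char = reverse_target_char_index[sampled_index]
        decoded_sentence += sampled_char

        # Stop at the end-of-sequence character or the length limit.
        if sampled_char == '\n' or len(decoded_sentence) > max_decoder_seq_length:
            break

        # Feed the sampled character back in and carry the states forward.
        target_seq = np.zeros((1, 1, num_decoder_tokens))
        target_seq[0, 0, sampled_index] = 1.0

    return decoded_sentence
```
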
For more information, please check these papers:

* [Sequence to Sequence Learning with Neural Networks](https://arxiv.org/abs/1409.3215)
* [Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation](https://arxiv.org/abs/1406.1078)

How to use it

  1. Download your training dataset
  2. Update the path and the number of training examples (see the sketch below)
  3. Run `python3 training.py`
  4. Run `python3 predictionTranslation.py` for prediction
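
The variables to adjust in step 2 typically sit near the top of training.py; the names and values below are assumptions, shown only as a rough illustration:

```python
# Illustrative settings (assumed names/values; adapt them to your setup):
data_path = 'fra.txt'   # path to the downloaded Anki sentence-pair file
num_samples = 10000     # number of training examples to read from it
```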

LSTM or GRU

By default, the model runs with LSTM (long short-term memory) cells, but we also give the user the option of using GRU cells instead. (A GRU cell has two gates instead of the LSTM's three, which makes training faster.)
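
If you opt for GRU cells, the encoder/decoder wiring changes slightly, since a GRU carries a single state vector instead of the LSTM's (h, c) pair. A hedged sketch, reusing the names and dimensions from the training sketch above:

```python
from keras.layers import Input, GRU, Dense

# Same shapes as the LSTM sketch; a GRU returns one state vector, not two.
encoder_inputs = Input(shape=(None, num_encoder_tokens))
_, encoder_state = GRU(latent_dim, return_state=True)(encoder_inputs)

decoder_inputs = Input(shape=(None, num_decoder_tokens))
decoder_gru = GRU(latent_dim, return_sequences=True, return_state=True)
decoder_outputs, _ = decoder_gru(decoder_inputs, initial_state=[encoder_state])
decoder_outputs = Dense(num_decoder_tokens, activation='softmax')(decoder_outputs)
```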

Downloading weights

We trained this model on the complete English/French dataset. The full training takes weeks, but we already got promising results after 18 hours of training (20 epochs). You can download our weights there.
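
Once downloaded, the weights can be loaded into the same architecture before running prediction. A minimal sketch (the filename is an assumption; use whatever file you downloaded):

```python
# `model` must be built exactly as during training before loading weights.
model.load_weights('s2s_weights.h5')
```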

Our results

Of course, our system is far from being as accurate as Google Translate. But after only 20 epochs, it already translates short sentences accurately.

Example of output:

Input sentence: I love you.

Decoded sentence: Je t'aime !

It is accurate.

Input sentence: We studied.

Decoded sentence: Nous étudions.

It is accurate.

Input sentence: I slept well.

Decoded sentence: J'ai dormi toute la journée.

Same meaning, but the translation is not fully accurate. The right translation would be "J'ai bien dormi."

Input sentence: He worked a lot.

Decoded sentence: Il a travaillé pour un homme riche.

The translation is not correct.

Conclusion

To conclude, our network has learned the basics of English/French translation, but it still requires two things:

  1. A longer training time
  2. A deeper architecture, such as stacked LSTM layers (see the sketch below)
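
As an illustration of the second point, a deeper encoder could stack LSTM layers and initialise the decoder from the states of the top layer. This is a sketch of the idea, not the repo's code:

```python
from keras.layers import Input, LSTM

# Two stacked encoder LSTMs; only the top layer's final states are kept.
encoder_inputs = Input(shape=(None, num_encoder_tokens))
x = LSTM(latent_dim, return_sequences=True)(encoder_inputs)
_, state_h, state_c = LSTM(latent_dim, return_state=True)(x)
encoder_states = [state_h, state_c]
```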