This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 16 - November 20, 2020.

Stars: ✭ 91 (-98.36%)

Mutual labels: neural-machine-translation

Turkish Stemmer Python

🐍 Turkish Language Stemmer for Python

Stars: ✭ 165 (-97.02%)

Mutual labels: natural-language-processing

Nlp Python Deep Learning

NLP in Python with Deep Learning

Stars: ✭ 374 (-93.25%)

Mutual labels: natural-language-processing

Newsrecommender

A news recommendation system tailored for user communities

Stars: ✭ 164 (-97.04%)

Mutual labels: natural-language-processing

pytorch basic nmt

A simple yet strong implementation of neural machine translation in pytorch

Stars: ✭ 66 (-98.81%)

Mutual labels: neural-machine-translation

Lazynlp

Library to scrape and clean web pages to create massive datasets.

Stars: ✭ 1,985 (-64.17%)

Mutual labels: natural-language-processing

Spacy

💫 Industrial-strength Natural Language Processing (NLP) in Python

Stars: ✭ 21,978 (+296.71%)

Mutual labels: natural-language-processing

Covid Papers Browser

Browse Covid-19 & SARS-CoV-2 Scientific Papers with Transformers 🦠 📖

Stars: ✭ 161 (-97.09%)

Mutual labels: natural-language-processing

Attention-Visualization

Visualization for simple attention and Google's multi-head attention.

Stars: ✭ 54 (-99.03%)

Mutual labels: neural-machine-translation

Nlp bahasa resources

A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia

Stars: ✭ 158 (-97.15%)

Mutual labels: natural-language-processing

Awesome Text Generation

A curated list of recent models of text generation and application

Stars: ✭ 370 (-93.32%)

Mutual labels: natural-language-processing

youtokentome-ruby

High performance unsupervised text tokenization for Ruby

Stars: ✭ 17 (-99.69%)

Mutual labels: word-segmentation

Mishkal

Mishkal is an arabic text vocalization software

Stars: ✭ 158 (-97.15%)

Mutual labels: natural-language-processing

Iowncode

A curated collection of iOS, ML, AR resources sprinkled with some UI additions

Stars: ✭ 499 (-90.99%)

Mutual labels: natural-language-processing

Gensim

Topic Modelling for Humans

Stars: ✭ 12,763 (+130.38%)

Mutual labels: natural-language-processing

SymSpellCppPy

Fast SymSpell written in c++ and exposes to python via pybind11

Stars: ✭ 28 (-99.49%)

Mutual labels: word-segmentation

Awesome Pytorch List

A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.

Stars: ✭ 12,475 (+125.18%)

Mutual labels: natural-language-processing

Matchzoo Py

Facilitating the design, comparison and sharing of deep text matching models.

Stars: ✭ 362 (-93.47%)

Mutual labels: natural-language-processing

Visdial Rl

PyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning

Stars: ✭ 157 (-97.17%)

Mutual labels: natural-language-processing

SSAN

How Does Selective Mechanism Improve Self-attention Networks?

Stars: ✭ 18 (-99.68%)

Mutual labels: neural-machine-translation

Speech signal processing and classification

Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a pre-requisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interesting in voice disorder classification. That is, to develop two-class classifiers, which can discriminate between utterances of a subject suffering from say vocal fold paralysis and utterances of a healthy subject.The mathematical modeling of the speech production system in humans suggests that an all-pole system function is justified [1-3]. As a consequence, linear prediction coefficients (LPCs) constitute a first choice for modeling the magnitute of the short-term spectrum of speech. LPC-derived cepstral coefficients are guaranteed to discriminate between the system (e.g., vocal tract) contribution and that of the excitation. Taking into account the characteristics of the human ear, the mel-frequency cepstral coefficients (MFCCs) emerged as descriptive features of the speech spectral envelope. Similarly to MFCCs, the perceptual linear prediction coefficients (PLPs) could also be derived. The aforementioned sort of speaking tradi- tional features will be tested against agnostic-features extracted by convolu- tive neural networks (CNNs) (e.g., auto-encoders) [4]. The pattern recognition step will be based on Gaussian Mixture Model based classifiers,K-nearest neighbor classifiers, Bayes classifiers, as well as Deep Neural Networks. The Massachussets Eye and Ear Infirmary Dataset (MEEI-Dataset) [5] will be exploited. At the application level, a library for feature extraction and classification in Python will be developed. Credible publicly available resources will be 1used toward achieving our goal, such as KALDI. Comparisons will be made against [6-8].

Stars: ✭ 155 (-97.2%)

Mutual labels: natural-language-processing

Open Korean Text

Open Korean Text Processor - An Open-source Korean Text Processor

Stars: ✭ 438 (-92.09%)

Mutual labels: natural-language-processing

Rnn lstm from scratch

How to build RNNs and LSTMs from scratch with NumPy.

Stars: ✭ 156 (-97.18%)

Mutual labels: natural-language-processing

RNNSearch

An implementation of attention-based neural machine translation using Pytorch

Stars: ✭ 43 (-99.22%)

Mutual labels: neural-machine-translation

Deeplearning nlp

基于深度学习的自然语言处理库

Stars: ✭ 154 (-97.22%)

Mutual labels: natural-language-processing

Spacy Streamlit

👑 spaCy building blocks and visualizers for Streamlit apps

Stars: ✭ 360 (-93.5%)

Mutual labels: natural-language-processing

Natural Language Processing Specialization

This repo contains my coursework, assignments, and Slides for Natural Language Processing Specialization by deeplearning.ai on Coursera

Stars: ✭ 151 (-97.27%)

Mutual labels: natural-language-processing

transformer-slt

Sign Language Translation with Transformers (COLING'2020, ECCV'20 SLRTP Workshop)

Stars: ✭ 92 (-98.34%)

Mutual labels: neural-machine-translation

Paraphrase identification

Examine two sentences and determine whether they have the same meaning.

Stars: ✭ 154 (-97.22%)

Mutual labels: natural-language-processing

Seq2seq.pytorch

Sequence-to-Sequence learning using PyTorch

Stars: ✭ 514 (-90.72%)

Mutual labels: neural-machine-translation

Postagga

A Library to parse natural language in pure Clojure and ClojureScript

Stars: ✭ 152 (-97.26%)

Mutual labels: natural-language-processing

Pytorch-NLU

Pytorch-NLU，一个中文文本分类、序列标注工具包，支持中文长文本、短文本的多类、多标签分类任务，支持中文命名实体识别、词性标注、分词等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech ta…

Stars: ✭ 151 (-97.27%)

Mutual labels: word-segmentation

Chineseblue

Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)

Stars: ✭ 149 (-97.31%)

Mutual labels: natural-language-processing

Question generation

Neural question generation using transformers

Stars: ✭ 356 (-93.57%)

Mutual labels: natural-language-processing

Spacymoji

💙 Emoji handling and meta data for spaCy with custom extension attributes

Stars: ✭ 151 (-97.27%)

Mutual labels: natural-language-processing

codeprep

A toolkit for pre-processing large source code corpora

Stars: ✭ 39 (-99.3%)

Mutual labels: word-segmentation

Swiftychrono

A natural language date parser in Swift (ported from chrono.js)

Stars: ✭ 148 (-97.33%)

Mutual labels: natural-language-processing

Code search

Code For Medium Article: "How To Create Natural Language Semantic Search for Arbitrary Objects With Deep Learning"

Stars: ✭ 436 (-92.13%)

Mutual labels: natural-language-processing

Neat Vision

Neat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Processing (NLP) tasks. (framework-agnostic)

Stars: ✭ 213 (-96.16%)

Mutual labels: natural-language-processing

D2l Vn

Một cuốn sách tương tác về học sâu có mã nguồn, toán và thảo luận. Đề cập đến nhiều framework phổ biến (TensorFlow, Pytorch & MXNet) và được sử dụng tại 175 trường Đại học.

Stars: ✭ 402 (-92.74%)

Mutual labels: natural-language-processing

Ner

Named Entity Recognition

Stars: ✭ 288 (-94.8%)

Mutual labels: natural-language-processing

Nlp Roadmap

ROADMAP(Mind Map) and KEYWORD for students those who have interest in learning NLP

Stars: ✭ 2,653 (-52.11%)

Mutual labels: natural-language-processing

Shifterator

Interpretable data visualizations for understanding how texts differ at the word level