Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (+1009.04%)

Mutual labels: language-model

Skip Thoughts.torch

Porting of Skip-Thoughts pretrained models from Theano to PyTorch & Torch7

Stars: ✭ 146 (-22.34%)

Mutual labels: word2vec

Clue

Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Stars: ✭ 2,425 (+1189.89%)

Mutual labels: language-model

Wordvectors

Pre-trained word vectors of 30+ languages

Stars: ✭ 2,043 (+986.7%)

Mutual labels: word2vec

F Lm

Language Modeling

Stars: ✭ 156 (-17.02%)

Mutual labels: language-model

Turkish Word2vec

Pre-trained Word2Vec Model for Turkish

Stars: ✭ 136 (-27.66%)

Mutual labels: word2vec

Word2vec Spam Filter

Using word vectors to classify spam messages

Stars: ✭ 149 (-20.74%)

Mutual labels: word2vec

Lazynlp

Library to scrape and clean web pages to create massive datasets.

Stars: ✭ 1,985 (+955.85%)

Mutual labels: language-model

Ld Net

Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling

Stars: ✭ 148 (-21.28%)

Mutual labels: language-model

Debiaswe

Remove problematic gender bias from word embeddings.

Stars: ✭ 175 (-6.91%)

Mutual labels: word2vec

Word2vec

Go library for performing computations in word2vec binary models

Stars: ✭ 143 (-23.94%)

Mutual labels: word2vec

Keras Xlnet

Implementation of XLNet that can load pretrained checkpoints

Stars: ✭ 159 (-15.43%)

Mutual labels: language-model

Word2vec

A further wrapper around Word2VEC_java written by ansj, which also implements common word-similarity and sentence-similarity computations.

Stars: ✭ 136 (-27.66%)

Mutual labels: word2vec

Sentence Similarity

Experiments with and a comparison of four sentence/text similarity computation methods

Stars: ✭ 181 (-3.72%)

Mutual labels: word2vec

Webvectors

Web-ify your word2vec: framework to serve distributional semantic models online

Stars: ✭ 154 (-18.09%)

Mutual labels: word2vec

Pytorch word2vec

Use pytorch to implement word2vec

Stars: ✭ 133 (-29.26%)

Mutual labels: word2vec

Electra

Chinese pre-trained ELECTRA model: a Chinese model pre-trained with adversarial learning

Stars: ✭ 132 (-29.79%)

Mutual labels: language-model

Fasttext.js

FastText for Node.js

Stars: ✭ 127 (-32.45%)

Mutual labels: word2vec

Macbert

Revisiting Pre-trained Models for Chinese Natural Language Processing (Findings of EMNLP)

Stars: ✭ 167 (-11.17%)

Mutual labels: language-model

Skip Gram Pytorch

A complete pytorch implementation of skip-gram

Stars: ✭ 153 (-18.62%)

Mutual labels: word2vec

Kogpt2 Finetuning

🔥 Korean GPT-2, KoGPT2 FineTuning cased. Trained on Korean lyrics data 🔥

Stars: ✭ 124 (-34.04%)

Mutual labels: language-model

Awesome Sentence Embedding

A curated list of pretrained sentence and word embedding models

Stars: ✭ 1,973 (+949.47%)

Mutual labels: language-model

Xlnet Gen

XLNet for generating language.

Stars: ✭ 164 (-12.77%)

Mutual labels: language-model

Awd Lstm Lm

LSTM and QRNN Language Model Toolkit for PyTorch

Stars: ✭ 1,834 (+875.53%)

Mutual labels: language-model

Splitter

A Pytorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).

Stars: ✭ 177 (-5.85%)

Mutual labels: word2vec

Textfeatures

👷‍♂️ A simple package for extracting useful features from character objects 👷‍♀️

Stars: ✭ 148 (-21.28%)

Mutual labels: word2vec

Danmf

A sparsity aware implementation of "Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection" (CIKM 2018).

Stars: ✭ 161 (-14.36%)

Mutual labels: word2vec

Fasttext4j

Implementing Facebook's FastText with java

Stars: ✭ 148 (-21.28%)

Mutual labels: word2vec

Keras Bert

Implementation of BERT that could load official pre-trained models for feature extraction and prediction

Stars: ✭ 2,264 (+1104.26%)

Mutual labels: language-model

Wordembeddings Elmo Fasttext Word2vec

Using pre-trained word embeddings (Fasttext, Word2Vec)

Stars: ✭ 146 (-22.34%)

Mutual labels: word2vec

Entity2rec

entity2rec generates item recommendation using property-specific knowledge graph embeddings

Stars: ✭ 159 (-15.43%)

Mutual labels: word2vec

Tupe

Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding in Language Pre-training". Improve existing models like BERT.

Stars: ✭ 143 (-23.94%)

Mutual labels: language-model

Deep Math Machine Learning.ai

A blog about machine learning and deep learning algorithms and the math behind them, with machine learning algorithms written from scratch.

Stars: ✭ 173 (-7.98%)

Mutual labels: word2vec

Nlp research

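NLP research: a TensorFlow-based NLP deep learning project supporting four major tasks: text classification, sentence matching, sequence labeling, and text generation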

Stars: ✭ 141 (-25%)

Mutual labels: word2vec

Gensim

Topic Modelling for Humans

Stars: ✭ 12,763 (+6688.83%)

Mutual labels: word2vec

sourced.ml is a library and command line tools to build and apply machine learning models on top of Universal Abstract Syntax Trees

Stars: ✭ 136 (-27.66%)

Mutual labels: word2vec

Text Pairs Relation Classification

About Text Pairs (Sentence Level) Classification (Similarity Modeling) Based on Neural Network.

Stars: ✭ 182 (-3.19%)

Mutual labels: word2vec

Role2vec

A scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).

Stars: ✭ 134 (-28.72%)

Mutual labels: word2vec

Text2vec

text2vec, Chinese text to vector. (A text vectorization tool, including word vectorization, sentence vectorization, and sentence similarity computation)

Stars: ✭ 155 (-17.55%)

Mutual labels: word2vec

Scattertext Pydata

Notebooks for the Seattle PyData 2017 talk on Scattertext

Stars: ✭ 132 (-29.79%)

Mutual labels: word2vec

Log Anomaly Detector

Log Anomaly Detection - Machine learning to detect abnormal events in logs

Stars: ✭ 169 (-10.11%)

Mutual labels: word2vec

Chars2vec

Character-based word embeddings model based on RNN for handling real world texts

Stars: ✭ 130 (-30.85%)

Mutual labels: language-model

Transformer Lm

Transformer language model (GPT-2) with sentencepiece tokenizer

Stars: ✭ 154 (-18.09%)

Mutual labels: language-model

Ml Projects

ML based projects such as Spam Classification, Time Series Analysis, Text Classification using Random Forest, Deep Learning, Bayesian, Xgboost in Python

Stars: ✭ 127 (-32.45%)

Mutual labels: word2vec

Optimus

Optimus: the first large-scale pre-trained VAE language model

Stars: ✭ 180 (-4.26%)

Mutual labels: language-model

Speecht

Open-source speech-to-text software written in TensorFlow

Stars: ✭ 152 (-19.15%)

Mutual labels: language-model

Dynamic Memory Networks Plus Pytorch

Implementation of Dynamic memory networks plus in Pytorch

Stars: ✭ 123 (-34.57%)

Mutual labels: language-model

Hierarchical Attention Network

Implementation of Hierarchical Attention Networks in PyTorch

Stars: ✭ 120 (-36.17%)

Mutual labels: word2vec

Scattertext

Beautiful visualizations of how language differs among document types.

Stars: ✭ 1,722 (+815.96%)

Mutual labels: word2vec

Gpt Neo

An implementation of model parallel GPT2 & GPT3-like models, with the ability to scale up to full GPT3 sizes (and possibly more!), using the mesh-tensorflow library.

Stars: ✭ 1,252 (+565.96%)

Mutual labels: language-model

Graphwavemachine

A scalable implementation of "Learning Structural Node Embeddings Via Diffusion Wavelets (KDD 2018)".

Stars: ✭ 151 (-19.68%)

Mutual labels: word2vec

Robbert

A Dutch RoBERTa-based language model

Stars: ✭ 120 (-36.17%)

Mutual labels: language-model

Haystack

🔍 Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.

Stars: ✭ 3,409 (+1713.3%)

Mutual labels: language-model

Embedding As Service

One-Stop Solution to encode sentence to fixed length vectors from various embedding techniques

Stars: ✭ 151 (-19.68%)

Mutual labels: word2vec

Bert As Language Model

bert as language model, fork from https://github.com/google-research/bert

Stars: ✭ 185 (-1.6%)

Mutual labels: language-model

