756 open-source projects that are alternatives to or similar to Sentencepiece

Multimodal Sentiment Analysis
Attention-based multimodal fusion for sentiment analysis
Stars: ✭ 172 (-96.9%)
Ai Job Notes
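Job-hunting guide for AI algorithm positions (covers preparation guides, practice-problem guides, referrals, a list of AI companies, and other materials)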
Stars: ✭ 3,191 (-42.4%)
Spark Nlp
State of the Art Natural Language Processing
Stars: ✭ 2,518 (-54.55%)
Practical Nlp
Official Repository for 'Practical Natural Language Processing' by O'Reilly Media
Stars: ✭ 452 (-91.84%)
Syfertext
A privacy preserving NLP framework
Stars: ✭ 170 (-96.93%)
Fakenewscorpus
A dataset of millions of news articles scraped from a curated list of data sources.
Stars: ✭ 255 (-95.4%)
Vntk
Vietnamese NLP Toolkit for Node
Stars: ✭ 170 (-96.93%)
Usc Ds Relationextraction
Distantly Supervised Relation Extraction
Stars: ✭ 378 (-93.18%)
Data Science Toolkit
Collection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (-96.95%)
hashformers
Hashformers is a framework for hashtag segmentation with transformers.
Stars: ✭ 18 (-99.68%)
Mutual labels:  word-segmentation
Acl Anthology
Data and software for building the ACL Anthology.
Stars: ✭ 168 (-96.97%)
Awesome Semi Supervised Learning
📜 An up-to-date & curated list of awesome semi-supervised learning papers, methods & resources.
Stars: ✭ 538 (-90.29%)
Lineflow
⚡️A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python
Stars: ✭ 168 (-96.97%)
banglanmt
This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 16 - November 20, 2020.
Stars: ✭ 91 (-98.36%)
Turkish Stemmer Python
🐍 Turkish Language Stemmer for Python
Stars: ✭ 165 (-97.02%)
Nlp Python Deep Learning
NLP in Python with Deep Learning
Stars: ✭ 374 (-93.25%)
Newsrecommender
A news recommendation system tailored for user communities
Stars: ✭ 164 (-97.04%)
pytorch basic nmt
A simple yet strong implementation of neural machine translation in PyTorch
Stars: ✭ 66 (-98.81%)
Lazynlp
Library to scrape and clean web pages to create massive datasets.
Stars: ✭ 1,985 (-64.17%)
Spacy
💫 Industrial-strength Natural Language Processing (NLP) in Python
Stars: ✭ 21,978 (+296.71%)
Covid Papers Browser
Browse Covid-19 & SARS-CoV-2 Scientific Papers with Transformers 🦠 📖
Stars: ✭ 161 (-97.09%)
Attention-Visualization
Visualization for simple attention and Google's multi-head attention.
Stars: ✭ 54 (-99.03%)
Nlp bahasa resources
A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
Stars: ✭ 158 (-97.15%)
Awesome Text Generation
A curated list of recent models of text generation and application
Stars: ✭ 370 (-93.32%)
youtokentome-ruby
High performance unsupervised text tokenization for Ruby
Stars: ✭ 17 (-99.69%)
Mutual labels:  word-segmentation
Mishkal
Mishkal is an Arabic text vocalization software
Stars: ✭ 158 (-97.15%)
Iowncode
A curated collection of iOS, ML, AR resources sprinkled with some UI additions
Stars: ✭ 499 (-90.99%)
Gensim
Topic Modelling for Humans
Stars: ✭ 12,763 (+130.38%)
SymSpellCppPy
Fast SymSpell written in C++ and exposed to Python via pybind11
Stars: ✭ 28 (-99.49%)
Mutual labels:  word-segmentation
Awesome Pytorch List
A comprehensive list of PyTorch-related content on GitHub, such as different models, implementations, helper libraries, tutorials, etc.
Stars: ✭ 12,475 (+125.18%)
Matchzoo Py
Facilitating the design, comparison and sharing of deep text matching models.
Stars: ✭ 362 (-93.47%)
Visdial Rl
PyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning
Stars: ✭ 157 (-97.17%)
SSAN
How Does Selective Mechanism Improve Self-attention Networks?
Stars: ✭ 18 (-99.68%)
Speech signal processing and classification
Front-end speech processing aims at extracting proper features from short-term segments of a speech utterance, known as frames. It is a prerequisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interested in voice disorder classification, that is, in developing two-class classifiers that can discriminate between utterances of a subject suffering from, say, vocal fold paralysis and utterances of a healthy subject. The mathematical modeling of the speech production system in humans suggests that an all-pole system function is justified [1-3]. As a consequence, linear prediction coefficients (LPCs) constitute a first choice for modeling the magnitude of the short-term spectrum of speech. LPC-derived cepstral coefficients are guaranteed to discriminate between the system (e.g., vocal tract) contribution and that of the excitation. Taking into account the characteristics of the human ear, the mel-frequency cepstral coefficients (MFCCs) emerged as descriptive features of the speech spectral envelope. Similarly to MFCCs, the perceptual linear prediction coefficients (PLPs) could also be derived. The aforementioned, so to speak, traditional features will be tested against agnostic features extracted by convolutional neural networks (CNNs) (e.g., auto-encoders) [4]. The pattern recognition step will be based on Gaussian Mixture Model based classifiers, K-nearest neighbor classifiers, Bayes classifiers, as well as Deep Neural Networks. The Massachusetts Eye and Ear Infirmary Dataset (MEEI-Dataset) [5] will be exploited. At the application level, a library for feature extraction and classification in Python will be developed. Credible publicly available resources, such as KALDI, will be used toward achieving our goal. Comparisons will be made against [6-8].
Stars: ✭ 155 (-97.2%)
Open Korean Text
Open Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (-92.09%)
Rnn lstm from scratch
How to build RNNs and LSTMs from scratch with NumPy.
Stars: ✭ 156 (-97.18%)
RNNSearch
An implementation of attention-based neural machine translation using Pytorch
Stars: ✭ 43 (-99.22%)
Deeplearning nlp
A natural language processing library based on deep learning
Stars: ✭ 154 (-97.22%)
Spacy Streamlit
👑 spaCy building blocks and visualizers for Streamlit apps
Stars: ✭ 360 (-93.5%)
Natural Language Processing Specialization
This repo contains my coursework, assignments, and Slides for Natural Language Processing Specialization by deeplearning.ai on Coursera
Stars: ✭ 151 (-97.27%)
transformer-slt
Sign Language Translation with Transformers (COLING'2020, ECCV'20 SLRTP Workshop)
Stars: ✭ 92 (-98.34%)
Paraphrase identification
Examine two sentences and determine whether they have the same meaning.
Stars: ✭ 154 (-97.22%)
Seq2seq.pytorch
Sequence-to-Sequence learning using PyTorch
Stars: ✭ 514 (-90.72%)
Postagga
A Library to parse natural language in pure Clojure and ClojureScript
Stars: ✭ 152 (-97.26%)
Pytorch-NLU
Pytorch-NLU, a Chinese text classification and sequence labeling toolkit. It supports multi-class and multi-label classification of long and short Chinese texts, as well as sequence labeling tasks such as Chinese named entity recognition, part-of-speech tagging, and word segmentation.
Stars: ✭ 151 (-97.27%)
Mutual labels:  word-segmentation
Chineseblue
Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)
Stars: ✭ 149 (-97.31%)
Question generation
Neural question generation using transformers
Stars: ✭ 356 (-93.57%)
Spacymoji
💙 Emoji handling and meta data for spaCy with custom extension attributes
Stars: ✭ 151 (-97.27%)
codeprep
A toolkit for pre-processing large source code corpora
Stars: ✭ 39 (-99.3%)
Mutual labels:  word-segmentation
Swiftychrono
A natural language date parser in Swift (ported from chrono.js)
Stars: ✭ 148 (-97.33%)
Code search
Code For Medium Article: "How To Create Natural Language Semantic Search for Arbitrary Objects With Deep Learning"
Stars: ✭ 436 (-92.13%)
Neat Vision
Neat (Neural Attention) Vision is a framework-agnostic visualization tool for the attention mechanisms of deep-learning models for Natural Language Processing (NLP) tasks.
Stars: ✭ 213 (-96.16%)
D2l Vn
An interactive deep learning book with code, math, and discussions. It covers several popular frameworks (TensorFlow, PyTorch & MXNet) and is used at 175 universities.
Stars: ✭ 402 (-92.74%)
Ner
Named Entity Recognition
Stars: ✭ 288 (-94.8%)
Nlp Roadmap
Roadmap (mind map) and keywords for students interested in learning NLP
Stars: ✭ 2,653 (-52.11%)
Shifterator
Interpretable data visualizations for understanding how texts differ at the word level
Stars: ✭ 209 (-96.23%)
Medacy
🏥 Medical Text Mining and Information Extraction with spaCy
Stars: ✭ 287 (-94.82%)
Graph Convolution Nlp
Graph Convolution Network for NLP
Stars: ✭ 208 (-96.25%)
Kagnet
Knowledge-Aware Graph Networks for Commonsense Reasoning (EMNLP-IJCNLP 19)
Stars: ✭ 205 (-96.3%)
Opennmt Py
Open Source Neural Machine Translation in PyTorch
Stars: ✭ 5,378 (-2.92%)