Ai Job NotesAI算法岗求职攻略(涵盖准备攻略、刷题指南、内推和AI公司清单等资料)
Stars: ✭ 3,191 (-42.4%)
Spark NlpState of the Art Natural Language Processing
Stars: ✭ 2,518 (-54.55%)
Practical NlpOfficial Repository for 'Practical Natural Language Processing' by O'Reilly Media
Stars: ✭ 452 (-91.84%)
SyfertextA privacy preserving NLP framework
Stars: ✭ 170 (-96.93%)
FakenewscorpusA dataset of millions of news articles scraped from a curated list of data sources.
Stars: ✭ 255 (-95.4%)
VntkVietnamese NLP Toolkit for Node
Stars: ✭ 170 (-96.93%)
Data Science ToolkitCollection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (-96.95%)
hashformersHashformers is a framework for hashtag segmentation with transformers.
Stars: ✭ 18 (-99.68%)
Acl AnthologyData and software for building the ACL Anthology.
Stars: ✭ 168 (-96.97%)
Lineflow⚡️A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python
Stars: ✭ 168 (-96.97%)
banglanmtThis repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 16 - November 20, 2020.
Stars: ✭ 91 (-98.36%)
NewsrecommenderA news recommendation system tailored for user communities
Stars: ✭ 164 (-97.04%)
pytorch basic nmtA simple yet strong implementation of neural machine translation in pytorch
Stars: ✭ 66 (-98.81%)
LazynlpLibrary to scrape and clean web pages to create massive datasets.
Stars: ✭ 1,985 (-64.17%)
Spacy💫 Industrial-strength Natural Language Processing (NLP) in Python
Stars: ✭ 21,978 (+296.71%)
Covid Papers BrowserBrowse Covid-19 & SARS-CoV-2 Scientific Papers with Transformers 🦠 📖
Stars: ✭ 161 (-97.09%)
Nlp bahasa resourcesA Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
Stars: ✭ 158 (-97.15%)
youtokentome-rubyHigh performance unsupervised text tokenization for Ruby
Stars: ✭ 17 (-99.69%)
MishkalMishkal is an arabic text vocalization software
Stars: ✭ 158 (-97.15%)
IowncodeA curated collection of iOS, ML, AR resources sprinkled with some UI additions
Stars: ✭ 499 (-90.99%)
GensimTopic Modelling for Humans
Stars: ✭ 12,763 (+130.38%)
SymSpellCppPyFast SymSpell written in c++ and exposes to python via pybind11
Stars: ✭ 28 (-99.49%)
Awesome Pytorch ListA comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
Stars: ✭ 12,475 (+125.18%)
Matchzoo PyFacilitating the design, comparison and sharing of deep text matching models.
Stars: ✭ 362 (-93.47%)
Visdial RlPyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning
Stars: ✭ 157 (-97.17%)
SSANHow Does Selective Mechanism Improve Self-attention Networks?
Stars: ✭ 18 (-99.68%)
Speech signal processing and classificationFront-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a pre-requisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interesting in voice disorder classification. That is, to develop two-class classifiers, which can discriminate between utterances of a subject suffering from say vocal fold paralysis and utterances of a healthy subject.The mathematical modeling of the speech production system in humans suggests that an all-pole system function is justified [1-3]. As a consequence, linear prediction coefficients (LPCs) constitute a first choice for modeling the magnitute of the short-term spectrum of speech. LPC-derived cepstral coefficients are guaranteed to discriminate between the system (e.g., vocal tract) contribution and that of the excitation. Taking into account the characteristics of the human ear, the mel-frequency cepstral coefficients (MFCCs) emerged as descriptive features of the speech spectral envelope. Similarly to MFCCs, the perceptual linear prediction coefficients (PLPs) could also be derived. The aforementioned sort of speaking tradi- tional features will be tested against agnostic-features extracted by convolu- tive neural networks (CNNs) (e.g., auto-encoders) [4]. The pattern recognition step will be based on Gaussian Mixture Model based classifiers,K-nearest neighbor classifiers, Bayes classifiers, as well as Deep Neural Networks. The Massachussets Eye and Ear Infirmary Dataset (MEEI-Dataset) [5] will be exploited. At the application level, a library for feature extraction and classification in Python will be developed. Credible publicly available resources will be 1used toward achieving our goal, such as KALDI. Comparisons will be made against [6-8].
Stars: ✭ 155 (-97.2%)
Open Korean TextOpen Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (-92.09%)
RNNSearchAn implementation of attention-based neural machine translation using Pytorch
Stars: ✭ 43 (-99.22%)
Spacy Streamlit👑 spaCy building blocks and visualizers for Streamlit apps
Stars: ✭ 360 (-93.5%)
transformer-sltSign Language Translation with Transformers (COLING'2020, ECCV'20 SLRTP Workshop)
Stars: ✭ 92 (-98.34%)
Seq2seq.pytorchSequence-to-Sequence learning using PyTorch
Stars: ✭ 514 (-90.72%)
PostaggaA Library to parse natural language in pure Clojure and ClojureScript
Stars: ✭ 152 (-97.26%)
Pytorch-NLUPytorch-NLU,一个中文文本分类、序列标注工具包,支持中文长文本、短文本的多类、多标签分类任务,支持中文命名实体识别、词性标注、分词等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech ta…
Stars: ✭ 151 (-97.27%)
ChineseblueChinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)
Stars: ✭ 149 (-97.31%)
Spacymoji💙 Emoji handling and meta data for spaCy with custom extension attributes
Stars: ✭ 151 (-97.27%)
codeprepA toolkit for pre-processing large source code corpora
Stars: ✭ 39 (-99.3%)
SwiftychronoA natural language date parser in Swift (ported from chrono.js)
Stars: ✭ 148 (-97.33%)
Code searchCode For Medium Article: "How To Create Natural Language Semantic Search for Arbitrary Objects With Deep Learning"
Stars: ✭ 436 (-92.13%)
Neat VisionNeat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Processing (NLP) tasks. (framework-agnostic)
Stars: ✭ 213 (-96.16%)
D2l VnMột cuốn sách tương tác về học sâu có mã nguồn, toán và thảo luận. Đề cập đến nhiều framework phổ biến (TensorFlow, Pytorch & MXNet) và được sử dụng tại 175 trường Đại học.
Stars: ✭ 402 (-92.74%)
NerNamed Entity Recognition
Stars: ✭ 288 (-94.8%)
Nlp RoadmapROADMAP(Mind Map) and KEYWORD for students those who have interest in learning NLP
Stars: ✭ 2,653 (-52.11%)
ShifteratorInterpretable data visualizations for understanding how texts differ at the word level
Stars: ✭ 209 (-96.23%)
Medacy🏥 Medical Text Mining and Information Extraction with spaCy
Stars: ✭ 287 (-94.82%)
KagnetKnowledge-Aware Graph Networks for Commonsense Reasoning (EMNLP-IJCNLP 19)
Stars: ✭ 205 (-96.3%)
Opennmt PyOpen Source Neural Machine Translation in PyTorch
Stars: ✭ 5,378 (-2.92%)