LazynlpLibrary to scrape and clean web pages to create massive datasets.
UdpipeR package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Ngx Dynamic Dashboard FrameworkThis is a JSON driven angular x based dashboard framework that is inspired by JIRA's dashboard implementation and https://github.com/raulgomis/angular-dashboard-framework
Nlp bahasa resourcesA Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
MixtextMixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification
Pytorch NlpBasic Utilities for PyTorch Natural Language Processing (NLP)
Mtbook《机器翻译:基础与模型》肖桐 朱靖波 著 - Machine Translation: Foundations and Models
MishkalMishkal is an arabic text vocalization software
NlprePython library for Natural Language Preprocessing (NLPre)
GensimTopic Modelling for Humans
Awesome Nlp📖 A curated list of resources dedicated to Natural Language Processing (NLP)
Awesome Pytorch ListA comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
SlingSLING - A natural language frame semantics parser
Visdial RlPyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning
Speech signal processing and classificationFront-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a pre-requisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interesting in voice disorder classification. That is, to develop two-class classifiers, which can discriminate between utterances of a subject suffering from say vocal fold paralysis and utterances of a healthy subject.The mathematical modeling of the speech production system in humans suggests that an all-pole system function is justified [1-3]. As a consequence, linear prediction coefficients (LPCs) constitute a first choice for modeling the magnitute of the short-term spectrum of speech. LPC-derived cepstral coefficients are guaranteed to discriminate between the system (e.g., vocal tract) contribution and that of the excitation. Taking into account the characteristics of the human ear, the mel-frequency cepstral coefficients (MFCCs) emerged as descriptive features of the speech spectral envelope. Similarly to MFCCs, the perceptual linear prediction coefficients (PLPs) could also be derived. The aforementioned sort of speaking tradi- tional features will be tested against agnostic-features extracted by convolu- tive neural networks (CNNs) (e.g., auto-encoders) [4]. The pattern recognition step will be based on Gaussian Mixture Model based classifiers,K-nearest neighbor classifiers, Bayes classifiers, as well as Deep Neural Networks. The Massachussets Eye and Ear Infirmary Dataset (MEEI-Dataset) [5] will be exploited. At the application level, a library for feature extraction and classification in Python will be developed. Credible publicly available resources will be 1used toward achieving our goal, such as KALDI. Comparisons will be made against [6-8].
SwagafRepository for paper "SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference"
PythonrougePython wrapper for evaluating summarization quality by ROUGE package
ChemdataextractorAutomatically extract chemical information from scientific documents
PostaggaA Library to parse natural language in pure Clojure and ClojureScript
Crf Layer On The Top Of BilstmThe CRF Layer was implemented by using Chainer 2.0. Please see more details here: https://createmomo.github.io/2017/09/12/CRF_Layer_on_the_Top_of_BiLSTM_1/
ChineseblueChinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)
Finnlp ProgressNLP progress in Fintech. A repository to track the progress in Natural Language Processing (NLP) related to the domain of Finance, including the datasets, papers, and current state-of-the-art results for the most popular tasks.
Spacymoji💙 Emoji handling and meta data for spaCy with custom extension attributes
Spacy Course👩🏫 Advanced NLP with spaCy: A free online course
SwiftychronoA natural language date parser in Swift (ported from chrono.js)
NegapojiJapanese negative positive classification.日本語文書のネガポジを判定。
Turkce Yapay Zeka KaynaklariTürkiye'de yapılan derin öğrenme (deep learning) ve makine öğrenmesi (machine learning) çalışmalarının derlendiği sayfa.
Hands On Natural Language Processing With PythonThis repository is for my students of Udemy. You can find all lecture codes along with mentioned files for reading in here. So, feel free to clone it and if you have any problem just raise a question.
GooglelanguagerR client for the Google Translation API, Google Cloud Natural Language API and Google Cloud Speech API
Awesome Nlp ResourcesThis repository contains landmark research papers in Natural Language Processing that came out in this century.
AbsapapersWorth-reading papers and related awesome resources on aspect-based sentiment analysis (ABSA). 值得一读的方面级情感分析论文与相关资源集合
Multihead Siamese NetsImplementation of Siamese Neural Networks built upon multihead attention mechanism for text semantic similarity task.
Monkeylearn PythonOfficial Python client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Python apps.
NeusumCode for the ACL 2018 paper "Neural Document Summarization by Jointly Learning to Score and Select Sentences"
Stanza OldStanford NLP group's shared Python tools.
Paper Survey📚Survey of previous research and related works on machine learning (especially Deep Learning) in Japanese
NlpaugData augmentation for NLP
Practical Machine Learning With PythonMaster the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
Learn To Select DataCode for Learning to select data for transfer learning with Bayesian Optimization