Text ClassificationMachine Learning and NLP: Text Classification using python, scikit-learn and NLTK
Speech signal processing and classificationFront-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a pre-requisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interesting in voice disorder classification. That is, to develop two-class classifiers, which can discriminate between utterances of a subject suffering from say vocal fold paralysis and utterances of a healthy subject.The mathematical modeling of the speech production system in humans suggests that an all-pole system function is justified [1-3]. As a consequence, linear prediction coefficients (LPCs) constitute a first choice for modeling the magnitute of the short-term spectrum of speech. LPC-derived cepstral coefficients are guaranteed to discriminate between the system (e.g., vocal tract) contribution and that of the excitation. Taking into account the characteristics of the human ear, the mel-frequency cepstral coefficients (MFCCs) emerged as descriptive features of the speech spectral envelope. Similarly to MFCCs, the perceptual linear prediction coefficients (PLPs) could also be derived. The aforementioned sort of speaking tradi- tional features will be tested against agnostic-features extracted by convolu- tive neural networks (CNNs) (e.g., auto-encoders) [4]. The pattern recognition step will be based on Gaussian Mixture Model based classifiers,K-nearest neighbor classifiers, Bayes classifiers, as well as Deep Neural Networks. The Massachussets Eye and Ear Infirmary Dataset (MEEI-Dataset) [5] will be exploited. At the application level, a library for feature extraction and classification in Python will be developed. Credible publicly available resources will be 1used toward achieving our goal, such as KALDI. Comparisons will be made against [6-8].
Proctoring AiCreating a software for automatic monitoring in online proctoring
Practical Machine Learning With PythonMaster the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
Ai Chatbot FrameworkA python chatbot framework with Natural Language Understanding and Artificial Intelligence.
Python nlp tutorialThis repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)
Text Analytics With PythonLearn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
StocksightStock market analyzer and predictor using Elasticsearch, Twitter, News headlines and Python natural language processing and sentiment analysis
TextblobSimple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
Ryuzaki botSimple chatbot in Python using NLTK and scikit-learn
Rake NltkPython implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.
WikiquizGenerates a quiz for a Wikipedia page using parts of speech and text chunking.
CltkThe Classical Language Toolkit
GitsuggestA tool to suggest github repositories based on the repositories you have shown interest in.
WatcherWatcher - Open Source Cybersecurity Threat Hunting Platform. Developed with Django & React JS.
resume tailorAn unsupervised analysis combining topic modeling and clustering to preserve an individuals work history and credentials while tailoring their resume towards a new career field
billboard🎤 Lyrics/associated NLP data for Billboard's Top 100, 1950-2015.
curso-IRIIntrodução à Recuperação de Informações
tweets-preprocessorRepo containing the Twitter preprocessor module, developed by the AUTH OSWinds team
Product-Categorization-NLPMulti-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
ru punktRussian language support for NLTK's PunktSentenceTokenizer
Stock-Analyser📈 Stocks technical analysis code collection and Stocks data platform.
nlp-akashNatural Language Processing notes and implementations.
character-extractionExtracts character names from a text file and performs analysis of text sentences containing the names.
nlp-cheat-sheet-pythonNLP Cheat Sheet, Python, spacy, LexNPL, NLTK, tokenization, stemming, sentence detection, named entity recognition
NRCLexAn affect generator based on TextBlob and the NRC affect lexicon. Note that lexicon license is for research purposes only.
nlp workshop odsc europe20Extensive tutorials for the Advanced NLP Workshop in Open Data Science Conference Europe 2020. We will leverage machine learning, deep learning and deep transfer learning to learn and solve popular tasks using NLP including NER, Classification, Recommendation \ Information Retrieval, Summarization, Classification, Language Translation, Q&A and T…
Deception-Detection-on-Amazon-reviews-datasetA SVM model that classifies the reviews as real or fake. Used both the review text and the additional features contained in the data set to build a model that predicted with over 85% accuracy without using any deep learning techniques.
namebotA company/project name generator for Python. Uses NLTK and diverse techniques derived from existing corporate etymologies and naming agencies for sophisticated word generation and ideation.
ResumeRiseAn NLP tool which classifies and summarizes resumes
pygramsExtracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence