kwxBERT, LDA, and TFIDF based keyword extraction in Python
Stars: ✭ 33 (+83.33%)
Pyss3A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI
Stars: ✭ 191 (+961.11%)
Artificial Adversary🗣️ Tool to generate adversarial text examples and test machine learning models against them
Stars: ✭ 348 (+1833.33%)
NLP ToolkitLibrary of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (+411.11%)
support-tickets-classificationThis case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en
Stars: ✭ 142 (+688.89%)
HdltexHDLTex: Hierarchical Deep Learning for Text Classification
Stars: ✭ 191 (+961.11%)
TextheroText preprocessing, representation and visualization from zero to hero.
Stars: ✭ 2,407 (+13272.22%)
text analysis tools中文文本分析工具包(包括- 文本分类 - 文本聚类 - 文本相似性 - 关键词抽取 - 关键短语抽取 - 情感分析 - 文本纠错 - 文本摘要 - 主题关键词-同义词、近义词-事件三元组抽取)
Stars: ✭ 410 (+2177.78%)
Text-Classification-LSTMs-PyTorchThe aim of this repository is to show a baseline model for text classification by implementing a LSTM-based model coded in PyTorch. In order to provide a better understanding of the model, it will be used a Tweets dataset provided by Kaggle.
Stars: ✭ 45 (+150%)
Text mining resourcesResources for learning about Text Mining and Natural Language Processing
Stars: ✭ 358 (+1888.89%)
DaDengAndHisPython【微信公众号:大邓和他的python】, Python语法快速入门https://www.bilibili.com/video/av44384851 Python网络爬虫快速入门https://www.bilibili.com/video/av72010301, 我的联系邮箱
[email protected] Stars: ✭ 59 (+227.78%)
Nlp In PracticeStarter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Stars: ✭ 790 (+4288.89%)
policy-data-analyzerBuilding a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.
Stars: ✭ 22 (+22.22%)
RmdlRMDL: Random Multimodel Deep Learning for Classification
Stars: ✭ 375 (+1983.33%)
ShallowlearnAn experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (+988.89%)
GeniusEasily access song lyrics from Genius in a tibble.
Stars: ✭ 111 (+516.67%)
UdpipeR package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Stars: ✭ 160 (+788.89%)
Text predictorChar-level RNN LSTM text generator📄.
Stars: ✭ 99 (+450%)
LexiconA data package containing lexicons and dictionaries for text analysis
Stars: ✭ 87 (+383.33%)
R Text DataList of textual data sources to be used for text mining in R
Stars: ✭ 85 (+372.22%)
Awesome Nlp📖 A curated list of resources dedicated to Natural Language Processing (NLP)
Stars: ✭ 12,626 (+70044.44%)
Orange3 Text🍊 📄 Text Mining add-on for Orange3
Stars: ✭ 83 (+361.11%)
Textcluster短文本聚类预处理模块 Short text cluster
Stars: ✭ 115 (+538.89%)
LazynlpLibrary to scrape and clean web pages to create massive datasets.
Stars: ✭ 1,985 (+10927.78%)
Learning Social Media Analytics With RThis repository contains code and bonus content which will be added from time to time for the book "Learning Social Media Analytics with R" by Packt
Stars: ✭ 102 (+466.67%)
Lda Topic ModelingA PureScript, browser-based implementation of LDA topic modeling.
Stars: ✭ 91 (+405.56%)
PyphoneticsA Python 3 phonetics library.
Stars: ✭ 61 (+238.89%)
Gwu data miningMaterials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (+1105.56%)
Textfeatures👷♂️ A simple package for extracting useful features from character objects 👷♀️
Stars: ✭ 148 (+722.22%)
Python nlp tutorialThis repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)
Stars: ✭ 72 (+300%)
ChemdataextractorAutomatically extract chemical information from scientific documents
Stars: ✭ 152 (+744.44%)
PipeitPipeIt is a text transformation, conversion, cleansing and extraction tool.
Stars: ✭ 57 (+216.67%)
KonlpyPython package for Korean natural language processing.
Stars: ✭ 1,098 (+6000%)
XiocExtract indicators of compromise from text, including "escaped" ones.
Stars: ✭ 148 (+722.22%)
NgramFast n-Gram Tokenization
Stars: ✭ 55 (+205.56%)
BreadabilityReworked https://www.readability.com/ parsing library (now https://mercury.postlight.com/ is living alternative)
Stars: ✭ 186 (+933.33%)
QdapQuantitative Discourse Analysis Package: Bridging the gap between qualitative data and quantitative analysis
Stars: ✭ 146 (+711.11%)
Spark NkpNatural Korean Processor for Apache Spark
Stars: ✭ 50 (+177.78%)
TadwAn implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Stars: ✭ 43 (+138.89%)
Hands On Natural Language Processing With PythonThis repository is for my students of Udemy. You can find all lecture codes along with mentioned files for reading in here. So, feel free to clone it and if you have any problem just raise a question.
Stars: ✭ 146 (+711.11%)
Friend.lyA social media platform with a friend recommendation engine based on personality trait extraction
Stars: ✭ 41 (+127.78%)
Gsoc2018 3gm💫 Automated codification of Greek Legislation with NLP
Stars: ✭ 36 (+100%)
KateCode & data accompanying the KDD 2017 paper "KATE: K-Competitive Autoencoder for Text"
Stars: ✭ 135 (+650%)
TidytextText mining using tidy tools ✨📄✨
Stars: ✭ 975 (+5316.67%)
Metasra PipelineMetaSRA: normalized sample-specific metadata for the Sequence Read Archive
Stars: ✭ 33 (+83.33%)
Datasciencera curated list of R tutorials for Data Science, NLP and Machine Learning
Stars: ✭ 1,727 (+9494.44%)
Tidy Text MiningManuscript of the book "Tidy Text Mining with R" by Julia Silge and David Robinson
Stars: ✭ 961 (+5238.89%)
QminerAnalytic platform for real-time large-scale streams containing structured and unstructured data.
Stars: ✭ 206 (+1044.44%)
Nlp profilerA simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Stars: ✭ 181 (+905.56%)
KhcoderKH Coder: for Quantitative Content Analysis or Text Mining
Stars: ✭ 126 (+600%)
NlpplnNLP pipeline software using common workflow language
Stars: ✭ 31 (+72.22%)