LSXA word embeddings-based semi-supervised model for document scaling
Stars: ✭ 42 (+68%)
Open Semantic SearchOpen Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)
Stars: ✭ 386 (+1444%)
R Text DataList of textual data sources to be used for text mining in R
Stars: ✭ 85 (+240%)
kwxBERT, LDA, and TFIDF based keyword extraction in Python
Stars: ✭ 33 (+32%)
SmltarManuscript of the book "Supervised Machine Learning for Text Analysis in R" by Emil Hvitfeldt and Julia Silge
Stars: ✭ 125 (+400%)
Giveme5w1hExtraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?
Stars: ✭ 316 (+1164%)
Woke✊ Detect non-inclusive language in your source code.
Stars: ✭ 190 (+660%)
Giveme5WExtraction of the five journalistic W-questions (5W) from news articles
Stars: ✭ 16 (-36%)
HurdleDMR.jlHurdle Distributed Multinomial Regression (HDMR) implemented in Julia
Stars: ✭ 19 (-24%)
HomerHomer, a text analyser in Python, can help make your text more clear, simple and useful for your readers.
Stars: ✭ 607 (+2328%)
QdapQuantitative Discourse Analysis Package: Bridging the gap between qualitative data and quantitative analysis
Stars: ✭ 146 (+484%)
Whatlang RsNatural language detection library for Rust. Try demo online: https://www.greyblake.com/whatlang/
Stars: ✭ 400 (+1500%)
ShifteratorInterpretable data visualizations for understanding how texts differ at the word level
Stars: ✭ 209 (+736%)
Text mining resourcesResources for learning about Text Mining and Natural Language Processing
Stars: ✭ 358 (+1332%)
Ml Dl ScriptsThe repository provides usefull python scripts for ML and data analysis
Stars: ✭ 119 (+376%)
TextpipeTextpipe: clean and extract metadata from text
Stars: ✭ 284 (+1036%)
jmdict-simplifiedJMdict, JMnedict, Kanjidic, KRADFILE/RADKFILE in JSON format
Stars: ✭ 96 (+284%)
DaDengAndHisPython【微信公众号:大邓和他的python】, Python语法快速入门https://www.bilibili.com/video/av44384851 Python网络爬虫快速入门https://www.bilibili.com/video/av72010301, 我的联系邮箱
[email protected] Stars: ✭ 59 (+136%)
occupationcoderGiven a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.
Stars: ✭ 30 (+20%)
TextcleanTools for cleaning and normalizing text data
Stars: ✭ 159 (+536%)
aylien textapi goAYLIEN's officially supported Go client library for accessing Text API
Stars: ✭ 15 (-40%)
BiomedicusCode for the old version of BioMedICUS, for the new version see the biomedicus3 repository.
Stars: ✭ 45 (+80%)
TRUNAJOD2.0An easy-to-use library to extract indices from texts.
Stars: ✭ 18 (-28%)
ArticleparseHeuristic text extraction from news sites in Python3
Stars: ✭ 6 (-76%)
WikitextparserA simple WikiText parsing library for MediaWiki
Stars: ✭ 149 (+496%)
MetaA Modern C++ Data Sciences Toolkit
Stars: ✭ 600 (+2300%)
wordhoardThis Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.
Stars: ✭ 78 (+212%)
Php Text AnalysisPHP Text Analysis is a library for performing Information Retrieval (IR) and Natural Language Processing (NLP) tasks using the PHP language
Stars: ✭ 410 (+1540%)
Stanza OldStanford NLP group's shared Python tools.
Stars: ✭ 142 (+468%)
JekyllJekyll-based static site for The Programming Historian
Stars: ✭ 387 (+1448%)
text-analysisWeaving analytical stories from text data
Stars: ✭ 12 (-52%)
Python CourseTutorial and introduction into programming with Python for the humanities and social sciences
Stars: ✭ 370 (+1380%)
PadatiousA neural network intent parser
Stars: ✭ 124 (+396%)
Artificial Adversary🗣️ Tool to generate adversarial text examples and test machine learning models against them
Stars: ✭ 348 (+1292%)
GraphbrainLanguage, Knowledge, Cognition
Stars: ✭ 294 (+1076%)
StopwordsMultilingual Stopword Lists in R
Stars: ✭ 89 (+256%)
aylien textapi nodejsAYLIEN's officially supported node.js client library for accessing Text API
Stars: ✭ 13 (-48%)
support-tickets-classificationThis case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en
Stars: ✭ 142 (+468%)
Orange3 Text🍊 📄 Text Mining add-on for Orange3
Stars: ✭ 83 (+232%)
TextvecText vectorization tool to outperform TFIDF for classification tasks
Stars: ✭ 167 (+568%)
YelpDatasetSQLWorking with the Yelp Dataset in Azure SQL and SQL Server
Stars: ✭ 16 (-36%)
Lexisnexistools📰 Working with newspaper data from 'LexisNexis'
Stars: ✭ 59 (+136%)
ritaWebsite, documentation and examples for RiTa
Stars: ✭ 42 (+68%)
learning-stmLearning structural topic modeling using the stm R package.
Stars: ✭ 103 (+312%)
OreAn R interface to the Onigmo regular expression library
Stars: ✭ 54 (+116%)
nlpbuddyA text analysis application for performing common NLP tasks through a web dashboard interface and an API
Stars: ✭ 115 (+360%)
DoctopicsVarious examples of topic modeling and other text analysis
Stars: ✭ 32 (+28%)
nippon日语N5-N2语法笔记~ 🍻
Stars: ✭ 84 (+236%)
Applied MlCode and Resources for "Applied Machine Learning"
Stars: ✭ 156 (+524%)
RezonatorRezonator: Dynamics of human engagement
Stars: ✭ 25 (+0%)