Top 68 text-analysis open source projects

Shifterator
Interpretable data visualizations for understanding how texts differ at the word level
Woke
✊ Detect non-inclusive language in your source code.
Textvec
Text vectorization tool to outperform TFIDF for classification tasks
Textclean
Tools for cleaning and normalizing text data
Applied Ml
Code and Resources for "Applied Machine Learning"
Wikitextparser
A simple WikiText parsing library for MediaWiki
Qdap
Quantitative Discourse Analysis Package: Bridging the gap between qualitative data and quantitative analysis
Smltar
Manuscript of the book "Supervised Machine Learning for Text Analysis in R" by Emil Hvitfeldt and Julia Silge
Stopwords
Multilingual Stopword Lists in R
R Text Data
List of textual data sources to be used for text mining in R
Awesome Customer Analytics
A curated list of awesome customer analytics content
Lexisnexistools
📰 Working with newspaper data from 'LexisNexis'
Javascript Text Expander
Expands texts as you type, naturally
Ore
An R interface to the Onigmo regular expression library
Biomedicus
Code for the old version of BioMedICUS; for the new version, see the biomedicus3 repository.
Doctopics
Various examples of topic modeling and other text analysis
Rezonator
Rezonator: Dynamics of human engagement
Articleparse
Heuristic text extraction from news sites in Python3
Homer
Homer, a text analyser in Python, can help make your text more clear, simple and useful for your readers.
Awesome Sentiment Analysis
Repository with everything necessary for sentiment analysis and related areas
Php Text Analysis
PHP Text Analysis is a library for performing Information Retrieval (IR) and Natural Language Processing (NLP) tasks using the PHP language
Whatlang Rs
Natural language detection library for Rust. Try demo online: https://www.greyblake.com/whatlang/
Jekyll
Jekyll-based static site for The Programming Historian
Open Semantic Search
Open source research tool to search, browse, analyze and explore large document collections with a semantic search engine and an open source text mining and text analytics platform (integrates ETL for document processing, OCR for images and PDFs, named entity recognition for persons, organizations and locations, metadata management by thesauri and ontologies, and a search user interface and search apps for full-text search, faceted search and knowledge graphs)
Python Course
Tutorial and introduction to programming with Python for the humanities and social sciences
Giveme5w1h
Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?
Textpipe
Textpipe: clean and extract metadata from text
aylien textapi nodejs
AYLIEN's officially supported node.js client library for accessing Text API
support-tickets-classification
This case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en
DaDengAndHisPython
WeChat official account: 大邓和他的python (DaDeng and His Python). Python syntax quick start: https://www.bilibili.com/video/av44384851. Python web scraping quick start: https://www.bilibili.com/video/av72010301. Contact email: [email protected]
LSX
A word embeddings-based semi-supervised model for document scaling
occupationcoder
Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.
HurdleDMR.jl
Hurdle Distributed Multinomial Regression (HDMR) implemented in Julia
learning-stm
Learning structural topic modeling using the stm R package.
aylien textapi go
AYLIEN's officially supported Go client library for accessing Text API
quanteda.corpora
A collection of corpora for quanteda
nlpbuddy
A text analysis application for performing common NLP tasks through a web dashboard interface and an API
rectr
💒 Reproducible Extraction of Cross-lingual Topics using R
ChineseTextAnalysisResouce
A collection of resources for Chinese text analysis
woolly
The Text Mining Elixir
uima-uimafit
Apache UIMA uimaFIT