TextDatasetCleaner🔬 Очистка датасетов от мусора (нормализация, препроцессинг)
Stars: ✭ 27 (-94.53%)
R.TeMiSR.TeMiS: R Text Mining Solution
Stars: ✭ 21 (-95.75%)
DaDengAndHisPython【微信公众号:大邓和他的python】, Python语法快速入门https://www.bilibili.com/video/av44384851 Python网络爬虫快速入门https://www.bilibili.com/video/av72010301, 我的联系邮箱
[email protected] Stars: ✭ 59 (-88.06%)
SparseLSHA Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.
Stars: ✭ 127 (-74.29%)
nejiFlexible and powerful platform for biomedical information extraction from text
Stars: ✭ 37 (-92.51%)
kwxBERT, LDA, and TFIDF based keyword extraction in Python
Stars: ✭ 33 (-93.32%)
civicmineText mining cancer biomarkers for the CIVIC database
Stars: ✭ 19 (-96.15%)
misinfo📊 Tools to Perform ‘Misinformation’ Analysis on a Text Corpus (wrapper for methods in https://github.com/PDXBek/Misinformation)
Stars: ✭ 17 (-96.56%)
named-entity-recognitionNotebooks for teaching Named Entity Recognition at the Cultural Heritage Data School, run by Cambridge Digital Humanities
Stars: ✭ 18 (-96.36%)
ipo-minerIPO Investment via Text Mining.
Stars: ✭ 20 (-95.95%)
textreadrTools to uniformly read in text data including semi-structured transcripts
Stars: ✭ 65 (-86.84%)
snorkelingExtracting biomedical relationships from literature with Snorkel 🏊
Stars: ✭ 56 (-88.66%)
Guten-gutterStrips boilerplate from Project Gutenberg text files
Stars: ✭ 16 (-96.76%)
RplosR client for the PLoS Journals API
Stars: ✭ 289 (-41.5%)
support-tickets-classificationThis case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en
Stars: ✭ 142 (-71.26%)
learning2hash.github.ioWebsite for "A survey of learning to hash for Computer Vision" https://learning2hash.github.io
Stars: ✭ 14 (-97.17%)
RmdlRMDL: Random Multimodel Deep Learning for Classification
Stars: ✭ 375 (-24.09%)
thrones2vecUsing Word2Vec to explore semantic similarities between the entities of "A Song of Ice and Fire" ("Game of Thrones").
Stars: ✭ 27 (-94.53%)
readerDistant Reader, a tool for using & understanding a corpus
Stars: ✭ 18 (-96.36%)
NlpythonThis repository contains the code related to Natural Language Processing using python scripting language. All the codes are related to my book entitled "Python Natural Language Processing"
Stars: ✭ 265 (-46.36%)
tf-idf-pythonTerm frequency–inverse document frequency for Chinese novel/documents implemented in python.
Stars: ✭ 98 (-80.16%)
textstemTools for fast text stemming & lemmatization
Stars: ✭ 36 (-92.71%)
textdigesterTextDigester: document summarization java library
Stars: ✭ 23 (-95.34%)
extractnetA Dragnet that also extract author, headline, date, keywords from context
Stars: ✭ 52 (-89.47%)
Text-AnalysisExplaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.
Stars: ✭ 48 (-90.28%)
gofastrMake a DocumentTermMatrix faster
Stars: ✭ 19 (-96.15%)
GraphbrainLanguage, Knowledge, Cognition
Stars: ✭ 294 (-40.49%)
sacred📖 Sacred texts in R
Stars: ✭ 19 (-96.15%)
TwEaterA Python Bot for Scraping Conversations from Twitter
Stars: ✭ 16 (-96.76%)
restaurant-finder-featureReviewsBuild a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).
Stars: ✭ 21 (-95.75%)
Open Semantic SearchOpen Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)
Stars: ✭ 386 (-21.86%)
eventextraction中文复合事件抽取,能识别文本的模式,包括条件事件、顺承事件、反转事件等,可以用于文本逻辑性分析。
Stars: ✭ 17 (-96.56%)
Textractextract text from any document. no muss. no fuss.
Stars: ✭ 3,165 (+540.69%)
lda2vecMixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019
Stars: ✭ 27 (-94.53%)
elpresidente🇺🇸 Search and Extract Corpus Elements from 'The American Presidency Project'
Stars: ✭ 21 (-95.75%)
AdjutantRuns a pubmed query, returns results and allows user to explore high-level structure of returned documents
Stars: ✭ 59 (-88.06%)
TRUNAJOD2.0An easy-to-use library to extract indices from texts.
Stars: ✭ 18 (-96.36%)
ruimteholR package to Embed All the Things! using StarSpace
Stars: ✭ 95 (-80.77%)
converseConversational text Analysis using various NLP techniques
Stars: ✭ 147 (-70.24%)
TextminingPython文本挖掘系统 Research of Text Mining System
Stars: ✭ 268 (-45.75%)
VERSEVancouver Event and Relation System for Extraction
Stars: ✭ 13 (-97.37%)
aera-workshopThis workshop introduces participants to the Learning Analytics (LA), and provides a brief overview of LA methodologies, literature, applications, and ethical issues as they relate to STEM education.
Stars: ✭ 14 (-97.17%)
TabInOutFramework for information extraction from tables
Stars: ✭ 37 (-92.51%)
Text mining resourcesResources for learning about Text Mining and Natural Language Processing
Stars: ✭ 358 (-27.53%)
sensimSentence Similarity Estimator (SenSim)
Stars: ✭ 15 (-96.96%)
textlearnRA simple collection of well working NLP models (Keras, H2O, StarSpace) tuned and benchmarked on a variety of datasets.
Stars: ✭ 16 (-96.76%)
blueprints-textJupyter notebooks for our O'Reilly book "Blueprints for Text Analysis Using Python"
Stars: ✭ 103 (-79.15%)
LdavisR package for web-based interactive topic model visualization.
Stars: ✭ 466 (-5.67%)
Artificial Adversary🗣️ Tool to generate adversarial text examples and test machine learning models against them
Stars: ✭ 348 (-29.55%)
tg crawlerJust a crawler based on tg-cli for Telegram. Deprecated by now, please use telegram-export.
Stars: ✭ 71 (-85.63%)