336 open source projects that are alternatives to or similar to Textmining

How To Mine Newsfeed Data And Extract Interactive Insights In Python

A practical guide to topic mining and interactive visualizations

Stars: ✭ 61 (-77.24%)

Mutual labels: text-mining, sklearn, tf-idf

2018 Machinelearning Lectures Esa

Machine Learning Lectures at the European Space Agency (ESA) in 2018

Stars: ✭ 280 (+4.48%)

Mutual labels: text-mining, tf-idf

tf-idf-python

Term frequency–inverse document frequency for Chinese novels/documents, implemented in Python.

Stars: ✭ 98 (-63.43%)

Mutual labels: text-mining, tf-idf

Nlp In Practice

Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.

Stars: ✭ 790 (+194.78%)

Mutual labels: text-mining, tf-idf

topic modelling financial news

Topic modelling on financial news with Natural Language Processing

Stars: ✭ 51 (-80.97%)

Mutual labels: sklearn, tf-idf

lda2vec

Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019

Stars: ✭ 27 (-89.93%)

Mutual labels: text-mining, sklearn

blueprints-text

Jupyter notebooks for our O'Reilly book "Blueprints for Text Analysis Using Python"

Stars: ✭ 103 (-61.57%)

Mutual labels: text-mining

elpresidente

🇺🇸 Search and Extract Corpus Elements from 'The American Presidency Project'

Stars: ✭ 21 (-92.16%)

Mutual labels: text-mining

lucilla

Fast, efficient, in-memory Full Text Search for Kotlin

Stars: ✭ 102 (-61.94%)

Mutual labels: tf-idf

gofastr

Make a DocumentTermMatrix faster

Stars: ✭ 19 (-92.91%)

Mutual labels: text-mining

snorkeling

Extracting biomedical relationships from literature with Snorkel 🏊

Stars: ✭ 56 (-79.1%)

Mutual labels: text-mining

DaDengAndHisPython

WeChat official account: 大邓和他的python (Da Deng and His Python). Quick start to Python syntax (https://www.bilibili.com/video/av44384851) and quick start to Python web scraping (https://www.bilibili.com/video/av72010301). Contact email: [email protected]

Stars: ✭ 59 (-77.99%)

Mutual labels: text-mining

weibo-summary

Chinese Microblog (Weibo) Automatic Summary System

Stars: ✭ 28 (-89.55%)

Mutual labels: tf-idf

sensim

Sentence Similarity Estimator (SenSim)

Stars: ✭ 15 (-94.4%)

Mutual labels: text-mining

support-tickets-classification

This case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en

Stars: ✭ 142 (-47.01%)

Mutual labels: text-mining

iresearch

IResearch is a cross-platform, high-performance, document-oriented search engine library written entirely in C++, with a focus on pluggability of different ranking/similarity models

Stars: ✭ 121 (-54.85%)

Mutual labels: tf-idf

tg crawler

Just a crawler based on tg-cli for Telegram. Deprecated by now, please use telegram-export.

Stars: ✭ 71 (-73.51%)

Mutual labels: text-mining

ipo-miner

IPO Investment via Text Mining.

Stars: ✭ 20 (-92.54%)

Mutual labels: text-mining

HumanOrRobot

A solution for the Kaggle competition `Human or Robot`

Stars: ✭ 16 (-94.03%)

Mutual labels: sklearn

sacred

📖 Sacred texts in R

Stars: ✭ 19 (-92.91%)

Mutual labels: text-mining

Credit-Risk-Analysis

No description or website provided.

Stars: ✭ 29 (-89.18%)

Mutual labels: sklearn

Guten-gutter

Strips boilerplate from Project Gutenberg text files

Stars: ✭ 16 (-94.03%)

Mutual labels: text-mining

NewsSearch

Mainly uses Python and the Scrapy framework to crawl news websites

Stars: ✭ 23 (-91.42%)

Mutual labels: tf-idf

fb scraper

FBLYZE is a Facebook scraping system and analysis system.

Stars: ✭ 61 (-77.24%)

Mutual labels: tf-idf

restaurant-finder-featureReviews

Build a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).

Stars: ✭ 21 (-92.16%)

Mutual labels: text-mining

sklearn-feature-engineering

Feature engineering with sklearn

Stars: ✭ 114 (-57.46%)

Mutual labels: sklearn

ruimtehol

R package to Embed All the Things! using StarSpace

Stars: ✭ 95 (-64.55%)

Mutual labels: text-mining

TextDatasetCleaner

🔬 Cleaning junk out of datasets (normalization, preprocessing)

Stars: ✭ 27 (-89.93%)

Mutual labels: text-mining

named-entity-recognition

Notebooks for teaching Named Entity Recognition at the Cultural Heritage Data School, run by Cambridge Digital Humanities

Stars: ✭ 18 (-93.28%)

Mutual labels: text-mining

eventextraction

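Chinese compound event extraction. Recognizes patterns in text, including conditional events, sequential events, and reversal events, and can be used to analyze the logical structure of text.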

Stars: ✭ 17 (-93.66%)

Mutual labels: text-mining

textstem

Tools for fast text stemming & lemmatization

Stars: ✭ 36 (-86.57%)

Mutual labels: text-mining

Diabetic-Retinopathy-Detection

Diagnosis of diabetic retinopathy from fundus images using SVM, KNN, and attention-based CNN models, with GradCam score for interpretability.

Stars: ✭ 31 (-88.43%)

Mutual labels: sklearn

advanced-text-mining

Covers natural language processing and text analysis methodology using the TEANAPS library.

Stars: ✭ 15 (-94.4%)

Mutual labels: text-mining

Data-Analyst-Nanodegree

Kai Sheng Teh - Udacity Data Analyst Nanodegree

Stars: ✭ 42 (-84.33%)

Mutual labels: sklearn

textdigester

TextDigester: document summarization java library

Stars: ✭ 23 (-91.42%)

Mutual labels: text-mining

News Search Engine

News search engine

Stars: ✭ 254 (-5.22%)

Mutual labels: jieba

AI-Project

Stock predictor using Machine Learning

Stars: ✭ 22 (-91.79%)

Mutual labels: sklearn

datahub

DataHub - Synthetic data library

Stars: ✭ 66 (-75.37%)

Mutual labels: sklearn

lorca

Natural Language Processing for Spanish in Node.js. Stemmer, sentiment analysis, readability, tf-idf with batteries, concordance and more!

Stars: ✭ 95 (-64.55%)

Mutual labels: tf-idf

Text-Analysis

Explaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.

Stars: ✭ 48 (-82.09%)

Mutual labels: text-mining

SparseLSH

A Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.

Stars: ✭ 127 (-52.61%)

Mutual labels: text-mining

sklearn-predict

Machine learning on data: predicts trends and plots them

Stars: ✭ 16 (-94.03%)

Mutual labels: sklearn

watchman

Watchman: An open-source social-media event-detection system

Stars: ✭ 18 (-93.28%)

Mutual labels: tf-idf

Attentionwalk

A PyTorch Implementation of "Watch Your Step: Learning Node Embeddings via Graph Attention" (NeurIPS 2018).

Stars: ✭ 266 (-0.75%)

Mutual labels: sklearn

occupationcoder

Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.

Stars: ✭ 30 (-88.81%)

Mutual labels: tf-idf

skippa

SciKIt-learn Pipeline in PAndas

Stars: ✭ 33 (-87.69%)

Mutual labels: sklearn

codeflare

Simplifying the definition and execution, scaling and deployment of pipelines on the cloud.

Stars: ✭ 163 (-39.18%)

Mutual labels: sklearn

machine-learning-novice-sklearn

A Carpentry style lesson on machine learning with Python and scikit-learn.

Stars: ✭ 22 (-91.79%)

Mutual labels: sklearn

scikit-learn

In Persian, for contributing to scikit-learn

Stars: ✭ 19 (-92.91%)

Mutual labels: sklearn

Breast-Cancer-Scikitlearn

simple tutorial on Machine Learning with Scikitlearn

Stars: ✭ 33 (-87.69%)

Mutual labels: sklearn

sklearn-audio-classification

An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, and cross-validation with a variety of ML techniques and MLP

Stars: ✭ 31 (-88.43%)

Mutual labels: sklearn

awesome-text-summarization

Text summarization starting from scratch.

Stars: ✭ 86 (-67.91%)

Mutual labels: text-mining

Quran-and-Arabic-Language-Repository

Projects & Libraries related to Quran & Arabic Language

Stars: ✭ 26 (-90.3%)

Mutual labels: text-mining

Kaio-machine-learning-human-face-detection

Machine learning project: a case study focused on interaction with digital characters, using a character called "Kaio" which, based on automatic detection of facial expressions and classification of emotions, interacts with humans by classifying emotions and imitating expressions

Stars: ✭ 18 (-93.28%)

Mutual labels: sklearn

Introduction-to-text-mining-with-Python

Lectures in Urban Data Science Lab, Seoul

Stars: ✭ 25 (-90.67%)

Mutual labels: text-mining

MachineLearning

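Machine learning tutorial. Covers machine learning with numpy, sklearn, and tensorflow, and also shows how to use spark and flink to speed up model training. Aims to give readers a fairly complete introduction to machine learning.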

Stars: ✭ 23 (-91.42%)

Mutual labels: sklearn

TwEater

A Python Bot for Scraping Conversations from Twitter