All Projects → kjahan → twitter_mining

kjahan / twitter_mining

Licence: other
Twitter Mining in Java

Programming Languages

java
68154 projects - #9 most used programming language

Projects that are alternatives of or similar to twitter mining

sentiment-analysis-using-python
Large Data Analysis Course Project
Stars: ✭ 23 (-8%)
Mutual labels:  sentiment-analysis, naive-bayes-classifier
ethersocial-pool
Open EthersocialNetwork Mining Pool based off the sammy007's open-ethereum-pool
Stars: ✭ 19 (-24%)
Mutual labels:  mining
Real Time DataMining Software
携程/榛果民宿实时评论挖掘软件,包含数据的实时采集/数据清洗/结构化保存/ UGC 数据主题提取/情感分析/后结构化可视化等技术的综合性演示 Demo。基于在线民宿 UGC 数据的意见挖掘项目,包含数据挖掘和 NLP 相关的处理,负责数据采集、主题抽取、情感分析等任务。主要克服用户打分和评论不一致,实时对携程和美团在线民宿的满意度进行评测以及对额外数据进行可视化的综合性工具,多维度的对在线 UGC 进行数据挖掘并可视化,demo 视频演示见链接。
Stars: ✭ 43 (+72%)
Mutual labels:  sentiment-analysis
AirBnbPricePrediction
Training and Testing a Set of Machine Learning/Deep Learning Models to Predict Airbnb Prices for NYC
Stars: ✭ 47 (+88%)
Mutual labels:  sentiment-analysis
cryptocoin scrypt stratum
A javascript scrypt cryptocurrency miner that uses a PHP built stratum server as a proxy.
Stars: ✭ 25 (+0%)
Mutual labels:  mining
noso-go
Golang CLI application for mining Noso coin
Stars: ✭ 21 (-16%)
Mutual labels:  mining
pandas twitter
Analyzing Trump's tweets using Python (Pandas + Twitter workshop)
Stars: ✭ 81 (+224%)
Mutual labels:  sentiment-analysis
athena
Opinion mining
Stars: ✭ 25 (+0%)
Mutual labels:  sentiment-analysis
char-cnn-text-classification-tensorflow
Simple Convolutional Neural Network (CNN) for sentiment classification of Chinese movie reviews.
Stars: ✭ 55 (+120%)
Mutual labels:  sentiment-analysis
amazon-reviews
Sentiment Analysis & Topic Modeling with Amazon Reviews
Stars: ✭ 26 (+4%)
Mutual labels:  sentiment-analysis
LDA thesis
Hierarchical, multi-label topic modelling with LDA
Stars: ✭ 49 (+96%)
Mutual labels:  latent-dirichlet-allocation
sentiment-analysis-imdb
This is a classifier focused on sentiment analysis of movie reviews
Stars: ✭ 11 (-56%)
Mutual labels:  sentiment-analysis
foxy-miner
A scavenger / conqueror wrapper for collision free multi mining of PoC coins
Stars: ✭ 17 (-32%)
Mutual labels:  mining
chatto
Chatto is a minimal chatbot framework in Go.
Stars: ✭ 98 (+292%)
Mutual labels:  naive-bayes-classifier
rosette-elasticsearch-plugin
Document Enrichment plugin for Elasticsearch
Stars: ✭ 25 (+0%)
Mutual labels:  sentiment-analysis
soroka
Узнай, хорошо или плохо говорят о тебе или твоей фирме в Интернете! Наша "Сорока" с искусственным интеллектом принесёт тебе это на своём хвосте.
Stars: ✭ 16 (-36%)
Mutual labels:  sentiment-analysis
Text tone analyzer
Система, анализирующая тональность текстов и высказываний.
Stars: ✭ 15 (-40%)
Mutual labels:  sentiment-analysis
Dataset-Sentimen-Analisis-Bahasa-Indonesia
Repositori ini merupakan kumpulan dataset terkait analisis sentimen Berbahasa Indonesia. Apabila Anda menggunakan dataset-dataset yang ada pada repositori ini untuk penelitian, maka cantumkanlah/kutiplah jurnal artikel terkait dataset tersebut. Dataset yang tersedia telah diimplementasikan dalam beberapa penelitian dan hasilnya telah dipublikasi…
Stars: ✭ 38 (+52%)
Mutual labels:  sentiment-analysis
pytorch-sentiment-analysis
char-rnn implementation for sentiment analysis on twitter data
Stars: ✭ 32 (+28%)
Mutual labels:  sentiment-analysis
telegram bitcoin bot
Telegram Auto Mining Bitcoins In Termux
Stars: ✭ 115 (+360%)
Mutual labels:  mining

Twitter Mining Project

This project is a ML/NLP library in Java for analyzing tweets and building predictive models. The predictive models are built to help election/ad/marketing campaigns dig into social media conversations (public opinions) in order to get insights for making intelligent decisions.

The project consists of four main packages and a resource directory:

  1. Algorithms package contains implementations of a few ml/nlp algorithms for running text analysis on tweets contents.
  2. Twitter package is designed to wrap twitter data regradless of the persistent layer that one uses to store/retrieve tweets.
  3. Runanalysis package is the interface for running ml/nlp algorithms.
  4. Utilities package is developed to provide a collection of helper classes for different analysis.
  5. Resources directory includes a few data sources used for tweets analysis such as stop words, training data for sentiment analysis and so on.

Packages Details:

Algorithms Package:

  1. LDA Algorithm: an implentation of Latent Dirichlet Allocation algorithm used for topic modeling.
  2. NaiveBayes Classifier: a customized version of Naive Bayes classifier for running sentiment analysis on tweets.
  3. TextAnalysis: a class for performing various text analysis such as computing word frequencies.
  4. TweetsStatistics: provides functionalities for computing basic statistics from tweets.

Twitter Package:

  1. Tweet: a representative class for tweets.
  2. TweetDate: a class for dealing with date range. This allows us to analyze tweets in a give time range.
  3. TweetsConstants: a class for constants and configuration parameters.
  4. TwitterDataSource: an interface designed to deal with different persistent layers.
  5. TwitterFileDataSource: an implementation of TwitterDataSource interface when persistent layer is raw File.
  6. TwitterMySqlDataSource: an implementation of TwitterDataSource interface when persistent layer is MySql DB.

Runanalysis Package:

  1. RunBayes: runs sentiment analysis on tweets using NaiveBayes class.
  2. RunLDA: runs topic modeling on tweets using LDA class.
  3. RunStatistics: runs basic statistics on tweets using TweetsStatistics class.
  4. RunTextAnalysis: runs text analysis on tweets using TextAnalysis class.
  5. ThreadPool & WorkerThread: a multi-threaded code for running analysis.

Utilities Package:

  1. DayIntervals: a class for reading day interval files and generating a list of day pairs.
  2. GenerateCsv: a class for generating a CSV file for post-processing and visualization steps.
  3. MapUtil: a class for printing a TreeMap data.
  4. Pair: a class for defining pair objects.
  5. SentimentLabel: sentiment labels.
  6. StopWords: a class for building stop words for NLP analysis.
  7. TimeZone: time zone class.
  8. TweetUtils: a helper class which has functionalities for cleaning/normalizing tweets.
  9. ValueComparator: a comparator class.

Tweets Data Schema:

This library requires your twitter data to be stored in a MySql database/table (i.e. politics/tweets). Schema of tweets table is shown below:

Field Type
id int(10) unsigned, PRI
timestamp int(10) unsigned
source varchar(40)
author varchar(20)
lat decimal(10,8)
lng decimal(11,8)
text varchar(140)
created at datetime

If you'd like to read more about this project, you should check Barack Obama or Mitt Romney: that's the question! web page. You can also check our published paper using this ML/NLP framework here: The Predictive Power of Social Media: On the Predictability of U.S. Presidential Elections using Twitter.

If you have any question about the code, contact me @ kDOTjahanbakhshATgmailDOTcom

Licence

Copyright (c) 2013 Black Square Media Ltd. All rights reserved.
(The MIT License)

Permission is hereby granted, free of charge, to any person obtaining
a copy of this software and associated documentation files (the
'Software'), to deal in the Software without restriction, including
without limitation the rights to use, copy, modify, merge, publish,
distribute, sublicense, and/or sell copies of the Software, and to
permit persons to whom the Software is furnished to do so, subject to
the following conditions:

The above copyright notice and this permission notice shall be
included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED 'AS IS', WITHOUT WARRANTY OF ANY KIND,
EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.
IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY
CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT,
TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE
SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].