mutux / TwEater

Licence: MIT License
A Python Bot for Scraping Conversations from Twitter

TwEater

A Python way to collect MORE tweets and their REPLIES from Twitter than the official API allows. Currently only Python 2.7.x is supported.

The motivation is to collect tweets for text mining or NLP tasks, such as message understanding, chatbots, opinion mining, information extraction, event detection and tracking, tweet ranking, and so on.

Therefore, it needs to collect not only the tweet text and basic attributes, but also conversations, emojis, links, mentions, and hashtags.

Also, the official API limits both the time range and the number of tweets you can collect. If that is a problem for you, try TwEater!

Examples

See eater.py for a simple example of using this bot. First, you need to place your order, either through a configuration file or through key=value parameters:

TwOrder.order('order.conf')

Or

TwOrder.order(user='BarackObama')

Two methods, digest_2_file and digest_2_mongo, are provided to process the data after it is collected. They store it in a file or in MongoDB, respectively. You can also define your own processing function, for example to process the data on the fly; it's up to you.

Then harvest tweets together with their replies (emojis are also collected, which is very important for sentiment analysis):

TwEater.eatTweets(digest_2_file, 'test')
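A custom processing function could look like the sketch below. It is not taken from TwEater itself. It assumes the callback receives each batch as a list of JSON-serializable dicts, followed by the extra argument given to eatTweets (here a file path), mirroring the digest_2_file call above. The name digest_2_jsonl and that calling convention are assumptions, so check eater.py for the real signature before relying on it.

```python
import io
import json


def digest_2_jsonl(tweets, fpath):
    """Append a batch of tweets to `fpath`, one JSON object per line.

    Assumption (not confirmed by the README): `tweets` is a list of
    JSON-serializable dicts and `fpath` is the second argument that was
    passed to TwEater.eatTweets.
    """
    with io.open(fpath, 'a', encoding='utf-8') as out:
        for tweet in tweets:
            # ensure_ascii=False keeps emojis and accented text readable.
            line = json.dumps(tweet, ensure_ascii=False)
            # On Python 2, json.dumps may return a byte string.
            if isinstance(line, bytes):
                line = line.decode('utf-8')
            out.write(line + u'\n')
    # Return the batch size so callers can keep a running count.
    return len(tweets)
```

If the callback contract matches the assumption above, the function would be passed in the same way as the built-in ones, for example TwEater.eatTweets(digest_2_jsonl, 'tweets.jsonl'). Appending line by line means each flushed batch ends up in the same file without holding earlier batches in memory.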

If you just want the replies to a single tweet (tweet_id) posted by a given user (username), the following call returns them as a JSON array:

print TwChef.shopComments('BarackObama', '876456804305252353')

Parameters

Example values for the 10 parameters are as follows; they can also be found in the file order.conf:

    {
      "user": "",
      "query": "calorie OR eat",
      "since": "2017-06-10",
      "until": "2017-07-19",
      "max_tweets": 10,
      "max_comments": 0,
      "bufferlength": 100,
      "near": "Montréal, Québec",
      "within": "5km",
      "lang": "en"
    }

Note: at least one of user and query must be specified.

  • user: the user whose tweets you want to collect, default ""
  • query: either a keyword or a hashtag you care about, default ""
  • since: the start date of the tweets you want, default ""
  • until: the end date of the tweets you want, default ""
  • max_tweets: how many tweets you want to collect for this query and/or user, default 1
  • max_comments: how many replies you want for each tweet, if there are any, default 1
  • bufferlength: the batch size at which collected data is processed and cleared, so that you do not run out of memory, default 100
  • near: the location where the tweets you need were posted, default ""
  • within: the radius around the location; has to be used together with near, default ""
  • lang: the language of the tweets you need; only English and French are supported at the moment, default "en"
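The parameter rules above can be checked before an order file is written. The sketch below is a hypothetical helper, not part of TwEater: build_order and write_order are made-up names, the key names follow the order.conf example, and the defaults are copied from the list above. It only enforces the two rules the README states (user or query is required, and within needs near).

```python
import json

# Defaults copied from the parameter list; keys follow the order.conf example.
DEFAULTS = {
    "user": "", "query": "", "since": "", "until": "",
    "max_tweets": 1, "max_comments": 1, "bufferlength": 100,
    "near": "", "within": "", "lang": "en",
}


def build_order(**params):
    """Merge `params` over the defaults and validate the documented rules."""
    unknown = set(params) - set(DEFAULTS)
    if unknown:
        raise ValueError("unknown parameter(s): " + ", ".join(sorted(unknown)))
    order = dict(DEFAULTS)
    order.update(params)
    # Rule 1: at least one of user and query must be specified.
    if not order["user"] and not order["query"]:
        raise ValueError("at least one of 'user' and 'query' must be specified")
    # Rule 2: within has to be used together with near.
    if order["within"] and not order["near"]:
        raise ValueError("'within' has to be used together with 'near'")
    return order


def write_order(path, **params):
    """Validate the order and save it as a JSON configuration file."""
    order = build_order(**params)
    with open(path, "w") as f:
        json.dump(order, f, indent=2)
    return order
```

For example, write_order('order.conf', query='calorie OR eat', max_tweets=10) would produce a file that could then be handed to TwOrder.order('order.conf'), assuming TwOrder accepts any JSON file with these keys.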

MuTuX

This is for the benefit of learners and researchers, so please don't abuse it. Have fun!
