All Projects → jonbakerfish → Tweetscraper

jonbakerfish / Tweetscraper

Licence: gpl-2.0
TweetScraper is a simple crawler/spider for Twitter Search without using API

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Tweetscraper

download-tweets-ai-text-gen-plus
Python script to download public Tweets from a given Twitter account into a format suitable for AI text generation
Stars: ✭ 26 (-96.25%)
Mutual labels:  twitter, tweets
TwEater
A Python Bot for Scraping Conversations from Twitter
Stars: ✭ 16 (-97.69%)
Mutual labels:  twitter, tweets
awesome-twitter-bots
A Curated Collection of the Best Twitter Bots 🤖
Stars: ✭ 99 (-85.73%)
Mutual labels:  twitter, tweets
detweet
delete tweets en masse
Stars: ✭ 14 (-97.98%)
Mutual labels:  twitter, tweets
React Static Tweets
Extremely fast static renderer for tweets.
Stars: ✭ 278 (-59.94%)
Mutual labels:  twitter, tweets
congresstweets
Datasets of the daily Twitter output of Congress.
Stars: ✭ 76 (-89.05%)
Mutual labels:  twitter, tweets
TwitterPiBot
A Python based bot for Raspberry Pi that grabs tweets with a specific hashtag and reads them out loud.
Stars: ✭ 85 (-87.75%)
Mutual labels:  twitter, tweets
Laravel Twitter Streaming Api
Easily work with the Twitter Streaming API in a Laravel app
Stars: ✭ 153 (-77.95%)
Mutual labels:  twitter, tweets
Twitter Scraper
Scrape the Twitter Frontend API without authentication.
Stars: ✭ 3,037 (+337.61%)
Mutual labels:  twitter, tweets
archive-explorer-web
Browse your Twitter archive with a friendly, responsive, full experience, and quickly delete the tweets you don't want.
Stars: ✭ 19 (-97.26%)
Mutual labels:  twitter, tweets
Twitterdelete
💀 Delete your old, unpopular tweets.
Stars: ✭ 231 (-66.71%)
Mutual labels:  twitter, tweets
Yotter
Youtube and Twitter with privacy.
Stars: ✭ 376 (-45.82%)
Mutual labels:  twitter, tweets
Dmarchiver
A tool to archive the direct messages, images and videos from your private conversations on Twitter
Stars: ✭ 204 (-70.61%)
Mutual labels:  twitter, tweets
gnip
Connect to Gnip streaming API and manage rules
Stars: ✭ 28 (-95.97%)
Mutual labels:  twitter, tweets
Scrape Twitter
🐦 Access Twitter data without an API key. [DEPRECATED]
Stars: ✭ 166 (-76.08%)
Mutual labels:  twitter, tweets
Archive-Tweets
Archive and Delete Liked and Posted Tweets
Stars: ✭ 28 (-95.97%)
Mutual labels:  twitter, tweets
Twint
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
Stars: ✭ 12,102 (+1643.8%)
Mutual labels:  twitter, tweets
Real Time Sentiment Tracking On Twitter For Brand Improvement And Trend Recognition
A real-time interactive web app based on data pipelines using streaming Twitter data, automated sentiment analysis, and MySQL&PostgreSQL database (Deployed on Heroku)
Stars: ✭ 127 (-81.7%)
Mutual labels:  twitter, tweets
twitter-analytics-wrapper
A simple Python wrapper to download tweets data from the Twitter Analytics platform. Particularly interesting for the impressions metrics that are unavailable on current Twitter API. Also works for the videos data.
Stars: ✭ 44 (-93.66%)
Mutual labels:  twitter, tweets
Tweetie
Simple jQuery Twitter feed plugin
Stars: ✭ 314 (-54.76%)
Mutual labels:  twitter, tweets

Introduction

TweetScraper can get tweets from Twitter Search. It is built on Scrapy without using Twitter's APIs. The crawled data is not as clean as the one obtained by the APIs, but the benefits are you can get rid of the API's rate limits and restrictions. Ideally, you can get all the data from Twitter Search.

WARNING: please be polite and follow the crawler's politeness policy.

Installation

  1. Install conda, you can get it from miniconda. The tested python version is 3.7.

  2. Install selenium python bindings: https://selenium-python.readthedocs.io/installation.html. (Note: the KeyError: 'driver' is caused by wrong setup)

  3. For ubuntu or debian user, run:

    $ bash install.sh
    $ conda activate tweetscraper
    $ scrapy list
    $ #If the output is 'TweetScraper', then you are ready to go.
    

    the install.sh will create a new environment tweetscraper and install all the dependencies (e.g., firefox-geckodriver and firefox),

Usage

  1. Change the USER_AGENT in TweetScraper/settings.py to identify who you are

     USER_AGENT = 'your website/e-mail'
    
  2. In the root folder of this project, run command like:

     scrapy crawl TweetScraper -a query="foo,#bar"
    

    where query is a list of keywords seperated by comma and quoted by ". The query can be any thing (keyword, hashtag, etc.) you want to search in Twitter Search. TweetScraper will crawl the search results of the query and save the tweet content and user information.

  3. The tweets will be saved to disk in ./Data/tweet/ in default settings and ./Data/user/ is for user data. The file format is JSON. Change the SAVE_TWEET_PATH and SAVE_USER_PATH in TweetScraper/settings.py if you want another location.

Acknowledgement

Keeping the crawler up to date requires continuous efforts, please support our work via opencollective.com/tweetscraper.

License

TweetScraper is released under the GNU GENERAL PUBLIC LICENSE, Version 2

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].