All Projects → rhiever → Reddit Analysis

rhiever / Reddit Analysis

A Python script that parses post titles, self-texts, and comments on reddit and makes word clouds out of the word frequencies.

Programming Languages

python
139335 projects - #7 most used programming language

Labels

Projects that are alternatives of or similar to Reddit Analysis

Reddit2YouTube
Convert reddit posts to YT videos
Stars: ✭ 17 (-93.68%)
Mutual labels:  reddit
reddit-clone
Full stack Reddit clone with nodejs and react native.
Stars: ✭ 69 (-74.35%)
Mutual labels:  reddit
reddit-radio
Discord bot that streams music and radio to voice channels.
Stars: ✭ 29 (-89.22%)
Mutual labels:  reddit
Personal-Finance-Net-Worth-Tracker
Personal Finance (Net Worth Tracker) Wealth Management Spreadsheet
Stars: ✭ 31 (-88.48%)
Mutual labels:  reddit
reddit-news
📰 simple browser extension that shows recent news from reddit.com
Stars: ✭ 46 (-82.9%)
Mutual labels:  reddit
timesearch
The subreddit archiver
Stars: ✭ 114 (-57.62%)
Mutual labels:  reddit
subreddit-css
used for /r/web_design and /r/graphic_design
Stars: ✭ 44 (-83.64%)
Mutual labels:  reddit
PlayStoreLinks Bot
A Reddit Bot that links to Android Apps when requested.
Stars: ✭ 91 (-66.17%)
Mutual labels:  reddit
tootbot
Python tool for mirroring Reddit posts to Twitter and Mastodon
Stars: ✭ 68 (-74.72%)
Mutual labels:  reddit
spotify-reddit
Spotify playlists backed by subreddits
Stars: ✭ 14 (-94.8%)
Mutual labels:  reddit
Updoot
A reddit client built for android
Stars: ✭ 51 (-81.04%)
Mutual labels:  reddit
reddit-pocket-sync
No description or website provided.
Stars: ✭ 37 (-86.25%)
Mutual labels:  reddit
gettit
An easy to use go program to download videos from Reddit.
Stars: ✭ 27 (-89.96%)
Mutual labels:  reddit
rsoccerlive
Live highlight updates from the https://reddit.com/r/soccer subreddit.
Stars: ✭ 14 (-94.8%)
Mutual labels:  reddit
reddit-comment-bot
Reddit bot that auto replies to comments on set subreddits
Stars: ✭ 59 (-78.07%)
Mutual labels:  reddit
Glance
Another client for Reddit, but this time, it's not just a client. It's a different way of browsing photos, GIFs, memes and other things there!
Stars: ✭ 104 (-61.34%)
Mutual labels:  reddit
alternative-front-ends
Overview of alternative open source front-ends for popular internet platforms (e.g. YouTube, Twitter, etc.)
Stars: ✭ 1,664 (+518.59%)
Mutual labels:  reddit
Liked-Saved-Image-Downloader
Save content you enjoy!
Stars: ✭ 80 (-70.26%)
Mutual labels:  reddit
GfycatDetailsConvert
Reddit bot built on top of snoowrap which helps with converting Gfycat URLs.
Stars: ✭ 14 (-94.8%)
Mutual labels:  reddit
memer-action
A GitHub Action for Programmer Memes xD
Stars: ✭ 21 (-92.19%)
Mutual labels:  reddit

PyPI version Python 2.7 Python 3.5 License

Reddit Analysis project

Please send all requests to make a Most-Used Words (MUW) cloud to http://www.reddit.com/r/MUWs/

Feel free to post the MUWs you've made there, too.

License

Copyright 2016 Randal S. Olson.

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see http://www.gnu.org/licenses/.

Dependencies

You must first install the Python library if you do not have that already. Preferably, use the Anaconda Python distribution for an easy install.

Next, you can install this package. Enter the following command into the terminal:

pip install redditanalysis

You may need to put sudo in front of the above command if your system requires root access.

If you want to install the lastest development version from github first clone the package:

git clone https://github.com/rhiever/reddit-analysis.git

change into the reddit-analysis directory:

cd reddit-analysis

then run the update script:

python setup.py install

Files in this repository

redditanalysis/words/common-words.txt is a data file containing a list of words that should be considered common. Note that this list is not final and is constantly changing.

redditanalysis/words/dict-words.txt is a data file containing a list of words from a dictionary. It is only recommended to use this file (with the -x option) if you want word_freqs to pick out very uncommon words.

Usage

Once installed, run the following on your command line to produce a usage message:

word_freqs --help

This command will detail all of the command line options and arguments for the word_freqs.

Make a MUW cloud for a subreddit or redditor

To count the most-used words for a subreddit over the last month, enter the following command:

word_freqs YOUR-USERNAME /r/SUBREDDIT

Similarly, for a reddit user:

word_freqs YOUR-USERNAME /u/REDDITOR

where YOUR-USERNAME is your reddit username and SUBREDDIT / REDDITOR is the subreddit / redditor you want to make the MUW cloud for. You must provide both arguments for the script to work properly.

Why is your username required? Simply because it will be used as the user-agent when making the Reddit API request. Reddit asks its API users to use something unique as the user-agent and recomends to use the users username.

Once the script completes, it will create a file called subreddit-SUBREDDIT.csv (or user-REDDITOR.csv) to the directory you ran it in. This file contains all of the commonly-used words from the subreddit / redditor you specified in the frequencies they were used.

To make a MUW cloud out of the words, copy all of the words into http://www.wordle.net/compose and click the Go button. Ta-da, you're done!

Multiprocess

reddit-analysis supports multiprocess PRAW. This allows you to run multiple instances of reddit-analysis simultaneously and not risk getting banned for overusing the reddit API. To enable multiprocess PRAW in reddit-analysis, add the -u flag.

See the PRAW documentation for more information.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].