All Projects → reidpr → Quac

reidpr / Quac

Licence: other
QUAC ("quantitative analysis of chatter" or any related acronym you like) is a package for acquiring and analyzing social Internet content. Docs are online at http://reidpr.github.io/quac.

Projects that are alternatives of or similar to Quac

Deconfounder tutorial
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Deepbayes2017
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Pydata Nyc 2015
📜 Understanding Probabilistic Topic Models with Simulation in Python
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Machine Learning Notes
A repository to save my machine learning notes.
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Matplotlib4papers
Matplotlib examples to present results.
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Text Top Model
Benchmarking text classification algorithms
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Notebooks
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Datascience
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Machine Learning In Finance
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Taxiprediction
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Otml ds3 2018
Practical sessions for the Optimal Transport and Machine learning course at DS3 2018
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Rmsync
A script for synchronizing the reMarkable e-reader
Stars: ✭ 63 (-1.56%)
Mutual labels:  jupyter-notebook
Dqn Pytorch
Deep Q Learning via Pytorch
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Indonesian Language Models
Indonesian Language Models and its Usage
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
W2v
Word2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Pneumonia Detection From Chest X Ray Images With Deep Learning
Detecting Pneumonia in Chest X-ray Images using Convolutional Neural Network and Pretrained Models
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Genetic Algorithm Rnn
Using Genetic Algorithms to optimize Recurrent Neural Network's Configuration
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Python Course
Stuff for python courses
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Learningwithdata
Code for Learning with Data Blog
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Apmae4990
Introduction to Data Science in Industry
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook

About QUAC

QUAC ("Quantitative analysis of chatter" or any related acronym you like) is a package for acquiring and analyzing social internet content. Features:

  • Reliable data collection and conversion of raw data into into easy-to-parse, de-duplicated, and well-ordered formats. We support:

    • Tweets from the Twitter Streaming API <https://dev.twitter.com/docs/streaming-apis>_.

    • Wikipedia hourly aggregate pageview logs <http://dumps.wikimedia.org/other/pagecounts-raw/>_.

    • Wikipedia edit history and related XML dumps <http://meta.wikimedia.org/wiki/Data_dumps>_.

  • Estimate the origin location of tweets with no geotag. (But see issue #15 <https://github.com/reidpr/quac/issues/15>_.)

  • Careful preservation of Unicode throughout the processing pipeline.

  • Various cleanup steps to deal with tweet quirks, including very rare ones (we've seen certain weirdnesses in only one of our 1.3+ billion tweets). That is, we deal with the special cases so you don't have to.

  • Parallel processing using various combinations of Make, joblib, and a simple map-reduce framework called QUACreduce which is included.

QUAC is copyright © 2012–2019 Triad National Security, LLC, and others. It is open source under the Apache license and was formerly known as Twepi ("Twitter for epidemic analysis").

Reporting bugs

Use our list of issues <https://github.com/reidpr/quac/issues>. To maximize the chances of your bug being understood and fixed, take a look at "Three parts to every good bug report" <http://www.joelonsoftware.com/articles/fog0000000029.html> (scroll down).

That said, note that unlike many open source projects, we make a point of being friendly to bug reporters, even newbies. Therefore, please don't hesitate to report a bug, even if you're inexperienced with QUAC or feel unsure. In almost all cases you will tell us something useful, even if the issue turns out not to be a bug per se, and we will support your efforts in this regard.

If you find QUAC useful

Please send us a note at [email protected] if you use QUAC, even for small uses, and/or star the project on GitHub. This type of feedback is very important for continued justification of the project to our sponsors.

Note that for many uses of QUAC (especially research) you are ethically obligated to cite it. For guidelines on how to do this, see the Citing section of the documentation.

Science!

We use QUAC for scientific research. To promote reproducibility, which is one of the core values of science, we try to open-source the code that runs our related experiments as well as QUAC itself. This code, and further information about it, can be found in the directory experiments.

For more information

  • Documentation is online at http://reidpr.github.io/quac. (Note: this may describe a different version of QUAC than the one you have.)

  • Current documentation is rooted at doc/index.html. (You'll probably need to build it first.)

  • Most scripts have pretty help which you can print using the --help option and/or look at in comments at the top of the script. Modules also usually have good docstrings.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].