
pmacinec / fake-news-datasets

Licence: MIT license
This repository contains a list of available fake news datasets for data mining.

Fake News Datasets

License: MIT

Introduction

This project was created to show a basic analysis of public fake news datasets. The main idea is to make each analysis replicable, so that anyone can add their own analysis and use it for their experiments and data mining. Every dataset has its own Python Jupyter notebook with a simple analysis, which can help in choosing an appropriate dataset.

Prerequisites

Installation and running

To run all Jupyter notebooks with the appropriate libraries installed, we recommend using Docker.

With Docker installed, run the following command to build the Docker image and start the container:

./scripts/run.sh -b

Note: Next time, when no build is needed (because the image has already been built), you can start the container by omitting the -b argument.

Datasets

A list of all processed datasets, with a simple comparison, is stored in the datasets/README.md file.

All dataset analyses are stored in the datasets/ folder. Each dataset has its own folder with a short description in a README file and a Jupyter notebook (the folder may also include other files, e.g. the data itself).

Dataset files (e.g. .csv or .tsv files) are stored using Git LFS (see Git LFS for more information).
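
Because the data files live in Git LFS, a clone made without LFS support contains small text pointer files instead of the real .csv or .tsv content. The sketch below is one possible way to detect that situation before opening a notebook. The function names is_lfs_pointer and find_unfetched are hypothetical helpers, not part of this repository; the only fact relied on is that a Git LFS pointer file begins with the line "version https://git-lfs.github.com/spec/v1".

```python
from pathlib import Path

# First line of every Git LFS pointer file, per the Git LFS pointer format.
LFS_POINTER_PREFIX = b"version https://git-lfs.github.com/spec/v1"


def is_lfs_pointer(path):
    """Return True if `path` looks like an un-fetched Git LFS pointer file."""
    with open(path, "rb") as handle:
        head = handle.read(len(LFS_POINTER_PREFIX))
    return head == LFS_POINTER_PREFIX


def find_unfetched(data_dir):
    """List files under `data_dir` that are still LFS pointers, sorted by path.

    Hypothetical helper: `data_dir` would typically be something like
    datasets/{name}/data in this repository's layout.
    """
    return sorted(
        p for p in Path(data_dir).rglob("*") if p.is_file() and is_lfs_pointer(p)
    )
```

If find_unfetched returns any paths, running git lfs pull in the repository should replace the pointers with the actual data files.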

Adding a new dataset

When adding a new dataset, please follow these steps:

  1. Call the ./scripts/create_structure.sh {name} script with the name argument supplied in snake_case format (e.g. fake_news_detection_kaggle). This script will create the needed folders and files in the datasets/{name} folder.
  2. Add the data to the datasets/{name}/data directory.
  3. Update the datasets/{name}/README.md file to provide a link, potential tasks, a description and attribute descriptions. Please follow the template file structure.
  4. Update the datasets/{name}/{name}.ipynb file with an analysis of the dataset. Please follow the template file structure.
  5. Add the dataset and its details to the table of datasets in the datasets/README.md file (please keep the alphabetical order).
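
The layout described in the steps above can be checked with a short script before opening a pull request. This is only a sketch: missing_parts is a hypothetical helper that does not exist in the repository, and the snake_case pattern (lowercase letters and digits separated by single underscores) is an assumption about what the project accepts. It verifies only that the paths named in steps 1 to 4 exist, not that their content follows the templates.

```python
import re
from pathlib import Path

# Assumed definition of snake_case: lowercase alphanumerics joined by single underscores.
SNAKE_CASE = re.compile(r"[a-z0-9]+(_[a-z0-9]+)*")


def missing_parts(name, root="datasets"):
    """Return a list of problems for dataset `name`; an empty list means none found."""
    problems = []
    if not SNAKE_CASE.fullmatch(name):
        problems.append(f"name {name!r} is not snake_case")
    base = Path(root) / name
    # Step 2: the data directory must exist.
    if not (base / "data").is_dir():
        problems.append(f"missing directory {base / 'data'}")
    # Steps 3 and 4: the README and the notebook named after the dataset.
    for required in ("README.md", f"{name}.ipynb"):
        if not (base / required).is_file():
            problems.append(f"missing file {base / required}")
    return problems
```

For example, calling missing_parts("fake_news_detection_kaggle") from the repository root returns an empty list once the folder, README and notebook are all in place.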

TODO

Finish prepared datasets:

  • coaid
  • that_is_a_known_lie
  • fake_health
  • fake_covid