Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

CMU MultimodalSDK is a machine learning platform for development of advanced multimodal models as well as easily accessing and processing multimodal datasets.

Stars: ✭ 388 (-17.45%)

Mutual labels: dataset

Awesome Remote Sensing Change Detection

List of datasets, codes, and contests related to remote sensing change detection

Stars: ✭ 414 (-11.91%)

Mutual labels: dataset

Dsprites Dataset

Dataset to assess the disentanglement properties of unsupervised learning methods

Stars: ✭ 340 (-27.66%)

Mutual labels: dataset

Lidar Bonnetal

Semantic and Instance Segmentation of LiDAR point clouds for autonomous driving

Stars: ✭ 465 (-1.06%)

Mutual labels: dataset

Free Spoken Digit Dataset

A free audio dataset of spoken digits. Think MNIST for audio.

Stars: ✭ 396 (-15.74%)

Mutual labels: dataset

Quickdraw Dataset

Documentation on how to access and use the Quick, Draw! Dataset.

Stars: ✭ 4,622 (+883.4%)

Mutual labels: dataset

Tfrecord

TFRecord reader for PyTorch

Stars: ✭ 377 (-19.79%)

Mutual labels: dataset

Comma2k19

A driving dataset for the development and validation of fused pose estimators and mapping algorithms

Stars: ✭ 391 (-16.81%)

Mutual labels: dataset

Squad Explorer

Visually Explore the Stanford Question Answering Dataset

Stars: ✭ 421 (-10.43%)

Mutual labels: dataset

Data

Python related videos and metadata powering =>

Stars: ✭ 355 (-24.47%)

Mutual labels: dataset

Joke Dataset

A dataset of 200k English plaintext jokes.

Stars: ✭ 447 (-4.89%)

Mutual labels: dataset

Medmnist

[ISBI'21] MedMNIST Classification Decathlon: A Lightweight AutoML Benchmark for Medical Image Analysis

Stars: ✭ 338 (-28.09%)

Mutual labels: dataset

Wuhan 2019 Ncov

2019-nCoV 新冠状病毒 2019-12-01至今国家、省、市三级每日统计数据（支持接口读取）

Stars: ✭ 414 (-11.91%)

Mutual labels: dataset

Seq2seqchatbots

A wrapper around tensor2tensor to flexibly train, interact, and generate data for neural chatbots.

Stars: ✭ 466 (-0.85%)

Mutual labels: dataset

Mongodb Json Files

📦 A curated list of JSON / BSON datasets from the web in order to practice / use in MongoDB

Stars: ✭ 456 (-2.98%)

Mutual labels: dataset

Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO

Stars: ✭ 427 (-9.15%)

Mutual labels: dataset

View All Similar Projects ➔

中文谣言数据

该数据为从新浪微博不实信息举报平台抓取的中文谣言数据，分为两个部分。其中当前目录下的数据集仅包含谣言原微博，不包含转发/评论信息；而CED_Dataset中是包含转发/评论信息的中文谣言数据集。

数据集介绍

第一部分数据集（./rumors_v170613.json）共包含从2009年9月4日至2017年6月12日的31669条谣言。文件中，每一行为一条json格式的谣言数据，字段释义如下：

rumorCode: 该条谣言的唯一编码，可以通过该编码直接访问该谣言举报页面。
title: 该条谣言被举报的标题内容
informerName: 举报者微博名称
informerUrl: 举报者微博链接
rumormongerName: 发布谣言者的微博名称
rumormongerUr: 发布谣言者的微博链接
rumorText: 谣言内容
visitTimes: 该谣言被访问次数
result: 该谣言审查结果
publishTime: 该谣言被举报时间

引用

如果您使用该数据集，请引用以下论文：

中文：

	@article{liu2015rumors,
	  title={中文社交媒体谣言统计语义分析},
	  author={刘知远 and 张乐 and 涂存超 and 孙茂松},
	  journal={中国科学: 信息科学},
	  volume={12},
	  pages={1536--1546},
	  year={2015}
	}

English：

	@article{liu2015rumors,
	  title={Statistical and semantic analysis of rumors in Chinese social media},
	  author={Liu, Zhiyuan and Zhang, Le and Tu, Cunchao and Sun, Maosong},
	  journal={Scientia Sinica Informationis},
	  volume={45},
	  number={12},
	  pages={1536},
	  year={2015}
	}

第二部分数据集（CED_Dataset）包含与微博原文相关的转发与评论信息，数据集中共包含谣言1538条和非谣言1849条。该数据集分为微博原文与其转发/评论内容。其中所有微博原文（包含谣言与非谣言）在original-microblog文件夹中，剩余两个文件夹non-rumor-repost和rumor-repost分别包含非谣言原文与谣言原文的对应的转发与评论信息。（该数据集中并不区分评论与转发）该数据文件中，每条原文，评论或评论均为json格式的数据，部分字段释义如下：

微博原文信息：
- text: 微博原文的文字内容
- user: 发布该条微博原文的用户信息
- time: 用户发布该条微博原文的时间（时间戳格式）
转发/评论信息：
- uid: 发布该转发/评论的用户ID
- text: 转发/评论的文字内容（若部分用户转发时不添加评论内容，该项无内容）
- data: 该转发/评论的发布时间（格式如：2014-07-24 14:37:38）

引用

如果您使用该数据集，请引用以下论文：

@article{song2018ced,
  title={CED: Credible Early Detection of Social Media Rumors},
  author={Song, Changhe and Tu, Cunchao and Yang, Cheng and Liu, Zhiyuan and Sun, Maosong},
  journal={arXiv preprint arXiv:1811.04175},
  year={2018}
}

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 470

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (8) 🔗