Top 763 data open source projects

fakenewsdata1
This repository contains two independent news datasets used in the 2017 study: "This Just In: Fake News Packs a Lot in Title, Uses Simpler, Repetitive Content in Text Body, More Similar to Satire than Real News"
✭ 18
faker
Generate massive amounts of fake data in the browser and node.js
sql-to-mongodb
A Node.js script to convert an SQL table to a MongoDB database.
kart
Distributed version-control for geospatial and tabular data
steam-data
A simple data project for Steam data
sketch-data-faker
A Sketch plugin providing 130+ types of smart placeholder content for your mockups from Faker.js and other sources.
urban-and-regional-planning-resources
Community list of data & technology resources concerning the built environment and communities. 🏙️🌳🚌🚦🗺️
great-migration
Copy objects from Rackspace to S3
chord-transitions
Transitioning Chord Diagram Demo with Angular/D3
ccu-historian
CCU-Historian records the operating data of the HomeMatic home automation system made by eQ-3.
datasets
The primary repository for all of the CORGIS Datasets
i18n-testing
International data for testing and QA
rq-data
Fetch data for stocks, futures, and more
irsync
rsync on interval, via command line binary or docker container. Server and IOT builds for pull or push based device content management.
ElegantData
Work with SharedPreferences and File storage the same way you work with Room.
neiss
Data from National Electronic Injury Surveillance System
✭ 45
harlan
Harlan is a modular system that lets you automate all of your registration-data governance from the cloud.
goseeder
Go database seeder inspired by the Laravel/Lumen seeder and more
mysql-random-data-generator
This is the easiest MySQL random test data generator tool. Load the procedure and execute to auto detect column types and load data.
COVID19Tweet
WNUT-2020 Task 2: Identification of informative COVID-19 English Tweets
conp-dataset
📂 A DataLad dataset for CONP
cloud-data-analysis-at-scale
[Course-2020-2022] taught at Duke MIDS. This is also a Coursera Course that covers MLOps, ML Engineering and the foundations of Cloud Computing for Data Science.
pgsink
Logically replicate data out of Postgres into sinks (files, Google BigQuery, etc)
machine learning
A gentle introduction to machine learning: data handling, linear regression, naive bayes, clustering
farolcovid
🚦🏥 Monitoring tool and simulation of the risk of collapse in Brazilian municipalities' health systems due to Covid-19
stat133-spring-2019
Course materials for Stat 133, Spring 2019, at UC Berkeley
rockhound
NOTICE: This library is no longer being developed. Use Ensaio instead (https://www.fatiando.org/ensaio). -- Download geophysical models/datasets and load them in Python
simpleopendata
simple guidelines for publishing open data in useful formats
rastercube
rastercube is a python library for big data analysis of georeferenced time series data (e.g. MODIS NDVI)
Data-Export
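Data-Export exports on-chain data to storage backends such as MySQL and ES that are suited to big-data processing, addressing the complex querying, analysis, visualization, and processing of blockchain data.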
zpy
Synthetic data for computer vision. An open source toolkit using Blender and Python.
xfinity-data-usage
Fetch Xfinity data usage and serve it via an HTTP endpoint, publish it to MQTT, or post it to a URL.
widgets
Widgets for blockchain data visualizations
godmt
Tool that can parse Go files into an abstract syntax tree and translate it to several programming languages.
ESSE
Encrypted peer-to-peer system for data security. Own data, own privacy. (Rust+Flutter)
knime-r
KNIME Interactive R Statistics Integration
541-600 of 763 data projects