Top 763 data open source projects

Rtrek
R package for Star Trek datasets and related R functions.
Seek
For finding, sharing and exchanging Data, Models, Simulations and Processes in Science.
Unicode Tr51
Emoji data extracted from Unicode Technical Report #51.
Tools
My MATLAB tools + other stuff
✭ 37
matlab, data
D3 In Motion
Code examples and references for the course "D3.js in Motion"
Auto Value Bundle
Extends AutoValue to extract data from a bundle into a value object.
Apogee
Tools for dealing with APOGEE data
Us Polling Places
Standardized data on historical general election polling places in the United States.
Universityrecruitment Ssurvey
Using rigorous data to answer the question "What kinds of companies recruit at what kinds of universities?"
Jaymock
Minimal fake JSON test data generator.
Cryptoinscriber
📈 A live cryptocurrency historical trade data blotter. Download live historical trade data from any cryptoexchange, be it for machine learning, backtesting/visualizing trading strategies or for Quantopian/Zipline.
English synonyms antonyms list
List of English synonyms and antonyms parsed from the public domain book of James C. Fernald, 1896
Python Stream
A more elegant way to process streaming data
Struct
A modern, scalable, graceful, easy-to-use data structure validator
Go Mesh
Realtime data exchange platform for Smart Cities
Dart
Self-service data workflow management
Nada
National Data Archive (NADA) is an open source data cataloging system that serves as a portal for researchers to browse, search, compare, apply for access, and download relevant census or survey information. It was originally developed to support the establishment of national survey data archives.
✭ 14
data
Samples Viewer Generator
🎉 A CLI utility to generate a web app of data visualization samples for presentation purposes
Kakajson
Fast conversion between JSON and model in Swift.
Dataproperty
A Python library for extracting properties from data.
✭ 10
python, data
Graph
Graph is a semantic database that is used to create data-driven applications.
Modelassistant
Elegant library to manage the interactions between view and model in Swift
Dendro
"Open-source Dropbox" with added description features. It is a data storage and description platform designed to help researchers and other users to describe their data files, built on Linked Open Data and ontologies. Users can use Dendro to publish data to CKAN, Zenodo, DSpace or EUDAT's B2Share and others.
Agots
Anomaly Generator on Time Series
Pytest Patterns
A couple of examples showing how pytest and its plugins can be combined to solve real-world needs.
Dztalkapp
Delphi non-visual component to communicate between applications
Poetry
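A very comprehensive dataset of classical Chinese poetry, containing more than 850,000 poems from the pre-Qin period to modern times.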
Gcamdata
The GCAM data system
✭ 22
r, data
Mithril Data
A rich data model library for the Mithril JavaScript framework
Flight Prices Scraper
Automated script to scrape flight prices from any website into CSV format
Z1p
Zip code validation and parsing.
Bits
A bite-sized library for dealing with bytes.
Web
React web interface for the OpenDota platform
Awesome Ai Ml Dl
Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.
Sensei Grid
Simple and lightweight data grid in JS/HTML
Brasil.io
Backend for Brasil.IO (for the code of the data collection scripts, see the link on each dataset's page)
Awesome Streamlit
The purpose of this project is to share knowledge on how awesome Streamlit is and can be
Datacurator Filetree
A standard filetree for /r/datacurator (and /r/datahoarder)
Rows
A common, beautiful interface to tabular data, no matter the format
Terriajs
A library for building rich, web-based geospatial data platforms.
Listen To Wikipedia
Live, generative music from Wikipedia edits
Mcw
Microsoft Cloud Workshop Project
Octo Cli
CLI tool to expose data from any database as a serverless web service.
Pyjanitor
Clean APIs for data cleaning. A Python implementation of the R package janitor.
Fsharp.data
F# Data: Library for Data Access
Vad
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Datafusion
DataFusion has now been donated to the Apache Arrow project
Valid.js
📝 A library for data validation.
Datasheets
Read data from, write data to, and modify the formatting of Google Sheets