Top 763 data open source projects

Datasets
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Http Fake Backend
Build a fake backend by providing the content of JSON files or JavaScript objects through configurable routes.
Vis Academy
A set of tutorials on how our frameworks make effective data visualization applications.
Datbase
[DEPRECATED] Open data sharing powered by Dat
Wikibase Sdk
JS utils functions to query a Wikibase instance and simplify its results
Aresdb
A GPU-powered real-time analytics storage and query engine.
Blazortable
Blazor Table Component with Sorting, Paging and Filtering
Vscode Data Preview
Data Preview 🈸 extension for importing 📤 viewing 🔎 slicing 🔪 dicing 🎲 charting 📊 & exporting 📥 large JSON array/config, YAML, Apache Arrow, Avro, Parquet & Excel data files
Pandas Gbq
Pandas Google BigQuery
Retriever
Quickly download, clean up, and install public datasets into a database management system
Parsr
Transforms PDF, Documents and Images into Enriched Structured Data
Voicebook
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
Faker
Provides fake data to your Android apps :)
Bigbash
A converter that generates a bash one-liner from an SQL Select query (no DB necessary)
Data
Data and code behind the articles and graphics at FiveThirtyEight
Gspread Pandas
A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
Minecraft Data
Language independent module providing minecraft data for minecraft clients, servers and libraries.
Chord
Python package for creating beautiful interactive Chord Diagrams. Pro version available at https://m8.fyi/chord
Taxize
A taxonomic toolbelt for R
Charlatan
Create fake data in R
Hodur Engine
Hodur is a domain modeling approach and collection of libraries to Clojure. By using Hodur you can define your domain model as data, parse and validate it, and then either consume your model via an API or use one of the many plugins to help you achieve mechanical results faster and in a purely functional manner.
Splitgraph
Splitgraph command line client and python library
Tianyancha
pip安装的天眼查爬虫API,指定的单个/多个企业工商信息一键保存为Excel/JSON格式。A Battery-included Scraper API of Tianyancha, the best Chinese business data and investigation platform.
Stupidedi
Ruby API for parsing and generating ASC X12 EDI transactions
Gopherlabs
Go - Beginners | Intermediate | Advanced
Dython
A set of data tools in Python
Temporal
☄️ Temporal is an easy-to-use, enterprise-grade interface into distributed and decentralized storage
Awesome Json Datasets
A curated list of awesome JSON datasets that don't require authentication.
100daysofcode
#100DaysOfCode - Learn by developing 100 unique apps to explore exciting tech stacks
Climate Change Data
🌍 A curated list of APIs, open data and ML/AI projects on climate change
Elasticsearch Test Data
Generate and upload test data to Elasticsearch for performance and load testing
Bitglitter
⚡ Embed data payloads inside of ordinary images or video with high-performance animated 2-D barcodes. (Python library)
Rediscompare
rediscompare is a tool for chech two redis db data consistency. 是用来对比、校验redis 多个数据库数据一致性的命令行工具,支持单实例到单实例、单实例到原生集群、多实例多库到单实例等场景。
Opendata
CRAN OpenData Task View
Vue Smooth Picker
🏄🏼 A SmoothPicker for Vue 2 (like native datetime picker of iOS)
California Coronavirus Data
The Los Angeles Times' independent tally of coronavirus cases in California.
Nestedtext
Human Readable and Writable Data Interchange Format
Mirador
Tool for visual exploration of complex data.
Scio
A Scala API for Apache Beam and Google Cloud Dataflow.
Dfuse Eosio
dfuse for EOSIO
Cryptag
Encrypted, taggable, searchable cloud storage
Pygeoapi
pygeoapi is a Python server implementation of the OGC API suite of standards. The project emerged as part of the next generation OGC API efforts in 2018 and provides the capability for organizations to deploy a RESTful OGC API endpoint using OpenAPI, GeoJSON, and HTML. pygeoapi is open source and released under an MIT license.
Datash
Send and Receive files directly from your browser with end-to-end encryption
Uiemptystate
An empty state control to give visually appealing context when building iOS applications.
Nessie
Nessie provides Git-like capabilities for your Data Lake
✭ 176
javadata
Fake2db
Generate fake but valid data filled databases for test purposes using most popular patterns(AFAIK). Current support is sqlite, mysql, postgresql, mongodb, redis, couchdb.
Openintro
📦 R package for data and supplemental functions for OpenIntro resources
✭ 176
rdatarstats
Ncov2019 data crawler
疫情数据爬虫,2019新型冠状病毒数据仓库,轨迹数据,同乘数据,报道
Databay
Databay is a Python interface for scheduled data transfer. It facilitates transfer of (any) data from A to B, on a scheduled interval.
Grafter
Linked Data & RDF Manufacturing Tools in Clojure
Everypolitician Data
data for national legislatures worldwide
Lfai Landscape
🌄 Open Source AI Landscape - provides overview of top tier projects in the open source AI ecosystem, shows projects through GitHub data, funding or market cap, first and last commits, contributor count and much other information.
General Store
Simple, flexible store implementation for Flux. #hubspot-open-source
Data Science Resources
👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
1-60 of 763 data projects