ExportsheetdataAdd-on for Google Sheets that allows sheets to be exported as JSON or XML.
OnyxDistributed, masterless, high performance, fault tolerant data processing
AirbyteAirbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
PytubesA module for getting data into python from large data sources
StatsA well tested and comprehensive Golang statistics library package with no dependencies.
Pandas DatareaderExtract data from a wide range of Internet sources into a pandas DataFrame.
DopJavaScript implementation for Distributed Object Protocol
GobblinA distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
Anaconda ProjectTool for encapsulating, running, and reproducing data science projects
HottboxHOTTBOX: Higher Order Tensors ToolBOX.
TeraAn Internet-Scale Database.
AudioowlFast and simple music and audio analysis using RNN in Python 🕵️♀️ 🥁
PyfunctionalPython library for creating data pipelines with chain functional programming
Azkarra Streams🚀 Azkarra is a lightweight java framework to make it easy to develop, deploy and manage cloud-native streaming microservices based on Apache Kafka Streams.
App Dirs RsPut your Rust app's data in the right place on every platform
DatacompyPandas and Spark DataFrame comparison for humans
Xlsx.jlExcel file reader and writer coded in pure Julia.
Fiware Orion An implementation of the Publish/Subscribe Context Broker GE, providing NGSI interfaces.
ApisMaking data readily available to anyone interested
GeneratedataA powerful, feature-rich, random test data generator.
GofakeitRandom fake data generator written in go
CylonCylon is a fast, scalable distributed memory data parallel library for processing structured data
Data science blogsA repository to keep track of all the code that I end up writing for my blog posts.
KeaProduction Ready State Management for React
Data Forge JsJavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
LookerbotLookerbot lets you access all your Looker data from Slack! Super fun!
Dataspice🌶 Create lightweight schema.org descriptions of your datasets
Datasets🎁 3,000,000+ Unsplash images made available for research and machine learning
DataAssorted data from the General Services Administration.
Kotlin FakerGenerate realistically looking fake data such as names, addresses, banking details, and many more, that can be used for testing and data anonymization purposes.
DxrDXR is a Unity package for rapid prototyping of immersive data visualizations in augmented, mixed, and virtual reality (AR, MR, VR) or XR for short.
MobileReact Native apps for viewing Dota 2 data on Android/iOS
Blockchain2graphBlockchain2graph extracts blockchain data (bitcoin) and insert them into a graph database (neo4j).
Csv2ofxA Python library and command line tool for converting csv to ofx and qif files
Data PopulatorA plugin for Sketch and Adobe XD to populate your design mockups with meaningful data. Goodbye Lorem Ipsum. Hello JSON.
Mara PipelinesA lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Census ApiThe home for the API that powers the Census Reporter project.
Noobaa CoreNooBaa is a Dynamic Data Gateway for cloud-native, hybrid and multi cloud environments ☁️🚀
JhtalibTechnical Analysis Library Time-Series
PopoPoPo is the grid layout tool, the best choice for runtime layout.
Reddit DetectivePlay detective on Reddit: Discover political disinformation campaigns, secret influencers and more
Asr audio data linksA list of publically available audio data that anyone can download for ASR or other speech activities
Awesome Brazil DataCurated list of Brazilian datasets for anyone interested in studying the country.
Wxconn统计你的微信连接多少人,包括好友、群聊人数,并提供去重后的长图结果
FpartSort files and pack them into partitions
Githubrankingsspain⬆️ Rankings with the most active GitHub users in Spain (sorted by public contributions) 🇪🇸
Police SettlementsA FiveThirtyEight/The Marshall Project effort to collect comprehensive data on police misconduct settlements from 2010-19.