Top 763 data open source projects

datalang
Package to translate R data sets
connect
Toolsets for retrieving data from a remote source
realgpserver
程序采用Python语言进行编写开发,用来接收GPS原始数据,并进行解析入库Mysql。主要用到SocketServer,log,command,dbhandler,config几个模块。
RecoverPy
🙈 Interactively find and recover deleted or 👉 overwritten 👈 files from your terminal
idsa
This is the main repository of International Data Spaces Association on GitHub, where you can find general overview and required information on IDS Open Source Landscape.
apaga-luz
💡 ¿Cuánto cuesta la luz? 💶
minifaker
A lightweight alternative to faker.js
parcours-r
Valise pédagogique pour la formation à R
SNAP
Easy data format saving and loading for GameMaker Studio 2.3.2
awesome-data-show
Show most interesting data-source around the financial world
usmap
🗺 Create US maps including Alaska and Hawaii in R
raccoon
Python DataFrame with fast insert and appends
sfdc-generate-data-dictionary
Generate data dictionary from a Salesforce Org. This tool can also generate a file that can be imported in Lucidchart to define entities and relationships.
jds
Jenesis Data Store: a dynamic, cross platform, high performance, ORM data-mapper. Designed to assist in rapid development and data mining
fc4-framework
A Docs as Code tool that helps software creators and documentarians author software architecture diagrams using the C4 model for visualising software architecture.
yt-channels-DS-AI-ML-CS
A comprehensive list of 180+ YouTube Channels for Data Science, Data Engineering, Machine Learning, Deep learning, Computer Science, programming, software engineering, etc.
course-17-18
🎓 Frontend 3 · 2017-2018 · Curriculum and Syllabus 📊
silky-charts
A silky smooth D3/React library
humanparser
Parse a human name string into salutation, first name, middle name, last name, suffix.
ODSC India 2018
My presentation at ODSC India 2018 about Deep Learning with Apache Spark
wdpar
Interface to the World Database on Protected Areas
wumpfetch
🚀🔗 A modern, lightweight, fast and easy to use Node.js HTTP client
data-inspector
Data Inspector is an open-source python library that brings 15++ types of different functions to make EDA, data cleaning easier.
dit-cli
The interface for dit, a universal container file.
laravel-json-seeder
Create and use JSON files to seed your database in your Laravel applications
datalize
Parameter, query, form data validation and filtering for NodeJS.
tftargets
🎯 Human transcription factor target genes.
datatasks
Задачи для волонтеров/стажеров/всех желающих по работе с открытыми, большими данными. А также всеми иными задачами связанными с темами краудсорсинга, понятного языка и электронной архивации
algo-ds-101
Curated list of data structures and algorithms in 10+ programming languages.
bitcoin-development-history
Data and a example for a open source timeline of the history of Bitcoin development
flytekit
Extensible Python SDK for developing Flyte tasks and workflows. Simple to get started and learn and highly extensible.
data science chile
Lista de cursos de Data Science en Chile 📈📊🇨🇱
zoe
Zoe: Container Analytics as a Service -- mirror of https://gitlab.eurecom.fr/zoe/main/
sacred
📖 Sacred texts in R
SDC-to-Compustat-Mapping
A mapping between SDCs M&A database and the gvkey's in Compustat
pyvaru
Rule based data validation library for python 3.
adage
Data and code related to the paper "ADAGE-Based Integration of Publicly Available Pseudomonas aeruginosa..." Jie Tan, et al · mSystems · 2016
metacritic api
PHP Metacritic API - Mirrored by my GitLab
tscompdata
Time series competition data
kuwala
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…
covid-19
Current and historical coronavirus covid-19 confirmed, recovered, deaths and active case counts segmented by country and region. Includes csv, json and sqlite data along with an interactive website explorer.
bx-data
Удобные классы для 1C-Bitrix.
SciDataTool
SciDataTool is an open-source Python package for scientific data handling. The objective is to provide a user-friendly, unified, flexible module to postprocess any kind of signal. It is meant to be used by researchers, R&D engineers and teachers in any scientific area. This package allows to efficiently store data fields in the time/space or in …
python-for-data-and-media-communication-gitbook
An open source book on Python tailed for communication students with zero background
cue
The new home of the CUE language! Validate and define text-based and dynamic configuration
S4
S4 is 100% S3 compatible storage, accessed through Tor and distributed using IPFS.
shared-row
This is an open data specification for describing the right-of-way (ROW) for street centerline networks. It is intended to establish a common set of attributes (schema) to describe how space is allocated along a streets right of way from sidewalk edge to sidewalk edge.
soCareers-Data
Data and data processing scripts of StackOverflow Careers pages
awesome-csv
Awesome Comma-Separated Values (CSV) - What's Next? - Frequently Asked Questions (F.A.Q.s) - Libraries & Tools
TED-Talks
All TED talks narratives extracted and cleaned.