All Projects → datamade → Data Making Guidelines

datamade / Data Making Guidelines

Licence: mit
📘 Making Data, the DataMade Way

Projects that are alternatives of or similar to Data Making Guidelines

Getting Started
This repository is a getting started guide to Singer.
Stars: ✭ 734 (+195.97%)
Mutual labels:  makefile, etl
Write Readable Javascript Code
📖 All about writing maintainable JavaScript
Stars: ✭ 244 (-1.61%)
Mutual labels:  principles
Clean Code Javascript Tr
JavaScript için Uyarlanmış Temiz Kod Kavramları
Stars: ✭ 232 (-6.45%)
Mutual labels:  principles
Source Code Examples
Examples of code for the ESP8266
Stars: ✭ 237 (-4.44%)
Mutual labels:  makefile
Openwrt Trojan
trojan and its dependencies for OpenWrt
Stars: ✭ 236 (-4.84%)
Mutual labels:  makefile
Dircolors Solarized
This is a repository of themes for GNU ls (configured via GNU dircolors) that support Ethan Schoonover’s Solarized color scheme.
Stars: ✭ 2,671 (+977.02%)
Mutual labels:  makefile
Smallest Secured Golang Docker Image
Create the smallest and secured golang docker image based on scratch
Stars: ✭ 229 (-7.66%)
Mutual labels:  makefile
Aws Etl Orchestrator
A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
Stars: ✭ 245 (-1.21%)
Mutual labels:  etl
Personal Server
Personal server configuration with k3s
Stars: ✭ 2,784 (+1022.58%)
Mutual labels:  makefile
Rhplaceholder
Show pleasant loading view for your users 😍
Stars: ✭ 238 (-4.03%)
Mutual labels:  makefile
Memory Hack
打造超人大脑
Stars: ✭ 237 (-4.44%)
Mutual labels:  makefile
Bullet Train.zsh
🚄 An oh-my-zsh shell theme based on the Powerline Vim plugin
Stars: ✭ 2,628 (+959.68%)
Mutual labels:  makefile
Mach
A remake of make (in ClojureScript)
Stars: ✭ 240 (-3.23%)
Mutual labels:  makefile
Eland
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (-5.24%)
Mutual labels:  etl
Example Airflow Dags
Example DAGs using hooks and operators from Airflow Plugins
Stars: ✭ 243 (-2.02%)
Mutual labels:  etl
Storagetapper
StorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service
Stars: ✭ 232 (-6.45%)
Mutual labels:  etl
Build Harness
🤖Collection of Makefiles to facilitate building Golang projects, Dockerfiles, Helm charts, and more
Stars: ✭ 236 (-4.84%)
Mutual labels:  makefile
Tesla Menu
The Nintendo Switch overlay menu
Stars: ✭ 236 (-4.84%)
Mutual labels:  makefile
Python Ios Support
A meta-package for building a version of Python that can be embedded into an iOS project.
Stars: ✭ 246 (-0.81%)
Mutual labels:  makefile
Verified Smart Contracts
Smart contracts which are formally verified
Stars: ✭ 243 (-2.02%)
Mutual labels:  makefile

Making Data, the DataMade Way

This is DataMade's guide to extracting, transforming and loading (ETL) data using Make, a common command line utility.

This guide is part of a body of technical and process documentation maintained by DataMade. Head over to datamade/how-to for other guides on topics ranging from AWS to work practices!

What is ETL?

ETL refers to the general process of:

  1. taking raw source data ("Extract")
  2. doing some stuff to get the data in shape, possibly involving intermediate derived files ("Transform")
  3. producing final output in a more usable form (for "Loading" into something that consumes the data - be it an app, a system, a visualization, etc.)

Having a standard ETL workflow helps us make sure that our work is clean, consistent, and easy to reproduce. By following these guidelines you'll be able to keep your work up to date and share it with the world in a standard format - all with as few headaches as possible.

Basic Principles

These five principles inform all of our data work:

  1. Never destroy data - treat source data as immutable, and show your work when you modify it
  2. Be able to deterministically produce the final data with one command
  3. Write as little custom code as possible
  4. Use standard tools whenever possible
  5. Keep source data under version control

Unsure how to follow these principles? Read on!

The Guide

  1. Make & Makefile Overview
  2. ETL Styleguide

Code examples

Further reading

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].