All Projects → oleg-agapov → Data Engineering Book

oleg-agapov / Data Engineering Book

Accumulated knowledge and experience in the field of Data Engineering

Projects that are alternatives of or similar to Data Engineering Book

Pyjanitor
Clean APIs for data cleaning. Python implementation of R package Janitor
Stars: ✭ 647 (+37.37%)
Mutual labels:  data-engineering, data
Udacity Data Engineering Projects
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Stars: ✭ 458 (-2.76%)
Mutual labels:  data-engineering, data
Airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+944.37%)
Mutual labels:  data, data-engineering
Quilt
Quilt is a self-organizing data hub for S3
Stars: ✭ 1,007 (+113.8%)
Mutual labels:  data-engineering, data
Just Dashboard
📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (+220.81%)
Mutual labels:  data-engineering, data
Learn Something Every Day
📝 A compilation of everything that I learn; Computer Science, Software Development, Engineering, Math, and Coding in General. Read the rendered results here ->
Stars: ✭ 362 (-23.14%)
Mutual labels:  data-engineering, engineering
Gspread Pandas
A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
Stars: ✭ 226 (-52.02%)
Mutual labels:  data-engineering, data
Tensorbase
TensorBase BE is building a high performance, cloud neutral bigdata warehouse for SMEs fully in Rust.
Stars: ✭ 440 (-6.58%)
Mutual labels:  data, engineering
Great expectations
Always know what to expect from your data.
Stars: ✭ 5,808 (+1133.12%)
Mutual labels:  data-engineering
Cs193p Winter 2017
These are the lectures, slides, reading assignments, and problem sets for the 'Developing iOS 10 Apps with Swift' CS193p course offered at the Stanford School of Engineering and available on iTunes U.
Stars: ✭ 447 (-5.1%)
Mutual labels:  engineering
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (-12.31%)
Mutual labels:  data
Data
This repository contains general data for Web technologies
Stars: ✭ 418 (-11.25%)
Mutual labels:  data
Fetch
Simple & Efficient data access for Scala and Scala.js
Stars: ✭ 453 (-3.82%)
Mutual labels:  data
Active workflow
Turn complex requirements to workflows without leaving the comfort of your technology stack.
Stars: ✭ 413 (-12.31%)
Mutual labels:  data-engineering
React Spreadsheet
Simple, customizable yet performant spreadsheet for React
Stars: ✭ 393 (-16.56%)
Mutual labels:  data
Tabulator
Interactive Tables and Data Grids for JavaScript
Stars: ✭ 4,329 (+819.11%)
Mutual labels:  data
Rio
A Swiss-Army Knife for Data I/O
Stars: ✭ 467 (-0.85%)
Mutual labels:  data
Gop
GoPlus - The Go+ language for engineering, STEM education, and data science
Stars: ✭ 7,829 (+1562.21%)
Mutual labels:  engineering
Lexpredict Lexnlp
LexNLP by LexPredict
Stars: ✭ 439 (-6.79%)
Mutual labels:  data
Empathy In Engineering
A curated list of resources for building and promoting more compassionate engineering cultures
Stars: ✭ 425 (-9.77%)
Mutual labels:  engineering

Data Engineering Book

Accumulated knowledge and experience in the field of Data Engineering

Data engineering book cover

About this book

The book covers different aspects of Data Engineering, from basic topics like databases, SQL and ETL to advanced like data architecture and Big Data stacks.

But it is still under development. It has no strict set of topics I want to cover, but it will be pretty close to what I've described in my Data Engineering Roadmap.

How to read this Book

If you are an absolute novice – start with introduction to Data Engineering. I will explain who are data engineers, what tasks they perform, which skill are required etc.

If you already decided to learn data engineering, but don't know where to start – head on to the Data Engineering roadmap. There I show three paths you can take, from absolute beginner to advanced levels.

Lastly, if you know what exactly you want to learn then head to the table of content down below and find the most interesting topics for you.

Updates

Table of content

  1. Introduction to Data Engineering
    1. What is Data Engineering?
    2. Data Engineering Roadmap?
  2. Beginner path
    1. Intro to databases
    2. SQL for beginners
  3. Big Data path
  4. Data Architect path

Feedback

If you have any feedback or other questions, please refer to this form.

About author

My name is Oleg Agapov and I'm a data analyst.

I work with data, analytics, engineering and sometimes magic ✨

twitter: @oleg_agapov_

License

You may freely copy and distribute portions of this book as long as you give appropriate credit and indicate if changes were made. You cannot use this book for any commercial purpose.

Copyright ©2020 Oleg Agapov.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].