
dataform-co / Dataform

Licence: MIT
Dataform is a framework for managing SQL-based data operations in BigQuery, Snowflake, and Redshift.

Programming Languages

typescript
32286 projects

Projects that are alternatives to or similar to Dataform

Sayn
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Stars: ✭ 79 (-76.9%)
Mutual labels:  analytics, etl, data-engineering
beneath
Beneath is a serverless real-time data platform ⚡️
Stars: ✭ 65 (-80.99%)
Mutual labels:  etl, analytics, data-engineering
Redash
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Stars: ✭ 20,147 (+5790.94%)
Mutual labels:  hacktoberfest, analytics, business-intelligence
Ananas Desktop
A hackable data integration & analysis tool to enable non-technical users to edit data processing jobs and visualise data on demand.
Stars: ✭ 551 (+61.11%)
Mutual labels:  analytics, etl, business-intelligence
Superset
Apache Superset is a Data Visualization and Data Exploration Platform
Stars: ✭ 42,634 (+12366.08%)
Mutual labels:  analytics, data-engineering, business-intelligence
Aws Serverless Data Lake Framework
Enterprise-grade, production-hardened, serverless data lake on AWS
Stars: ✭ 179 (-47.66%)
Mutual labels:  analytics, etl, data-engineering
Grafana
The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many more.
Stars: ✭ 45,930 (+13329.82%)
Mutual labels:  hacktoberfest, analytics, business-intelligence
versatile-data-kit
Versatile Data Kit (VDK) is an open source framework that enables anybody with basic SQL or Python knowledge to create their own data pipelines.
Stars: ✭ 144 (-57.89%)
Mutual labels:  etl, data-engineering
gallia-core
A schema-aware Scala library for data transformation
Stars: ✭ 44 (-87.13%)
Mutual labels:  etl, data-engineering
pangeo-forge-recipes
Python library for building Pangeo Forge recipes.
Stars: ✭ 64 (-81.29%)
Mutual labels:  etl, data-engineering
growthbook
Open Source Feature Flagging and A/B Testing Platform
Stars: ✭ 2,342 (+584.8%)
Mutual labels:  analytics, data-engineering
hamilton
A scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+78.95%)
Mutual labels:  etl, data-engineering
uptasticsearch
An Elasticsearch client tailored to data science workflows.
Stars: ✭ 47 (-86.26%)
Mutual labels:  etl, data-engineering
open-semantic-desktop-search
Virtual Machine for Desktop Search with Open Semantic Search
Stars: ✭ 22 (-93.57%)
Mutual labels:  etl, analytics
blockchain-etl-streaming
Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (-83.33%)
Mutual labels:  etl, data-engineering
polygon-etl
ETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (-84.5%)
Mutual labels:  etl, data-engineering
arthur-redshift-etl
ELT Code for your Data Warehouse
Stars: ✭ 22 (-93.57%)
Mutual labels:  etl, data-engineering
AirflowDataPipeline
Example of an ETL Pipeline using Airflow
Stars: ✭ 24 (-92.98%)
Mutual labels:  etl, data-engineering
Laravel Gamp
📊 Laravel Google Analytics Measurement Protocol Package
Stars: ✭ 271 (-20.76%)
Mutual labels:  hacktoberfest, analytics
etl manager
A Python package to create a database on the platform using our moj data warehousing framework
Stars: ✭ 14 (-95.91%)
Mutual labels:  etl, data-engineering

Intro

Dataform is an application for managing data in BigQuery, Snowflake, Redshift, and other data warehouses. It enables data teams to build scalable, tested, SQL-based data transformation pipelines using version control and engineering-inspired best practices.

Compile hundreds of data models in under a second using SQLX. SQLX extends your existing SQL warehouse dialect to add features that support dependency management, testing, documentation and more.
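To make the dependency-management idea concrete, here is a minimal conceptual sketch in TypeScript. It assumes models reference each other with a `${ref("name")}` placeholder and shows how those references could be turned into a run order. The `Models` type, `extractRefs`, `executionOrder`, and the sample model names are all hypothetical and for illustration only; this is not Dataform's actual compiler or API.

```typescript
// Hypothetical model definitions: model name -> SQLX-style query body.
// This is an illustrative sketch, not Dataform's implementation.
type Models = Record<string, string>;

// Extract the names referenced via ${ref("name")} in a query body.
function extractRefs(body: string): string[] {
  const pattern = /\$\{\s*ref\(\s*["']([^"']+)["']\s*\)\s*\}/g;
  const refs: string[] = [];
  let match: RegExpExecArray | null;
  while ((match = pattern.exec(body)) !== null) {
    refs.push(match[1]);
  }
  return refs;
}

// Order models so that each one comes after everything it references
// (depth-first topological sort with cycle detection).
function executionOrder(models: Models): string[] {
  const order: string[] = [];
  const state = new Map<string, "visiting" | "done">();

  const visit = (name: string): void => {
    if (state.get(name) === "done") return;
    if (state.get(name) === "visiting") {
      throw new Error(`Circular dependency at ${name}`);
    }
    const body = models[name];
    if (body === undefined) {
      throw new Error(`Unknown model: ${name}`);
    }
    state.set(name, "visiting");
    for (const dep of extractRefs(body)) visit(dep);
    state.set(name, "done");
    order.push(name);
  };

  for (const name of Object.keys(models)) visit(name);
  return order;
}

// Sample project with three made-up models.
const models: Models = {
  daily_revenue:
    'select day, sum(amount) as revenue from ${ref("orders_clean")} group by day',
  orders_clean: 'select * from ${ref("orders_raw")} where amount is not null',
  orders_raw: "select * from source.orders",
};

console.log(executionOrder(models));
```

Because the order is derived from the references themselves, adding a new model only requires writing its query; nothing else needs to be re-sequenced by hand.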



Supported warehouses

  • BigQuery
  • Snowflake
  • Redshift
  • Postgres
  • Azure SQL Data Warehouse
  • Presto (under development)

Data modeling with Dataform

Dependency tree in a Dataform project

More examples and packages

Get started

With the CLI

You can install the Dataform SDK with the following command. Follow the docs to get started.

npm i -g @dataform/cli

With Dataform web

Dataform web is a development environment and production-ready application for the Dataform SDK. You can learn more at dataform.co.

How it works

More about Dataform

Join the Dataform community

Want to report a bug or request a feature?

  • Create and upvote feature requests on Canny
  • Message us on Slack
  • Open an issue

Want to contribute?

Check out our contributors guide to get started with setting up the repo.
