All Projects → dc-sdk-js → Similar Projects or Alternatives

23 Open source projects that are alternatives of or similar to dc-sdk-js

datajob
Build and deploy a serverless data pipeline on AWS with no effort.
Stars: ✭ 101 (+94.23%)
Mutual labels:  data-pipeline
AirflowETL
Blog post on ETL pipelines with Airflow
Stars: ✭ 20 (-61.54%)
Mutual labels:  data-pipeline
aws-pdf-textract-pipeline
🔍 Data pipeline for crawling PDFs from the Web and transforming their contents into structured data using AWS textract. Built with AWS CDK + TypeScript
Stars: ✭ 141 (+171.15%)
Mutual labels:  data-pipeline
scicloj.ml
A Clojure machine learning library
Stars: ✭ 152 (+192.31%)
Mutual labels:  data-pipeline
React Native Firebase
🔥 A well-tested feature-rich modular Firebase implementation for React Native. Supports both iOS & Android platforms for all Firebase services.
Stars: ✭ 9,674 (+18503.85%)
Mutual labels:  web-sdk
Data Engineering Howto
A list of useful resources to learn Data Engineering from scratch
Stars: ✭ 2,056 (+3853.85%)
Mutual labels:  data-pipeline
Snowplow
The enterprise-grade behavioral data engine (web, mobile, server-side, webhooks), running cloud-natively on AWS and GCP
Stars: ✭ 5,935 (+11313.46%)
Mutual labels:  data-pipeline
trembita
Model complex data transformation pipelines easily
Stars: ✭ 44 (-15.38%)
Mutual labels:  data-pipeline
pipeline
OONI data processing pipeline
Stars: ✭ 36 (-30.77%)
Mutual labels:  data-pipeline
serverless-data-pipeline-sam
Serverless Data Pipeline powered by Kinesis Firehose, API Gateway, Lambda, S3, and Athena
Stars: ✭ 78 (+50%)
Mutual labels:  data-pipeline
network-pipeline
Network traffic data pipeline for real-time predictions and building datasets for deep neural networks
Stars: ✭ 36 (-30.77%)
Mutual labels:  data-pipeline
augraphy
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
Stars: ✭ 49 (-5.77%)
Mutual labels:  data-pipeline
richflow
A Node.js and JavaScript synchronous data pipeline processing, data sharing and stream processing library. Actionable & Transformable Pipeline data processing.
Stars: ✭ 17 (-67.31%)
Mutual labels:  data-pipeline
rivery cli
Rivery CLI
Stars: ✭ 16 (-69.23%)
Mutual labels:  data-pipeline
jobAnalytics and search
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (-51.92%)
Mutual labels:  data-pipeline
ATOM
Automated Tool for Optimized Modelling
Stars: ✭ 85 (+63.46%)
Mutual labels:  data-pipeline
opentrials-airflow
Configuration and definitions of Airflow for OpenTrials
Stars: ✭ 18 (-65.38%)
Mutual labels:  data-pipeline
practical-data-engineering
Real estate dagster pipeline
Stars: ✭ 110 (+111.54%)
Mutual labels:  data-pipeline
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-25%)
Mutual labels:  data-pipeline
Data-pipeline-project
Data pipeline project
Stars: ✭ 18 (-65.38%)
Mutual labels:  data-pipeline
machine-learning-data-pipeline
Pipeline module for parallel real-time data processing for machine learning models development and production purposes.
Stars: ✭ 22 (-57.69%)
Mutual labels:  data-pipeline
ob bulkstash
Bulk Stash is a docker rclone service to sync, or copy, files between different storage services. For example, you can copy files either to or from a remote storage services like Amazon S3 to Google Cloud Storage, or locally from your laptop to a remote storage.
Stars: ✭ 113 (+117.31%)
Mutual labels:  data-pipeline
saisoku
Saisoku is a Python module that helps you build complex pipelines of batch file/directory transfer/sync jobs.
Stars: ✭ 40 (-23.08%)
Mutual labels:  data-pipeline
1-23 of 23 similar projects