26 open source projects that are alternatives to or similar to spark2-etl-examples

databricks-notebooks
Collection of Databricks and Jupyter Notebooks
Stars: ✭ 19 (-17.39%)
Mutual labels:  spark-sql
spark-vcf
Spark VCF data source implementation for Dataframes
Stars: ✭ 15 (-34.78%)
Mutual labels:  spark-sql
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (+69.57%)
Mutual labels:  spark-sql
opaque-sql
An encrypted data analytics platform
Stars: ✭ 169 (+634.78%)
Mutual labels:  spark-sql
albis
Albis: High-Performance File Format for Big Data Systems
Stars: ✭ 20 (-13.04%)
Mutual labels:  spark-sql
dt-sql-parser
SQL Parsers for BigData, built with antlr4.
Stars: ✭ 135 (+486.96%)
Mutual labels:  spark-sql
Tweet-Analysis-With-Kafka-and-Spark
A real-time analytics dashboard to analyze trending hashtags and @mentions at any location using Kafka and Spark Streaming.
Stars: ✭ 18 (-21.74%)
Mutual labels:  spark-sql
geospark
bring sf to spark in production
Stars: ✭ 53 (+130.43%)
Mutual labels:  spark-sql
spark-twitter-sentiment-analysis
Sentiment Analysis of a Twitter Topic with Spark Structured Streaming
Stars: ✭ 55 (+139.13%)
Mutual labels:  spark-sql
bigdatatutorial
bigdatatutorial
Stars: ✭ 34 (+47.83%)
Mutual labels:  spark-sql
Spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+7382.61%)
Mutual labels:  spark-sql
Redash
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Stars: ✭ 20,147 (+87495.65%)
Mutual labels:  spark-sql
spark-structured-streaming-examples
Spark Structured Streaming examples using version 3.0.0
Stars: ✭ 23 (+0%)
Mutual labels:  spark-sql
MCW-Big-data-analytics-and-visualization
MCW Big data analytics and visualization
Stars: ✭ 172 (+647.83%)
Mutual labels:  spark-sql
spark learning
Atguigu (尚硅谷) Big Data Spark, latest 2019 edition; Spark learning
Stars: ✭ 42 (+82.61%)
Mutual labels:  spark-sql
spark-data-sources
Developing Spark External Data Sources using the V2 API
Stars: ✭ 36 (+56.52%)
Mutual labels:  spark-sql
recsys spark
Spark SQL implementations of ItemCF, UserCF, and Swing; recommender systems, recommendation algorithms, collaborative filtering
Stars: ✭ 76 (+230.43%)
Mutual labels:  spark-sql
litemall-dw
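A big data project based on the open source Litemall e-commerce project, including front-end event tracking (openresty+lua) and back-end event tracking, a five-layer data warehouse, real-time computing, and user profiling. The big data platform uses CDH6.3.2 (scripted with vagrant+ansible) and also includes Azkaban workflows.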
Stars: ✭ 36 (+56.52%)
Mutual labels:  spark-sql
big data
A collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (+47.83%)
Mutual labels:  spark-sql
SparkProgrammingInScala
Apache Spark Course Material
Stars: ✭ 57 (+147.83%)
Mutual labels:  spark-sql
spark-sql-internals
The Internals of Spark SQL
Stars: ✭ 331 (+1339.13%)
Mutual labels:  spark-sql
Spark
Apache Spark is a fast, in-memory data processing engine with elegant and expressive development APIs that allow data workers to efficiently execute streaming, machine learning, or SQL workloads that require fast iterative access to datasets. This project will have sample programs for Spark in the Scala language.
Stars: ✭ 55 (+139.13%)
Mutual labels:  spark-sql
Real-time-Data-Warehouse
Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi
Stars: ✭ 52 (+126.09%)
Mutual labels:  spark-sql
Movies-Analytics-in-Spark-and-Scala
Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
Stars: ✭ 47 (+104.35%)
Mutual labels:  spark-sql
wow-spark
🔆 A Spark self-study handbook covering spark core, spark sql, spark streaming, spark-kafka, and delta-lake, plus basic Scala exercises and some source code analysis (e.g. master, shuffle), summaries, and translations.
Stars: ✭ 20 (-13.04%)
Mutual labels:  spark-sql
NYC Taxi Pipeline
Design/Implement stream/batch architecture on NYC taxi data | #DE
Stars: ✭ 16 (-30.43%)
Mutual labels:  spark-batch