GitPlanet
Projects
Users
Categories
Languages
About
All Git Users
→ vim89
1 open source projects by vim89
[ Open user page on Github ]
1.
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
✭ 39
python
big-data
spark
apache-spark
hadoop
etl
xml
xml-parsing
pyspark
data-pipeline
datalake
hadoop-mapreduce
spark-sql
etl-framework
hadoop-hdfs
etl-pipeline
etl-components
1-1
of
1
user projects