GitPlanet
Projects
Users
Categories
Languages
About
All Categories
→
No Category
→ etl-components
Top 2 etl-components open source projects
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
✭ 39
python
big-data
spark
apache-spark
hadoop
etl
xml
xml-parsing
pyspark
data-pipeline
datalake
hadoop-mapreduce
spark-sql
etl-framework
hadoop-hdfs
etl-pipeline
etl-components
vixtract
www.vixtract.ru
✭ 40
HTML
python
Jupyter Notebook
shell
javascript
Dockerfile
etl
etl-framework
etl-pipeline
etl-components
etl-job
etl-automation
1-2
of
2
etl-components projects