All Git Users → GoogleCloudDataproc

7 open source projects by GoogleCloudDataproc

1. Hadoop Connectors
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
2. Cloud Dataproc
Cloud Dataproc: Samples and Utils
3. Spark Bigquery Connector
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
4. Bdutil
[DEPRECATED] Script used to manage Hadoop and Spark instances on Google Compute Engine
✭ 111
shell
5. Initialization Actions
Run in all nodes of your cluster before the cluster starts - lets you customize your cluster
✭ 490
shell
6. custom-images
Tools for creating Dataproc custom images
7. hive-bigquery-storage-handler
Hive Storage Handler for interoperability between BigQuery and Apache Hive
1-7 of 7 user projects