housepower / spark-clickhouse-connector

Licence: Apache-2.0 license
Spark ClickHouse Connector built on the DataSourceV2 API

Spark ClickHouse Connector

Built on the Apache Spark DataSourceV2 API.

Overview

Usage

See the documentation for how to use this connector.

Requirements

  • Java 8 or 11
  • Scala 2.12 or 2.13
  • Apache Spark 3.3

Notes:

  1. As of 0.5.0, this connector switched from the raw ClickHouse gRPC client to the official ClickHouse Java client, which adds HTTP protocol support and extends the range of supported ClickHouse Server versions.

Build

Build without tests

./gradlew clean build -x test

Test

The project uses Testcontainers and Docker Compose for integration tests, so install Docker and Docker Compose before running them. To run the tests against a remote Docker daemon, see the Testcontainers documentation for details.
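
Before starting a long test run, it can help to confirm that the required tools are on the PATH. The following is a minimal sketch, not part of the project: the function name `missing_tools` is hypothetical, and the tool names passed to it are assumptions that depend on how Docker Compose is installed on your machine.

```shell
# Print the name of every command in the argument list that is not on PATH.
# (Hypothetical helper for illustration; it is not shipped with the project.)
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Usage (not run here). "docker-compose" is the standalone binary name;
# installs that ship Compose as the "docker compose" subcommand only need
# the "docker" check (assumption: adjust the list to match your setup).
#   missing=$(missing_tools docker docker-compose)
#   if [ -n "$missing" ]; then echo "Please install: $missing" >&2; fi
```

The helper only reports whether the commands exist; it does not verify that the Docker daemon is running or reachable.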

Run all tests

./gradlew clean test

Run all tests with Spark 3.3 and Scala 2.13

./gradlew clean test -Dspark_binary_version=3.3 -Dscala_binary_version=2.13
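
To cover both Scala versions listed under Requirements, the same command can be generated once per version. This is a sketch under the assumption that the two `-D` properties shown above are the only ones that need to vary; the function name `test_matrix_commands` is hypothetical, and it only prints the commands rather than running them.

```shell
# Print one Gradle test command per Scala binary version.
# First argument: Spark binary version; remaining arguments: Scala versions.
# (Hypothetical helper for illustration; nothing is executed here.)
test_matrix_commands() {
  spark_version=$1
  shift
  for scala_version in "$@"; do
    printf './gradlew clean test -Dspark_binary_version=%s -Dscala_binary_version=%s\n' \
      "$spark_version" "$scala_version"
  done
}

# Usage (not run here): print the commands, or pipe them to sh to run them.
#   test_matrix_commands 3.3 2.12 2.13
#   test_matrix_commands 3.3 2.12 2.13 | sh
```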

Run a single test suite

./gradlew test --tests=ConvertDistToLocalWriteSuite

ARM Platform

Developers and users on ARM platforms, e.g. Apple Silicon or Kunpeng chips, may not be able to run the TPC-DS integration tests that use gRPC locally, because the official ClickHouse ARM image does not provide gRPC support.

As a workaround, you can set the environment variable CLICKHOUSE_IMAGE to a custom image that supports gRPC on the ARM platform for testing.

export CLICKHOUSE_IMAGE=pan3793/clickhouse-server:22.5.1-alpine-arm-grpc
./gradlew clean test
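
If the same shell setup is shared between ARM and non-ARM machines, the override can be applied conditionally. The sketch below is an illustration rather than part of the project: the function name `clickhouse_image_for` is hypothetical, and it assumes that `uname -m` reports `arm64` or `aarch64` on ARM hosts, which is common but not guaranteed on every system.

```shell
# Echo the custom gRPC-enabled image for ARM machine names, or nothing
# for any other architecture. The image tag is the one quoted above.
# (Hypothetical helper for illustration; it is not shipped with the project.)
clickhouse_image_for() {
  case "$1" in
    arm64|aarch64)
      printf '%s\n' 'pan3793/clickhouse-server:22.5.1-alpine-arm-grpc'
      ;;
    *)
      ;;
  esac
}

# Usage (not run here): export the override only when an image was selected.
#   image=$(clickhouse_image_for "$(uname -m)")
#   if [ -n "$image" ]; then export CLICKHOUSE_IMAGE="$image"; fi
#   ./gradlew clean test
```

On other architectures the variable stays unset, so the tests use whatever image the project selects by default.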