r-exasol: The EXASOL package for R provides an interface to the EXASOL database.
Stars: ✭ 22 (+69.23%)
virtual-schemas: Entry point repository for the EXASOL Virtual Schemas
Stars: ✭ 24 (+84.62%)
Azure Event Hubs Spark: Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (+976.92%)
hadoop-etl-udfs: The Hadoop ETL UDFs are the main way to load data from Hadoop into EXASOL
Stars: ✭ 17 (+30.77%)
Cuesheet: A framework for writing Spark 2.x applications in a pretty way
Stars: ✭ 86 (+561.54%)
Spark With Python: Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+1053.85%)
Pulsar Spark: When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (+323.08%)
Spark Sklearn: (Deprecated) Scikit-learn integration package for Apache Spark
Stars: ✭ 1,055 (+8015.38%)
Hydrograph: A visual ETL development and debugging tool for big data
Stars: ✭ 144 (+1007.69%)
Spark Tda: SparkTDA is a package for Apache Spark providing Topological Data Analysis functionalities.
Stars: ✭ 45 (+246.15%)
Docker Spark: Apache Spark docker image
Stars: ✭ 1,396 (+10638.46%)
Spark Atlas Connector: A Spark Atlas connector to track data lineage in Apache Atlas
Stars: ✭ 160 (+1130.77%)
Mlflow: Open source platform for the machine learning lifecycle
Stars: ✭ 10,898 (+83730.77%)
Quinn: pyspark methods to enhance developer productivity 📣 👯 🎉
Stars: ✭ 217 (+1569.23%)
Awesome Spark: A curated list of awesome Apache Spark packages and resources.
Stars: ✭ 1,061 (+8061.54%)
Parquetviewer: Simple Windows desktop application for viewing & querying Apache Parquet files
Stars: ✭ 145 (+1015.38%)
Dblink: Distributed Bayesian Entity Resolution in Apache Spark
Stars: ✭ 38 (+192.31%)
Datahacksummit 2017: Apache Zeppelin notebooks for Recommendation Engines using Keras and Machine Learning on Apache Spark
Stars: ✭ 30 (+130.77%)
Sparktorch: Train and run PyTorch models on Apache Spark.
Stars: ✭ 195 (+1400%)
Live log analyzer spark: Spark application for analyzing Apache access logs and detecting anomalies, with an accompanying Medium article.
Stars: ✭ 14 (+7.69%)
Splash: Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (+707.69%)
Whylogs Java: Profile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (+1161.54%)
Pyspark Stubs: Apache (Py)Spark type annotations (stub files).
Stars: ✭ 98 (+653.85%)
Spark Workshop: Apache Spark™ and Scala Workshops
Stars: ✭ 224 (+1623.08%)
Spark States: Custom state store providers for Apache Spark
Stars: ✭ 83 (+538.46%)
Awesome Pulsar: A curated list of Pulsar tools, integrations and resources.
Stars: ✭ 57 (+338.46%)
Data Accelerator: Data Accelerator for Apache Spark simplifies onboarding to streaming of big data. It offers a rich, easy-to-use experience for creating, editing, and managing Spark jobs on Azure HDInsight or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (+1800%)
Sparkit Learn: PySpark + Scikit-learn = Sparkit-learn
Stars: ✭ 1,073 (+8153.85%)
Albedo: A recommender system for discovering GitHub repos, built with Apache Spark
Stars: ✭ 149 (+1046.15%)
Spark Nkp: Natural Korean Processor for Apache Spark
Stars: ✭ 50 (+284.62%)
Sparkrdma: RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (+1553.85%)
Oryx: Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
Stars: ✭ 1,785 (+13630.77%)
keda-connectors: Generic connectors for Keda which can be used as worker images as part of scaleTargetRef.
Stars: ✭ 22 (+69.23%)
Scalable Data Science: Scalable Data Science, course sets in big data using Apache Spark over Databricks and their mathematical, statistical and computational foundations using SageMath.
Stars: ✭ 142 (+992.31%)
Analytics Zoo: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray
Stars: ✭ 2,448 (+18730.77%)
Spark Flamegraph: Easy CPU Profiling for Apache Spark applications
Stars: ✭ 30 (+130.77%)
Pysparkling: A pure Python implementation of Apache Spark's RDD and DStream interfaces.
Stars: ✭ 231 (+1676.92%)
Mobius: C# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+7046.15%)
Spark: .NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+13138.46%)
Goodreads etl pipeline: An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+6000%)
Sparklyr: R interface for Apache Spark
Stars: ✭ 775 (+5861.54%)
Bigdata Playground: A complete example of a big data application using: Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLlib, Apache Flink, Scala, Python, Apache Kafka, Apache HBase, Apache Parquet, Apache Avro, Apache Storm, Twitter API, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (+1261.54%)
Griffon Vm: Griffon Data Science Virtual Machine
Stars: ✭ 128 (+884.62%)
Kafka Storm Starter: Code examples that show how to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
Stars: ✭ 728 (+5500%)
SupermarktConnector: Collecting product information from Dutch supermarkets: Albert Heijn and Jumbo using the Mobile API
Stars: ✭ 91 (+600%)