Sparklens: Qubole Sparklens tool for performance tuning Apache Spark
Stars: ✭ 345 (+9.18%)
KInspector: KInspector is an application for analyzing the health, performance, and security of your Kentico solution.
Stars: ✭ 54 (-82.91%)
basin: Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser.
Stars: ✭ 25 (-92.09%)
Pewpew: Flexible HTTP command line stress tester for websites and web services
Stars: ✭ 269 (-14.87%)
bigkube: Minikube for big data with Scala and Spark
Stars: ✭ 16 (-94.94%)
Elasticluster: Create clusters of VMs on the cloud and configure them with Ansible.
Stars: ✭ 298 (-5.7%)
Big Data Rosetta Code: Code snippets for solving common big data problems on various platforms. Inspired by Rosetta Code.
Stars: ✭ 254 (-19.62%)
Casper: A compiler for automatically re-targeting sequential Java code to Apache Spark.
Stars: ✭ 45 (-85.76%)
Stackimpact Go: DEPRECATED StackImpact Go Profiler - Production-Grade Performance Profiler: CPU, memory allocations, blocking calls, errors, metrics, and more
Stars: ✭ 276 (-12.66%)
daf-kylo: Kylo integration with PDND (previously DAF).
Stars: ✭ 20 (-93.67%)
Zat: Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (-4.11%)
spark-data-sources: Developing Spark External Data Sources using the V2 API
Stars: ✭ 36 (-88.61%)
Helk: The Hunting ELK
Stars: ✭ 3,097 (+880.06%)
IGUANA: IGUANA is a benchmark execution framework for querying HTTP endpoints and CLI applications such as triple stores.
Stars: ✭ 22 (-93.04%)
Learningsparkv2: This is the GitHub repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Stars: ✭ 307 (-2.85%)
blog: blog entries
Stars: ✭ 39 (-87.66%)
Spark Jupyter Aws: A guide on how to set up Jupyter with PySpark painlessly on AWS EC2 clusters, with S3 I/O support
Stars: ✭ 259 (-18.04%)
spark-extension: A library that provides useful extensions to Apache Spark and PySpark.
Stars: ✭ 25 (-92.09%)
Spark Notebook: Interactive and Reactive Data Science using Scala and Spark.
Stars: ✭ 3,081 (+875%)
goanalyzer: Improved go tool trace goroutine analysis
Stars: ✭ 30 (-90.51%)
visions: Type System for Data Analysis in Python
Stars: ✭ 136 (-56.96%)
Hbase Rdd: Spark RDD to read, write and delete from HBase
Stars: ✭ 277 (-12.34%)
spark-http-stream: Spark Structured Streaming via HTTP communication
Stars: ✭ 17 (-94.62%)
Spline: Data Lineage Tracking And Visualization Solution
Stars: ✭ 306 (-3.16%)
powerstation: A Tool for Detecting Performance Bugs in Rails Applications
Stars: ✭ 57 (-81.96%)
Datavec: ETL Library for Machine Learning - data pipelines, data munging and wrangling
Stars: ✭ 272 (-13.92%)
dllib: dllib is a distributed deep learning library running on Apache Spark
Stars: ✭ 32 (-89.87%)
Coolplayspark: Coolplay Spark, with Spark source code analysis, Spark libraries, and more
Stars: ✭ 3,318 (+950%)
Easyloggingpp: Single-header C++ logging library. It is extremely powerful, extendable, light-weight, fast performing, thread and type safe and consists of many built-in features. It provides the ability to write logs in your own customized format. It also provides support for logging your classes, third-party libraries, STL and third-party containers, etc.
Stars: ✭ 3,032 (+859.49%)
prosto: Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (-82.91%)
Awesome Ada: A curated list of awesome resources related to the Ada and SPARK programming languages
Stars: ✭ 299 (-5.38%)
confluent-spark-avro: Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry.
Stars: ✭ 18 (-94.3%)
Docker Spark Cluster: A simple Spark standalone cluster for testing environments
Stars: ✭ 261 (-17.41%)
Covid19Tracker: A Robinhood style COVID-19 🦠 Android tracking app for the US. Open source and built with Kotlin.
Stars: ✭ 65 (-79.43%)
Cook: Fair job scheduler on Kubernetes and Mesos for batch workloads and Spark
Stars: ✭ 314 (-0.63%)
autobench: Benchmark your application on CI
Stars: ✭ 16 (-94.94%)
Sk Dist: Distributed scikit-learn meta-estimators in PySpark
Stars: ✭ 260 (-17.72%)
SparkV: 🤖⚡ | The most POWERFUL multipurpose chat/meme bot that will boost the activity in your server.
Stars: ✭ 24 (-92.41%)
trembita: Model complex data transformation pipelines easily
Stars: ✭ 44 (-86.08%)
Succinct: Enabling queries on compressed data.
Stars: ✭ 257 (-18.67%)
bigdata-fun: A complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-95.57%)
Crayon: Simple framework-agnostic UI router for SPAs
Stars: ✭ 310 (-1.9%)
smolder: HL7 Apache Spark Datasource
Stars: ✭ 33 (-89.56%)
Spark Druid Olap: Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform (http://bit.ly/2oBJSpP), an integrated BI platform on Apache Spark.
Stars: ✭ 282 (-10.76%)
spark-demos: Collection of different demo applications using Apache Spark
Stars: ✭ 15 (-95.25%)
Frontendwingman: Frontend Wingman, learn frontend faster!
Stars: ✭ 315 (-0.32%)
Delta: An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Stars: ✭ 3,903 (+1135.13%)
Cloudflow: Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
Stars: ✭ 278 (-12.03%)