Azure Event Hubs Spark: Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (+833.33%)
Mobius: C# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+6093.33%)
Spark: .NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+11373.33%)
SparkTwitterAnalysis: An Apache Spark standalone application using the Spark API in Scala. The application uses Simple Build Tool (SBT) to build the project.
Stars: ✭ 29 (+93.33%)
spark-utils: Basic framework utilities to quickly start writing production-ready Apache Spark applications
Stars: ✭ 25 (+66.67%)
gan deeplearning4j: Automatic feature engineering with Generative Adversarial Networks, using Deeplearning4j and Apache Spark.
Stars: ✭ 19 (+26.67%)
coolplayflink: Flink, stateful computations over data streams
Stars: ✭ 14 (-6.67%)
Bigdata Playground: A complete example of a big data application using Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLlib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (+1080%)
Data Accelerator: Data Accelerator for Apache Spark simplifies onboarding to streaming of big data. It offers a rich, easy-to-use experience for creating, editing, and managing Spark jobs on Azure HDInsight or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (+1546.67%)
Splash: A flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (+600%)
Clearly: Clearly see and debug your Celery cluster in real time!
Stars: ✭ 287 (+1813.33%)
Spark States: Custom state store providers for Apache Spark
Stars: ✭ 83 (+453.33%)
Sparkrdma: RDMA-accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (+1333.33%)
ExDeMon: A general-purpose metrics monitor implemented with Apache Spark. Kafka source, Elastic sink, aggregate metrics, different analysis, notifications, actions, live configuration update, missing metrics, ...
Stars: ✭ 19 (+26.67%)
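leaflet heatmap: A simple visualization of Huzhou call data. Assuming the data volume is too large to draw the heatmap directly in the browser, the heatmap-drawing step is moved to offline computation and analysis. After the data is computed in parallel with Apache Spark, Apache Spark is used again to draw the heatmap, and leafletjs then loads the OpenStreetMap layer and the heatmap layer for good interactivity. The drawing is currently implemented with Apache Spark; either Apache Spark is not well suited to this kind of computation or I did not design the algorithm well, because the parallel computation is slower than single-machine computation. The Apache Spark heatmap drawing and computation code is at https://github.com/yuanzhaokang/ParallelizeHeatmap.git .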
Stars: ✭ 13 (-13.33%)
Coolplayspark: Cool Play Spark, covering Spark source code analysis, Spark libraries, and more
Stars: ✭ 3,318 (+22020%)
Monstache: A Go daemon that syncs MongoDB to Elasticsearch in real time
Stars: ✭ 736 (+4806.67%)
10 Weeks: 10 weeks of technology exploration
Stars: ✭ 22 (+46.67%)
Vaex: Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualize and explore big tabular data at a billion rows per second 🚀
Stars: ✭ 6,793 (+45186.67%)
Sublert: Sublert is a security and reconnaissance tool that leverages certificate transparency to automatically monitor new subdomains deployed by specific organizations and their issued TLS/SSL certificates.
Stars: ✭ 699 (+4560%)
Spark Movie Lens: An online movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (+4866.67%)
Dotnetify: Simple, lightweight, yet powerful way to build real-time web apps.
Stars: ✭ 927 (+6080%)
Kafka Storm Starter: Code examples that show how to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
Stars: ✭ 728 (+4753.33%)
Three.js Pathtracing Renderer: Real-time PathTracing with global illumination and progressive rendering, all on top of the Three.js WebGL framework. Live demo: https://erichlof.github.io/THREE.js-PathTracing-Renderer/Geometry_Showcase.html
Stars: ✭ 872 (+5713.33%)
Yolact edge: The first competitive instance segmentation approach that runs on small edge devices at real-time speeds.
Stars: ✭ 697 (+4546.67%)
Bandar Log: Monitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 19 (+26.67%)
Live log analyzer spark: Spark application for analyzing Apache access logs and detecting anomalies, with an accompanying Medium article.
Stars: ✭ 14 (-6.67%)
Parallelwavegan: Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with PyTorch
Stars: ✭ 682 (+4446.67%)
Eon: An open-source chart and map framework for realtime data.
Stars: ✭ 875 (+5733.33%)
Hbc: A Java HTTP client for consuming Twitter's realtime Streaming API
Stars: ✭ 898 (+5886.67%)
Hadoop For Geoevent: ArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-66.67%)
Opentracker: Real-time C++ ECO tracker etc., sped up with SSE/NEON; supports Linux, Mac, Jetson TX1/2, Raspberry Pi
Stars: ✭ 619 (+4026.67%)
Tf trt models: TensorFlow models accelerated with NVIDIA TensorRT
Stars: ✭ 621 (+4040%)
Clusterws: 💥 Lightweight, fast and powerful framework for building scalable WebSocket applications in Node.js
Stars: ✭ 868 (+5686.67%)
Angularfire: The official Angular library for Firebase.
Stars: ✭ 7,029 (+46760%)
Dist Keras: Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
Stars: ✭ 613 (+3986.67%)
Supabase: The open source Firebase alternative. Follow to stay updated about our public Beta.
Stars: ✭ 25,142 (+167513.33%)
Event Reduce: An algorithm to optimize database queries that run multiple times
Stars: ✭ 589 (+3826.67%)
Flintrock: A command-line tool for launching Apache Spark clusters.
Stars: ✭ 568 (+3686.67%)
Quadray Engine: Realtime raytracer using SIMD on ARM, MIPS, PPC and x86
Stars: ✭ 13 (-13.33%)
Wormhole: Wormhole is a SPaaS (Stream Processing as a Service) Platform
Stars: ✭ 863 (+5653.33%)
Bigdataguide: Learn big data from scratch; includes study videos for each stage of big data learning and interview materials
Stars: ✭ 817 (+5346.67%)
Bigartm: Fast topic modeling platform
Stars: ✭ 563 (+3653.33%)
Alphapose: Real-Time and Accurate Full-Body Multi-Person Pose Estimation & Tracking System
Stars: ✭ 5,697 (+37880%)
Fluidsynth: Software synthesizer based on the SoundFont 2 specifications
Stars: ✭ 811 (+5306.67%)
Pay: Personal website instant payment solution
Stars: ✭ 558 (+3620%)
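Bigdata Interview: 🎯 🌟 [Big data interview questions] A collection of big-data interview questions gathered online, together with the author's own answer summaries. Currently covers interview-question summaries for the Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper frameworks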
Stars: ✭ 857 (+5613.33%)
Kube Batch: A batch scheduler for Kubernetes for high-performance workloads, e.g. AI/ML, BigData, HPC
Stars: ✭ 804 (+5260%)
Openscoring: REST web service for the true real-time scoring (<1 ms) of Scikit-Learn, R and Apache Spark models
Stars: ✭ 536 (+3473.33%)
Ttyplot: A realtime plotting utility for terminal/console with data input from stdin
Stars: ✭ 532 (+3446.67%)