Azure Event Hubs Spark: Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-21.79%)
yuzhouwan: Code Library for My Blog
Stars: ✭ 39 (-78.21%)
Springboot Templates: Spring Boot integration with Dubbo and Netty; NoSQL templates for Redis and MongoDB; MQ templates for Kafka, RocketMQ, and RabbitMQ; query engines with Solr, SolrCloud, and Elasticsearch
Stars: ✭ 100 (-44.13%)
Waterdrop: Production Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+936.87%)
Python Bigdata: Data science and Big Data with Python
Stars: ✭ 112 (-37.43%)
Nagios Plugins: 450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...
Stars: ✭ 1,000 (+458.66%)
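Spring Boot Quick: 🌿 Quick-start learning examples based on Spring Boot, integrating open-source frameworks the author has encountered, such as RabbitMQ (delay queues), Kafka, JPA, Redis, OAuth2, Swagger, JSP, Docker, spring-batch, exception handling, log output, multi-module development, multi-environment packaging, caching, web crawlers, JWT, GraphQL, Dubbo, ZooKeeper, Async, and more 📌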
Stars: ✭ 1,819 (+916.2%)
Hadoopcryptoledger: Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Stars: ✭ 126 (-29.61%)
Gaffer: A large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (+817.32%)
Abris: Avro SerDe for Apache Spark structured APIs.
Stars: ✭ 130 (-27.37%)
Bigdata Playground: A complete example of a big data application using: Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLlib, Apache Flink, Scala, Python, Apache Kafka, Apache HBase, Apache Parquet, Apache Avro, Apache Storm, Twitter API, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (-1.12%)
Registry: Schema Registry
Stars: ✭ 184 (+2.79%)
Seldon Server: Machine Learning Platform and Recommendation Engine built on Kubernetes
Stars: ✭ 1,435 (+701.68%)
Data Accelerator: Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy-to-use experience to help with creation, editing, and management of Spark jobs on Azure HDInsight or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (+37.99%)
Spiderman: A general-purpose distributed crawler framework based on scrapy-redis
Stars: ✭ 392 (+118.99%)
Hadoop cookbook: Cookbook to install Hadoop 2.0+ using Chef
Stars: ✭ 82 (-54.19%)
Streamline: StreamLine - Streaming Analytics
Stars: ✭ 151 (-15.64%)
Benchm Ml: A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
Stars: ✭ 1,835 (+925.14%)
Aztk: AZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure
Stars: ✭ 152 (-15.08%)
Mirus: Mirus is a cross data-center data replication tool for Apache Kafka
Stars: ✭ 171 (-4.47%)
Whylogs Java: Profile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (-8.38%)
Spark With Python: Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-16.2%)
Cc Pyspark: Process Common Crawl data with Python and Spark
Stars: ✭ 147 (-17.88%)
Phpkafka: A PHP Kafka client for use in PHP-FPM and Swoole. It supports 50 APIs, which may make it the client that supports the most message types.
Stars: ✭ 149 (-16.76%)
Fero: light, fast, scalable, streaming microservices made easy
Stars: ✭ 175 (-2.23%)
Deeplearning4j: Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…
Stars: ✭ 12,277 (+6758.66%)
Dcos Commons: DC/OS SDK is a collection of tools, libraries, and documentation for easy integration of technologies such as Kafka, Cassandra, HDFS, Spark, and TensorFlow with DC/OS.
Stars: ✭ 162 (-9.5%)
Kafka Eagle: An easy and high-performance monitoring system for comprehensive monitoring and management of Kafka clusters.
Stars: ✭ 2,240 (+1151.4%)
Kcli: A Kafka command-line browser
Stars: ✭ 148 (-17.32%)
Linkis: Linkis helps easily connect to various back-end computation/storage engines (Spark, Python, TiDB...), exposes various interfaces (REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+1197.77%)
A Kafka Story: Kafka ecosystem ... but step by step!
Stars: ✭ 148 (-17.32%)
Azkarra Streams: 🚀 Azkarra is a lightweight Java framework to make it easy to develop, deploy and manage cloud-native streaming microservices based on Apache Kafka Streams.
Stars: ✭ 146 (-18.44%)
Kop: Kafka-on-Pulsar - A protocol handler that brings native Kafka protocol to Apache Pulsar
Stars: ✭ 159 (-11.17%)
Kafkajs: A modern Apache Kafka client for node.js
Stars: ✭ 2,315 (+1193.3%)
Camellia: camellia framework by netease-im. provider: 1) redis-client; 2) redis-proxy (redis-sentinel/redis-cluster); 3) hbase-client; 4) others
Stars: ✭ 146 (-18.44%)
Vue Info Card: Simple and beautiful card component with an elegant spark line, for VueJS.
Stars: ✭ 159 (-11.17%)
Datacompy: Pandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (-17.88%)
Kraps Rpc: An RPC framework leveraging Spark RPC module
Stars: ✭ 175 (-2.23%)
Transmogrifai: TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (+1064.25%)
Event Sourcing Jambo: A Hexagonal Architecture with DDD + Aggregates + Event Sourcing using .NET Core, Kafka and MongoDB (Blog Engine)
Stars: ✭ 159 (-11.17%)
Supersafebank: Sample Event Sourcing implementation with .NET Core
Stars: ✭ 142 (-20.67%)
Glow: An open-source toolkit for large-scale genomic analysis
Stars: ✭ 159 (-11.17%)
Awesome Kafka: Everything about Apache Kafka
Stars: ✭ 144 (-19.55%)
Oryx: Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
Stars: ✭ 1,785 (+897.21%)
Handyspark: HandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (-11.73%)