LeharVisualize data using relative ordering
Stars: ✭ 81 (-53.71%)
GlowAn open-source toolkit for large-scale genomic analysis
Stars: ✭ 159 (-9.14%)
KcliA kafka command line browser
Stars: ✭ 148 (-15.43%)
Scala SamplesThere are pieces of scala code that explain Scala syntax and related things - like what you can do with all this
Stars: ✭ 125 (-28.57%)
Spark GbtlrHybrid model of Gradient Boosting Trees and Logistic Regression (GBDT+LR) on Spark
Stars: ✭ 81 (-53.71%)
KsppA high performance/ real-time C++ Kafka streams framework (C++17)
Stars: ✭ 80 (-54.29%)
LabsResearch on distributed system
Stars: ✭ 73 (-58.29%)
Kafka Zk RestapiKafka Zookeeper RESTful API to perform topic/consumer group administration/metric(offset\lag\message) collection and monitor
Stars: ✭ 121 (-30.86%)
Luigi WarehouseA luigi powered analytics / warehouse stack
Stars: ✭ 72 (-58.86%)
DatacompyPandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (-16%)
DeequDeequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Stars: ✭ 2,020 (+1054.29%)
Kafka Elasticsearch InjectorGolang app to read records from a set of kafka topics and write them to an elasticsearch cluster
Stars: ✭ 70 (-60%)
Queue本人的RabbitMQ和Kafka详细笔记以及示例代码
Stars: ✭ 158 (-9.71%)
BurrowuiThis is a NodeJS/Angular 2 frontend UI for Kafka cluster monitoring with Burrow
Stars: ✭ 69 (-60.57%)
Distributed frameworkpython通用分布式函数调度框架 pip install function_scheduling_distributed_framework
Stars: ✭ 123 (-29.71%)
KontextfreiWriting application logic for Spark jobs that can be unit-tested without a SparkContext
Stars: ✭ 67 (-61.71%)
Pqa command-line Protobuf parser with Kafka support and JSON output
Stars: ✭ 120 (-31.43%)
Community一个仿照牛客网实现的讨论社区,不仅实现了基本的注册,登录,发帖,评论,点赞,回复功能,同时使用前缀树实现敏感词过滤,使用wkhtmltopdf生成长图和pdf,实现网站UV和DAU统计,并将用户头像等信息存于七牛云服务器。
Stars: ✭ 80 (-54.29%)
RsparklingRSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-62.86%)
TeddySpark Streaming监控平台,支持任务部署与告警、自启动
Stars: ✭ 120 (-31.43%)
Awesome KafkaEverything about Apache Kafka
Stars: ✭ 144 (-17.71%)
ElassandraElassandra = Elasticsearch + Apache Cassandra
Stars: ✭ 1,610 (+820%)
Springwolf CoreAutomated documentation for async APIs built with Spring Boot
Stars: ✭ 63 (-64%)
SephsplaceMy own version of r/place, done in a weekend
Stars: ✭ 119 (-32%)
Camel Kafka ConnectorCamel Kafka Connector allows you to use all Camel components as Kafka Connect connectors
Stars: ✭ 63 (-64%)
OryxOryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
Stars: ✭ 1,785 (+920%)
Silexsomething to help you spark
Stars: ✭ 61 (-65.14%)
IbisA pandas-like deferred expression system, with first-class SQL support
Stars: ✭ 1,630 (+831.43%)
Pg2kafkaShip changes in Postgres 🐘 to Kafka 📖
Stars: ✭ 61 (-65.14%)
Spark NlpState of the Art Natural Language Processing
Stars: ✭ 2,518 (+1338.86%)
KaramelKafka Browser that supports standalone Kafka and Strimzi operator
Stars: ✭ 61 (-65.14%)
Awesome KafkaA collection of kafka-resources
Stars: ✭ 116 (-33.71%)
Zemberek Nlp ServerZemberek Türkçe NLP Java Kütüphanesi üzerine REST Docker Sunucu
Stars: ✭ 60 (-65.71%)
Nd4jFast, Scientific and Numerical Computing for the JVM (NDArrays)
Stars: ✭ 1,742 (+895.43%)
Pyspark ExamplesCode examples on Apache Spark using python
Stars: ✭ 58 (-66.86%)
CmakCMAK is a tool for managing Apache Kafka clusters
Stars: ✭ 10,544 (+5925.14%)
GeniA Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-13.14%)
A Kafka StoryKafka ecosystem ... but step by step!
Stars: ✭ 148 (-15.43%)
KarafkaFramework for Apache Kafka based Ruby and Rails applications development.
Stars: ✭ 1,223 (+598.86%)
Awesome PulsarA curated list of Pulsar tools, integrations and resources.
Stars: ✭ 57 (-67.43%)
RasterframesGeospatial Raster support for Spark DataFrames
Stars: ✭ 142 (-18.86%)
KafkawizeKafkawize : A Self service Apache Kafka Topic Management tool/portal. A Web application which automates the process of creating and browsing Kafka topics, acls, schemas by introducing roles/authorizations to users of various teams of an org.
Stars: ✭ 79 (-54.86%)
Apiproject[https://www.sofineday.com], golang项目开发脚手架,集成最佳实践(gin+gorm+go-redis+mongo+cors+jwt+json日志库zap(支持日志收集到kafka或mongo)+消息队列kafka+微信支付宝支付gopay+api加密+api反向代理+go modules依赖管理+headless爬虫chromedp+makefile+二进制压缩+livereload热加载)
Stars: ✭ 124 (-29.14%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-54.86%)
Docker Spark🚢 Docker image for Apache Spark
Stars: ✭ 78 (-55.43%)
Azkarra Streams🚀 Azkarra is a lightweight java framework to make it easy to develop, deploy and manage cloud-native streaming microservices based on Apache Kafka Streams.
Stars: ✭ 146 (-16.57%)
Spark Infotheoretic Feature SelectionThis package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is based on the common theoretic framework presented by Gavin Brown. Implementations of mRMR, InfoGain, JMI and other commonly used FS filters are provided.
Stars: ✭ 123 (-29.71%)