Rumble⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (-67.23%)
Hazelcast JetDistributed Stream and Batch Processing
Stars: ✭ 855 (+383.05%)
AtsdAxibase Time Series Database Documentation
Stars: ✭ 68 (-61.58%)
Spark StatesCustom state store providers for Apache Spark
Stars: ✭ 83 (-53.11%)
CamusMirror of Linkedin's Camus
Stars: ✭ 81 (-54.24%)
Kaufmann exKafka backed service library.
Stars: ✭ 86 (-51.41%)
Wifi基于wifi抓取信息的大数据查询分析系统
Stars: ✭ 93 (-47.46%)
MoosefsMooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)
Stars: ✭ 1,025 (+479.1%)
Communitya community based on Node.js
Stars: ✭ 44 (-75.14%)
Awesome Recommendation EngineThe purpose of this tiny project is to put things together with the know how that i learned from the course big data expert from formacionhadoop.com The idea is to show how to play with apache spark streaming, kafka,mongo, spark machine learning algorithms.
Stars: ✭ 47 (-73.45%)
Streamxkafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)
Stars: ✭ 96 (-45.76%)
LogislandScalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-45.2%)
Gcs ToolsGCS support for avro-tools, parquet-tools and protobuf
Stars: ✭ 57 (-67.8%)
Event Sourcing CastanhaAn Event Sourcing service template with DDD, TDD and SOLID. It has High Cohesion and Loose Coupling, it's a good start for your next Microservice application.
Stars: ✭ 68 (-61.58%)
Zaneperfor前端性能监控系统,消息队列,高可用,集群等相关架构
Stars: ✭ 1,085 (+512.99%)
Hadoop cookbookCookbook to install Hadoop 2.0+ using Chef
Stars: ✭ 82 (-53.67%)
Bigdata File ViewerA cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
Stars: ✭ 86 (-51.41%)
Wertik Js💪 A library that powers your app with GraphQL + Rest API
Stars: ✭ 56 (-68.36%)
UnchainedHeadless & open-source e-commerce toolkit. The Unchained Engine is our core product and is written in Node.js ES6
Stars: ✭ 92 (-48.02%)
SchemerSchema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Stars: ✭ 97 (-45.2%)
Parquet MrApache Parquet
Stars: ✭ 1,278 (+622.03%)
Pmacctpmacct is a small set of multi-purpose passive network monitoring tools [NetFlow IPFIX sFlow libpcap BGP BMP RPKI IGP Streaming Telemetry].
Stars: ✭ 677 (+282.49%)
SpringbootSpringBoot 整合各类框架和应用
Stars: ✭ 54 (-69.49%)
AntsdbAntsDB is a low latency, high concurrency, MySQL compliant SQL layer for HBase
Stars: ✭ 99 (-44.07%)
Production Ready Expressjs ServerExpress.js server that implements production-ready error handling and logging following latest best practices.
Stars: ✭ 101 (-42.94%)
Rsysloga Rocket-fast SYStem for LOG processing
Stars: ✭ 1,385 (+682.49%)
Schema RegistryConfluent Schema Registry for Kafka
Stars: ✭ 1,647 (+830.51%)
Avro Hadoop StarterExample MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
Stars: ✭ 110 (-37.85%)
Parquet GoGo package to read and write parquet files. parquet is a file format to store nested data structures in a flat columnar data format. It can be used in the Hadoop ecosystem and with tools such as Presto and AWS Athena.
Stars: ✭ 114 (-35.59%)
WaterdropProduction Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+948.59%)
Kkbinlog支持mysql、MongoDB数据变更订阅分发
Stars: ✭ 112 (-36.72%)
AsakusafwAsakusa Framework
Stars: ✭ 114 (-35.59%)
Graphql Nodejs Hapi ApiHow to set-up a powerful API with Nodejs, GraphQL, MongoDB, Hapi, and Swagger
Stars: ✭ 116 (-34.46%)
CmakCMAK is a tool for managing Apache Kafka clusters
Stars: ✭ 10,544 (+5857.06%)
Haproxy Configs80+ HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Elasticsearch, SolrCloud, HBase, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, Hue, ZooKeeper, SSH, RabbitMQ, Redis, Riak, Cloudera, OpenTSDB, InfluxDB, Prometheus, Kibana, Graphite, Rancher etc.
Stars: ✭ 106 (-40.11%)
Amazon S3 Find And ForgetAmazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
Stars: ✭ 115 (-35.03%)
Hdfs ShellHDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS
Stars: ✭ 117 (-33.9%)
Parquet4sRead and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
Stars: ✭ 125 (-29.38%)
Scrapy demoall kinds of scrapy demo
Stars: ✭ 128 (-27.68%)
Spark.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+872.32%)
Frisky🍿 Open Source GraphQL API for Online Shows
Stars: ✭ 161 (-9.04%)
Flink Learningflink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
Stars: ✭ 11,378 (+6328.25%)
SlimmessagebusLightweight message bus interface for .NET (pub/sub and request-response) with transport plugins for popular message brokers.
Stars: ✭ 120 (-32.2%)
AbrisAvro SerDe for Apache Spark structured APIs.
Stars: ✭ 130 (-26.55%)
Space CloudOpen source Firebase + Heroku to develop, scale and secure serverless apps on Kubernetes
Stars: ✭ 3,323 (+1777.4%)
HydrographA visual ETL development and debugging tool for big data
Stars: ✭ 144 (-18.64%)
My MomentsInstagram Clone - Cloning Instagram for learning purpose
Stars: ✭ 140 (-20.9%)