Flink Learningflink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
Stars: ✭ 11,378 (+47308.33%)
LogislandScalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (+304.17%)
influxdb-php-sdkInfluxDB PHP SDK - UDP/IP or HTTP adapters for read and write data
Stars: ✭ 88 (+266.67%)
SparklyrR interface for Apache Spark
Stars: ✭ 775 (+3129.17%)
Tapirtapir, or Typed API descRiptions
Stars: ✭ 677 (+2720.83%)
FreestyleA cohesive & pragmatic framework of FP centric Scala libraries
Stars: ✭ 627 (+2512.5%)
Sparkling WaterSparkling Water provides H2O functionality inside Spark cluster
Stars: ✭ 887 (+3595.83%)
Spark Movie LensAn on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (+3004.17%)
ZeppelinWeb-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Stars: ✭ 5,513 (+22870.83%)
Awesome InfluxdbA curated list of awesome projects, libraries, tools, etc. related to InfluxDB
Stars: ✭ 686 (+2758.33%)
Spark RedisA connector for Spark that allows reading and writing to/from Redis cluster
Stars: ✭ 773 (+3120.83%)
CryptofeedCryptocurrency Exchange Websocket Data Feed Handler
Stars: ✭ 643 (+2579.17%)
LibnetA portable framework for low-level network packet construction
Stars: ✭ 640 (+2566.67%)
Stream ReactorStreaming reference architecture for ETL with Kafka and Kafka-Connect. You can find more on http://lenses.io on how we provide a unified solution to manage your connectors, most advanced SQL engine for Kafka and Kafka Streams, cluster monitoring and alerting, and more.
Stars: ✭ 753 (+3037.5%)
H2o 3H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+23466.67%)
Cdhprojecthadoop各组件使用,持续更新
Stars: ✭ 733 (+2954.17%)
Mongo SparkThe MongoDB Spark Connector
Stars: ✭ 588 (+2350%)
AlluxioAlluxio, data orchestration for analytics and machine learning in the cloud
Stars: ✭ 5,379 (+22312.5%)
Spark DariaEssential Spark extensions and helper methods ✨😲
Stars: ✭ 553 (+2204.17%)
LsquicLiteSpeed QUIC and HTTP/3 Library
Stars: ✭ 727 (+2929.17%)
Impulse💣 Impulse Denial-of-service ToolKit
Stars: ✭ 538 (+2141.67%)
ScriptisScriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.
Stars: ✭ 696 (+2800%)
Goodreads etl pipelineAn end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+3204.17%)
MmposeOpenMMLab Pose Estimation Toolbox and Benchmark.
Stars: ✭ 674 (+2708.33%)
OnboardingA list of resources we at flyeralarm use to get new developers up and running
Stars: ✭ 648 (+2600%)
ParallecFast Parallel Async HTTP/SSH/TCP/UDP/Ping Client Java Library. Aggregate 100,000 APIs & send anywhere in 20 lines of code. Ping/HTTP Calls 8000 servers in 12 seconds. (Akka) www.parallec.io
Stars: ✭ 777 (+3137.5%)
KyloKylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.
Stars: ✭ 916 (+3716.67%)
DhcpwnAll your IPs are belong to us.
Stars: ✭ 642 (+2575%)
AngelA Flexible and Powerful Parameter Server for large-scale machine learning
Stars: ✭ 6,458 (+26808.33%)
Pyspark Example ProjectExample project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+2537.5%)
Dev SetupmacOS development environment setup: Easy-to-understand instructions with automated setup scripts for developer tools like Vim, Sublime Text, Bash, iTerm, Python data analysis, Spark, Hadoop MapReduce, AWS, Heroku, JavaScript web development, Android development, common data stores, and dev-based OS X defaults.
Stars: ✭ 5,590 (+23191.67%)
Coding Now学习记录的一些笔记,以及所看得一些电子书eBooks、视频资源和平常收纳的一些自己认为比较好的博客、网站、工具。涉及大数据几大组件、Python机器学习和数据分析、Linux、操作系统、算法、网络等
Stars: ✭ 750 (+3025%)
DatafusionDataFusion has now been donated to the Apache Arrow project
Stars: ✭ 611 (+2445.83%)
ScaleAnother example of a REST API with Akka HTTP
Stars: ✭ 23 (-4.17%)
Ngtcp2ngtcp2 project is an effort to implement IETF QUIC protocol
Stars: ✭ 589 (+2354.17%)
SparkctrCTR prediction model based on spark(LR, GBDT, DNN)
Stars: ✭ 740 (+2983.33%)
BlinksocksA framework for building composable proxy protocol stack.
Stars: ✭ 587 (+2345.83%)
VarkenStandalone application to aggregate data from the Plex ecosystem into InfluxDB using Grafana for a frontend
Stars: ✭ 829 (+3354.17%)
SparklearningLearning Apache spark,including code and data .Most part can run local.
Stars: ✭ 558 (+2225%)
Kafka Storm StarterCode examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
Stars: ✭ 728 (+2933.33%)
TsbsTime Series Benchmark Suite, a tool for comparing and evaluating databases for time series data
Stars: ✭ 545 (+2170.83%)
Es Cqrs Shopping CartA resilient and scalable shopping cart system designed using Event Sourcing (ES) and Command Query Responsibility Segregation (CQRS)
Stars: ✭ 19 (-20.83%)
FramelessExpressive types for Spark.
Stars: ✭ 717 (+2887.5%)
LeafA lightweight and fast proxy utility tries to include any useful features.
Stars: ✭ 530 (+2108.33%)
JustenoughscalaforsparkA tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.
Stars: ✭ 538 (+2141.67%)
Akka Http JsonIntegrate some of the best JSON libs in Scala with Akka HTTP
Stars: ✭ 530 (+2108.33%)
Node Influx📈 The InfluxDB Client for Node.js and Browsers
Stars: ✭ 820 (+3316.67%)
HailScalable genomic data analysis.
Stars: ✭ 706 (+2841.67%)
LaminarA simple semi-reliable UDP protocol for multiplayer games
Stars: ✭ 530 (+2108.33%)
LopqTraining of Locally Optimized Product Quantization (LOPQ) models for approximate nearest neighbor search of high dimensional data in Python and Spark.
Stars: ✭ 530 (+2108.33%)