Pysparkgeoanalysis🌐 Interactive Workshop on GeoAnalysis using PySpark
Stars: ✭ 63 (-70.83%)
Delta ArchitectureStreaming data changes to a Data Lake with Debezium and Delta Lake pipeline
Stars: ✭ 43 (-80.09%)
Elasticsearch JdbcA elasticsearch specified SQL interface on Java, no need to tweak your es instance.
Stars: ✭ 41 (-81.02%)
Utils4sscala、spark使用过程中,各种测试用例以及相关资料整理
Stars: ✭ 1,070 (+395.37%)
HeraclesHigh performance HBase / Spark SQL engine
Stars: ✭ 27 (-87.5%)
Pyspark ExamplesCode examples on Apache Spark using python
Stars: ✭ 58 (-73.15%)
RsparklingRSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-69.91%)
W2vWord2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-70.37%)
ThingsboardOpen-source IoT Platform - Device management, data collection, processing and visualization.
Stars: ✭ 10,526 (+4773.15%)
RegistrySchema Registry
Stars: ✭ 184 (-14.81%)
EbeanEbean ORM
Stars: ✭ 1,172 (+442.59%)
LabsResearch on distributed system
Stars: ✭ 73 (-66.2%)
Project FortisRepository for all parts of the Fortis architecture
Stars: ✭ 27 (-87.5%)
Kafka Elasticsearch InjectorGolang app to read records from a set of kafka topics and write them to an elasticsearch cluster
Stars: ✭ 70 (-67.59%)
Spark PracticeApache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (-7.41%)
Hadoop cookbookCookbook to install Hadoop 2.0+ using Chef
Stars: ✭ 82 (-62.04%)
Spark StatesCustom state store providers for Apache Spark
Stars: ✭ 83 (-61.57%)
Community一个仿照牛客网实现的讨论社区,不仅实现了基本的注册,登录,发帖,评论,点赞,回复功能,同时使用前缀树实现敏感词过滤,使用wkhtmltopdf生成长图和pdf,实现网站UV和DAU统计,并将用户头像等信息存于七牛云服务器。
Stars: ✭ 80 (-62.96%)
Bitcoin Value Predictor[NOT MAINTAINED] Predicting Bit coin price using Time series analysis and sentiment analysis of tweets on bitcoin
Stars: ✭ 91 (-57.87%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-63.43%)
Streamxkafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)
Stars: ✭ 96 (-55.56%)
WhatsmarsJava生态研究(Spring Boot + Redis + Dubbo + RocketMQ + Elasticsearch)🔥🔥🔥🔥🔥
Stars: ✭ 1,389 (+543.06%)
Seldon ServerMachine Learning Platform and Recommendation Engine built on Kubernetes
Stars: ✭ 1,435 (+564.35%)
Rsysloga Rocket-fast SYStem for LOG processing
Stars: ✭ 1,385 (+541.2%)
Example SparkSpark, Spark Streaming and Spark SQL unit testing strategies
Stars: ✭ 205 (-5.09%)
Amazonriveramazonriver 是一个将postgresql的实时数据同步到es或kafka的服务
Stars: ✭ 198 (-8.33%)
Pyspark Cheatsheet🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-50%)
BigdataclassTwo-day workshop that covers how to use R to interact databases and Spark
Stars: ✭ 110 (-49.07%)
Python BigdataData science and Big Data with Python
Stars: ✭ 112 (-48.15%)
HnswlibJava library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (-50%)
WaterdropProduction Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+759.26%)
SparkrdmaRDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (-0.46%)
Kinesis SqlKinesis Connector for Structured Streaming
Stars: ✭ 120 (-44.44%)
TunnelPG数据同步工具(Java实现)
Stars: ✭ 122 (-43.52%)
Docker BroBro IDS Dockerfile
Stars: ✭ 126 (-41.67%)
Griffon VmGriffon Data Science Virtual Machine
Stars: ✭ 128 (-40.74%)
FeastFeature Store for Machine Learning
Stars: ✭ 2,576 (+1092.59%)
DrillApache Drill is a distributed MPP query layer for self describing data
Stars: ✭ 1,619 (+649.54%)
OpenubaA robust, and flexible open source User & Entity Behavior Analytics (UEBA) framework used for Security Analytics. Developed with luv by Data Scientists & Security Analysts from the Cyber Security Industry. [PRE-ALPHA]
Stars: ✭ 127 (-41.2%)
Spark.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+696.76%)
SamsaraSamsara is a real-time analytics platform
Stars: ✭ 132 (-38.89%)
Echo🦄 开源社区系统:基于 SpringBoot + MyBatis + MySQL + Redis + Kafka + Elasticsearch + Spring Security + ... 并提供详细的开发文档和配套教程。包含帖子、评论、私信、系统通知、点赞、关注、搜索、用户设置、数据统计等模块。
Stars: ✭ 129 (-40.28%)
Eel SdkBig Data Toolkit for the JVM
Stars: ✭ 140 (-35.19%)
My MomentsInstagram Clone - Cloning Instagram for learning purpose
Stars: ✭ 140 (-35.19%)
CmakCMAK is a tool for managing Apache Kafka clusters
Stars: ✭ 10,544 (+4781.48%)
AbrisAvro SerDe for Apache Spark structured APIs.
Stars: ✭ 130 (-39.81%)
Sparkling GraphSparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.
Stars: ✭ 139 (-35.65%)