Movie recommend基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
Stars: ✭ 2,092 (+2250.56%)
Pyspark ExamplesCode examples on Apache Spark using python
Stars: ✭ 58 (-34.83%)
bandar-logMonitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 20 (-77.53%)
CdapAn open source framework for building data analytic applications.
Stars: ✭ 509 (+471.91%)
Kinesis SqlKinesis Connector for Structured Streaming
Stars: ✭ 120 (+34.83%)
NYC Taxi PipelineDesign/Implement stream/batch architecture on NYC taxi data | #DE
Stars: ✭ 16 (-82.02%)
Bigdata PlaygroundA complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (+98.88%)
Project FortisRepository for all parts of the Fortis architecture
Stars: ✭ 27 (-69.66%)
Spark.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+1833.71%)
SylphStream computing platform for bigdata
Stars: ✭ 362 (+306.74%)
RegistrySchema Registry
Stars: ✭ 184 (+106.74%)
WaterdropProduction Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+1985.39%)
spark-utilsBasic framework utilities to quickly start writing production ready Apache Spark applications
Stars: ✭ 25 (-71.91%)
xxhadoopData Analysis Using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. This is My Daily Notes/Code/Demo. Don't fork, Just star !
Stars: ✭ 37 (-58.43%)
interview-refresh-java-bigdataa one-stop repo to lookup for code snippets of core java concepts, sql, data structures as well as big data. It also consists of interview questions asked in real-life.
Stars: ✭ 25 (-71.91%)
WormholeWormhole is a SPaaS (Stream Processing as a Service) Platform
Stars: ✭ 863 (+869.66%)
Tweet-Analysis-With-Kafka-and-SparkA real time analytics dashboard to analyze the trending hashtags and @ mentions at any location using kafka and spark streaming.
Stars: ✭ 18 (-79.78%)
OptikeyOptiKey - Full computer control and speech with your eyes
Stars: ✭ 3,906 (+4288.76%)
AngelA Flexible and Powerful Parameter Server for large-scale machine learning
Stars: ✭ 6,458 (+7156.18%)
Azure Event Hubs SparkEnabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (+57.3%)
SpartaReal Time Analytics and Data Pipelines based on Spark Streaming
Stars: ✭ 513 (+476.4%)
Example SparkSpark, Spark Streaming and Spark SQL unit testing strategies
Stars: ✭ 205 (+130.34%)
LearningsparkScala examples for learning to use Spark
Stars: ✭ 421 (+373.03%)
Coolplayspark酷玩 Spark: Spark 源代码解析、Spark 类库等
Stars: ✭ 3,318 (+3628.09%)
ExDeMonA general purpose metrics monitor implemented with Apache Spark. Kafka source, Elastic sink, aggregate metrics, different analysis, notifications, actions, live configuration update, missing metrics, ...
Stars: ✭ 19 (-78.65%)
litemall-dw基于开源Litemall电商项目的大数据项目,包含前端埋点(openresty+lua)、后端埋点;数据仓库(五层)、实时计算和用户画像。大数据平台采用CDH6.3.2(已使用vagrant+ansible脚本化),同时也包含了Azkaban的workflow。
Stars: ✭ 36 (-59.55%)
Spark StatesCustom state store providers for Apache Spark
Stars: ✭ 83 (-6.74%)
cassandra.realtimeDifferent ways to process data into Cassandra in realtime with technologies such as Kafka, Spark, Akka, Flink
Stars: ✭ 25 (-71.91%)
waspWASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you.
Stars: ✭ 19 (-78.65%)
Utils4sscala、spark使用过程中,各种测试用例以及相关资料整理
Stars: ✭ 1,070 (+1102.25%)
ScramjetSimple yet powerful live data computation framework
Stars: ✭ 171 (+92.13%)
Data AcceleratorData Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (+177.53%)
AsusSMCA VirtualSMC plugin provides native macOS support for ALS, keyboard backlight and Fn keys on Asus laptops
Stars: ✭ 151 (+69.66%)
T-WatchReal Time Twitter Sentiment Analysis Product
Stars: ✭ 20 (-77.53%)
StreamlineStreamLine - Streaming Analytics
Stars: ✭ 151 (+69.66%)
MobiusC# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+943.82%)
Machine-LearningExamples of all Machine Learning Algorithm in Apache Spark
Stars: ✭ 15 (-83.15%)
fdp-modelserverAn umbrella project for multiple implementations of model serving
Stars: ✭ 47 (-47.19%)
GimelBig Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (+142.7%)
Bandar LogMonitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 19 (-78.65%)