All Projects → Cdap → Similar Projects or Alternatives

1654 Open source projects that are alternatives of or similar to Cdap

Mobius
C# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+82.51%)
Mutual labels:  dataset, spark, spark-streaming, mapreduce
Thingsboard
Open-source IoT Platform - Device management, data collection, processing and visualization.
Stars: ✭ 10,526 (+1967.98%)
Mutual labels:  platform, middleware, spark
Sparta
Real Time Analytics and Data Pipelines based on Spark Streaming
Stars: ✭ 513 (+0.79%)
Mutual labels:  spark, spark-streaming
Spark States
Custom state store providers for Apache Spark
Stars: ✭ 83 (-83.69%)
Mutual labels:  spark, spark-streaming
Bigdata Notes
大数据入门指南 ⭐
Stars: ✭ 10,991 (+2059.33%)
Mutual labels:  spark, mapreduce
Dpark
Python clone of Spark, a MapReduce alike framework in Python
Stars: ✭ 2,668 (+424.17%)
Mutual labels:  spark, mapreduce
data-algorithms-with-spark
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
Stars: ✭ 34 (-93.32%)
Mutual labels:  spark, mapreduce
Learning Spark
零基础学习spark,大数据学习
Stars: ✭ 37 (-92.73%)
Mutual labels:  spark, spark-streaming
Real Time Stream Processing Engine
This is an example of real time stream processing using Spark Streaming, Kafka & Elasticsearch.
Stars: ✭ 37 (-92.73%)
Mutual labels:  spark, spark-streaming
Example Spark
Spark, Spark Streaming and Spark SQL unit testing strategies
Stars: ✭ 205 (-59.72%)
Mutual labels:  spark, spark-streaming
Data Accelerator
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (-51.47%)
Mutual labels:  spark, spark-streaming
Tedsds
Apache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Stars: ✭ 14 (-97.25%)
Mutual labels:  dataset, spark
Spark Streaming With Kafka
Self-contained examples of Apache Spark streaming integrated with Apache Kafka.
Stars: ✭ 180 (-64.64%)
Mutual labels:  spark, spark-streaming
Kinesis Sql
Kinesis Connector for Structured Streaming
Stars: ✭ 120 (-76.42%)
Mutual labels:  spark, spark-streaming
Sitewhere
SiteWhere is an industrial strength open-source application enablement platform for the Internet of Things (IoT). It provides a multi-tenant microservice-based infrastructure that includes device/asset management, data ingestion, big-data storage, and integration through a modern, scalable architecture. SiteWhere provides REST APIs for all system functionality. SiteWhere provides SDKs for many common device platforms including Android, iOS, Arduino, and any Java-capable platform such as Raspberry Pi rapidly accelerating the speed of innovation.
Stars: ✭ 788 (+54.81%)
Mutual labels:  platform, integration
Whylogs Java
Profile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (-67.78%)
Mutual labels:  dataset, spark
Bigdata Interview
🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结
Stars: ✭ 857 (+68.37%)
Mutual labels:  spark, mapreduce
Mare
MaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.
Stars: ✭ 11 (-97.84%)
Mutual labels:  spark, mapreduce
Big Data Engineering Coursera Yandex
Big Data for Data Engineers Coursera Specialization from Yandex
Stars: ✭ 71 (-86.05%)
Mutual labels:  spark, mapreduce
Utils4s
scala、spark使用过程中,各种测试用例以及相关资料整理
Stars: ✭ 1,070 (+110.22%)
Mutual labels:  spark, spark-streaming
Azure Event Hubs Spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-72.5%)
Mutual labels:  spark, spark-streaming
Pyspark Learning
Updated repository
Stars: ✭ 147 (-71.12%)
Mutual labels:  spark, spark-streaming
Bdp Dataplatform
大数据生态解决方案数据平台:基于大数据、数据平台、微服务、机器学习、商城、自动化运维、DevOps、容器部署平台、数据平台采集、数据平台存储、数据平台计算、数据平台开发、数据平台应用搭建的大数据解决方案。
Stars: ✭ 456 (-10.41%)
Mutual labels:  spark, mapreduce
Gimel
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-57.56%)
Mutual labels:  spark, spark-streaming
interview-refresh-java-bigdata
a one-stop repo to lookup for code snippets of core java concepts, sql, data structures as well as big data. It also consists of interview questions asked in real-life.
Stars: ✭ 25 (-95.09%)
Mutual labels:  spark-streaming, mapreduce
qs-hadoop
大数据生态圈学习
Stars: ✭ 18 (-96.46%)
Mutual labels:  spark-streaming, mapreduce
Data Science Ipython Notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+4231.63%)
Mutual labels:  spark, mapreduce
Example Spark Kafka
Apache Spark and Apache Kafka integration example
Stars: ✭ 120 (-76.42%)
Mutual labels:  spark, spark-streaming
Spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+238.11%)
Mutual labels:  spark, spark-streaming
Learningspark
Scala examples for learning to use Spark
Stars: ✭ 421 (-17.29%)
Mutual labels:  spark, spark-streaming
Kafka Storm Starter
Code examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
Stars: ✭ 728 (+43.03%)
Mutual labels:  spark, integration
Spark Mllib Twitter Sentiment Analysis
🌟 ✨ Analyze and visualize Twitter Sentiment on a world map using Spark MLlib
Stars: ✭ 113 (-77.8%)
Mutual labels:  spark, spark-streaming
Repository
个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。
Stars: ✭ 92 (-81.93%)
Mutual labels:  spark, mapreduce
Openremote
100% open-source IoT Platform - Integrate your assets, create rules, and visualize your data
Stars: ✭ 254 (-50.1%)
Mutual labels:  platform, middleware
Yandex Big Data Engineering
Stars: ✭ 17 (-96.66%)
Mutual labels:  spark, mapreduce
Sparkling Water
Sparkling Water provides H2O functionality inside Spark cluster
Stars: ✭ 887 (+74.26%)
Mutual labels:  spark, integration
Data Algorithms Book
MapReduce, Spark, Java, and Scala for Data Algorithms Book
Stars: ✭ 949 (+86.44%)
Mutual labels:  spark, mapreduce
Angel
A Flexible and Powerful Parameter Server for large-scale machine learning
Stars: ✭ 6,458 (+1168.76%)
Mutual labels:  spark, spark-streaming
Pyspark Examples
Code examples on Apache Spark using python
Stars: ✭ 58 (-88.61%)
Mutual labels:  spark, spark-streaming
Waterdrop
Production Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+264.64%)
Mutual labels:  spark, spark-streaming
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-84.48%)
Mutual labels:  dataset, spark
Product Ei
An open source, a high-performance hybrid integration platform that allows developers quick integration with any application, data, or system.
Stars: ✭ 277 (-45.58%)
Mutual labels:  middleware, integration
Coolplayspark
酷玩 Spark: Spark 源代码解析、Spark 类库等
Stars: ✭ 3,318 (+551.87%)
Mutual labels:  spark, spark-streaming
Inat comp
iNaturalist competition details
Stars: ✭ 444 (-12.77%)
Mutual labels:  dataset
Ace tao
ACE and TAO
Stars: ✭ 472 (-7.27%)
Mutual labels:  middleware
Nodequant
一个基于Node.js的开源量化交易平台,轻巧地开发和部署量化投资策略
Stars: ✭ 444 (-12.77%)
Mutual labels:  platform
Express Jwt Permissions
🚦 Express middleware for JWT permissions
Stars: ✭ 444 (-12.77%)
Mutual labels:  middleware
Mortar
Mortar is a GO framework/library for building gRPC (and REST) web services.
Stars: ✭ 492 (-3.34%)
Mutual labels:  middleware
Quadpy
Numerical integration (quadrature, cubature) in Python
Stars: ✭ 471 (-7.47%)
Mutual labels:  integration
Bigdataie
大数据博客、笔试题、教程、项目、面经的整理
Stars: ✭ 445 (-12.57%)
Mutual labels:  spark
God Of Bigdata
专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
Stars: ✭ 6,008 (+1080.35%)
Mutual labels:  spark
Spark
Cross-platform real-time collaboration client optimized for business and organizations.
Stars: ✭ 471 (-7.47%)
Mutual labels:  spark
Express Openapi Validator
🦋 Auto-validates api requests, responses, and securities using ExpressJS and an OpenAPI 3.x specification
Stars: ✭ 436 (-14.34%)
Mutual labels:  middleware
Gongular
A different approach to Go web frameworks
Stars: ✭ 438 (-13.95%)
Mutual labels:  middleware
Voice datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).
Stars: ✭ 494 (-2.95%)
Mutual labels:  dataset
Datafire
A framework for building integrations and APIs
Stars: ✭ 487 (-4.32%)
Mutual labels:  integration
Chinese rumor dataset
中文谣言数据
Stars: ✭ 470 (-7.66%)
Mutual labels:  dataset
High Performance Spark Examples
Examples for High Performance Spark
Stars: ✭ 436 (-14.34%)
Mutual labels:  spark
Syndesis
A flexible, customizable, open source platform that provides core integration capabilities as a service.
Stars: ✭ 433 (-14.93%)
Mutual labels:  integration
Bigslice
A serverless cluster computing system for the Go programming language
Stars: ✭ 469 (-7.86%)
Mutual labels:  mapreduce
1-60 of 1654 similar projects