Mobius
C# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+82.51%)
Thingsboard
Open-source IoT Platform - Device management, data collection, processing and visualization.
Stars: ✭ 10,526 (+1967.98%)
Sparta
Real Time Analytics and Data Pipelines based on Spark Streaming
Stars: ✭ 513 (+0.79%)
Spark States
Custom state store providers for Apache Spark
Stars: ✭ 83 (-83.69%)
Dpark
Python clone of Spark, a MapReduce-like framework in Python
Stars: ✭ 2,668 (+424.17%)
Example Spark
Spark, Spark Streaming and Spark SQL unit testing strategies
Stars: ✭ 205 (-59.72%)
Data Accelerator
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy-to-use experience to help with creation, editing and management of Spark jobs on Azure HDInsight or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (-51.47%)
Tedsds
Apache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Stars: ✭ 14 (-97.25%)
Kinesis Sql
Kinesis Connector for Structured Streaming
Stars: ✭ 120 (-76.42%)
Sitewhere
SiteWhere is an industrial-strength open-source application enablement platform for the Internet of Things (IoT). It provides a multi-tenant microservice-based infrastructure that includes device/asset management, data ingestion, big-data storage, and integration through a modern, scalable architecture. SiteWhere provides REST APIs for all system functionality. SiteWhere provides SDKs for many common device platforms including Android, iOS, Arduino, and any Java-capable platform such as Raspberry Pi, rapidly accelerating the speed of innovation.
Stars: ✭ 788 (+54.81%)
Whylogs Java
Profile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (-67.78%)
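Bigdata Interview
🎯 🌟 [Big data interview questions] A collection of big data interview questions gathered from around the web, together with my own summarized answers. Currently covers interview questions and notes for the Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper frameworks.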
Stars: ✭ 857 (+68.37%)
Mare
MaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.
Stars: ✭ 11 (-97.84%)
Utils4s
A collection of test cases and related reference material gathered while working with Scala and Spark
Stars: ✭ 1,070 (+110.22%)
Azure Event Hubs Spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-72.5%)
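Bdp Dataplatform
Data platform for big data ecosystem solutions: a big data solution built on big data, data platforms, microservices, machine learning, e-commerce, automated operations, DevOps, container deployment platforms, and data platform collection, storage, computation, development, and application building.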
Stars: ✭ 456 (-10.41%)
Gimel
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-57.56%)
interview-refresh-java-bigdata
A one-stop repo for looking up code snippets of core Java concepts, SQL, data structures, and big data. It also includes interview questions asked in real life.
Stars: ✭ 25 (-95.09%)
Data Science Ipython Notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+4231.63%)
Spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+238.11%)
Learningspark
Scala examples for learning to use Spark
Stars: ✭ 421 (-17.29%)
Kafka Storm Starter
Code examples that show how to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
Stars: ✭ 728 (+43.03%)
Repository
Personal learning knowledge base covering data warehouse modeling, real-time computing, big data, Java, algorithms, and more.
Stars: ✭ 92 (-81.93%)
Openremote
100% open-source IoT Platform - Integrate your assets, create rules, and visualize your data
Stars: ✭ 254 (-50.1%)
Sparkling Water
Sparkling Water provides H2O functionality inside a Spark cluster
Stars: ✭ 887 (+74.26%)
Data Algorithms Book
MapReduce, Spark, Java, and Scala for Data Algorithms Book
Stars: ✭ 949 (+86.44%)
Angel
A Flexible and Powerful Parameter Server for large-scale machine learning
Stars: ✭ 6,458 (+1168.76%)
Pyspark Examples
Code examples on Apache Spark using Python
Stars: ✭ 58 (-88.61%)
Waterdrop
Production Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+264.64%)
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-84.48%)
Product Ei
An open-source, high-performance hybrid integration platform that allows developers to integrate quickly with any application, data, or system.
Stars: ✭ 277 (-45.58%)
Coolplayspark
Cool Play Spark: Spark source code analysis, Spark libraries, and more
Stars: ✭ 3,318 (+551.87%)
Inat comp
iNaturalist competition details
Stars: ✭ 444 (-12.77%)
Ace tao
ACE and TAO
Stars: ✭ 472 (-7.27%)
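Nodequant
An open-source quantitative trading platform based on Node.js for lightweight development and deployment of quantitative investment strategies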
Stars: ✭ 444 (-12.77%)
Mortar
Mortar is a Go framework/library for building gRPC (and REST) web services.
Stars: ✭ 492 (-3.34%)
Quadpy
Numerical integration (quadrature, cubature) in Python
Stars: ✭ 471 (-7.47%)
Bigdataie
A collection of big data blogs, written test questions, tutorials, projects, and interview experiences
Stars: ✭ 445 (-12.57%)
God Of Bigdata
Focused on big data learning and interviews; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...
Stars: ✭ 6,008 (+1080.35%)
Spark
Cross-platform real-time collaboration client optimized for business and organizations.
Stars: ✭ 471 (-7.47%)
Express Openapi Validator
🦋 Auto-validates API requests, responses, and securities using ExpressJS and an OpenAPI 3.x specification
Stars: ✭ 436 (-14.34%)
Gongular
A different approach to Go web frameworks
Stars: ✭ 438 (-13.95%)
Voice datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).
Stars: ✭ 494 (-2.95%)
Datafire
A framework for building integrations and APIs
Stars: ✭ 487 (-4.32%)
Syndesis
A flexible, customizable, open source platform that provides core integration capabilities as a service.
Stars: ✭ 433 (-14.93%)
Bigslice
A serverless cluster computing system for the Go programming language
Stars: ✭ 469 (-7.86%)