All Projects → Spark-and-Kafka_IoT-Data-Processing-and-Analytics → Similar Projects or Alternatives

300 Open source projects that are alternatives of or similar to Spark-and-Kafka_IoT-Data-Processing-and-Analytics

optimus
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+3116.67%)
Mutual labels:  bigdata, pyspark
big data
A collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (-19.05%)
Mutual labels:  bigdata, pyspark
bigdatatutorial
bigdatatutorial
Stars: ✭ 34 (-19.05%)
Mutual labels:  bigdata, spark-streaming
Optimus
🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+2247.62%)
Mutual labels:  bigdata, pyspark
Mobius
C# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+2111.9%)
Mutual labels:  bigdata, spark-streaming
Gimel
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (+414.29%)
Mutual labels:  pyspark, spark-streaming
Azure Event Hubs Spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (+233.33%)
Mutual labels:  bigdata, spark-streaming
qs-hadoop
大数据生态圈学习
Stars: ✭ 18 (-57.14%)
Mutual labels:  bigdata, spark-streaming
Spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+3997.62%)
Mutual labels:  bigdata, spark-streaming
Pyspark Learning
Updated repository
Stars: ✭ 147 (+250%)
Mutual labels:  pyspark, spark-streaming
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+3085.71%)
Mutual labels:  bigdata, pyspark
kafka-twitter-spark-streaming
Counting Tweets Per User in Real-Time
Stars: ✭ 38 (-9.52%)
Mutual labels:  pyspark, spark-streaming
Spark Streaming Monitoring With Lightning
Plot live-stats as graph from ApacheSpark application using Lightning-viz
Stars: ✭ 15 (-64.29%)
Mutual labels:  bigdata, spark-streaming
anovos
Anovos - An Open Source Library for Scalable feature engineering Using Apache-Spark
Stars: ✭ 77 (+83.33%)
Mutual labels:  bigdata, pyspark
data processing course
Some class materials for a data processing course using PySpark
Stars: ✭ 50 (+19.05%)
Mutual labels:  bigdata, pyspark
flokkr
Documentation placeholder and utilities for all the other containers.
Stars: ✭ 30 (-28.57%)
Mutual labels:  bigdata
pulsar-user-group-loc-cn
Workspace for China local user group.
Stars: ✭ 19 (-54.76%)
Mutual labels:  bigdata
SynapseML
Simple and Distributed Machine Learning
Stars: ✭ 3,355 (+7888.1%)
Mutual labels:  pyspark
Exposure
Exposure是一个帮助做曝光统计需求的库,可以很方便的对曝光事件进行埋点,在现有代码上少量侵入即可实现曝光埋点。支持RV的线性布局、网格布局、瀑布流布局、横向滑动RV,ScrollView等各种滚动布局。支持配置item的有效曝光面积。
Stars: ✭ 51 (+21.43%)
Mutual labels:  bigdata
litemall-dw
基于开源Litemall电商项目的大数据项目,包含前端埋点(openresty+lua)、后端埋点;数据仓库(五层)、实时计算和用户画像。大数据平台采用CDH6.3.2(已使用vagrant+ansible脚本化),同时也包含了Azkaban的workflow。
Stars: ✭ 36 (-14.29%)
Mutual labels:  spark-streaming
NYC Taxi Pipeline
Design/Implement stream/batch architecture on NYC taxi data | #DE
Stars: ✭ 16 (-61.9%)
Mutual labels:  spark-streaming
UnROOT.jl
Native Julia I/O package to work with CERN ROOT files
Stars: ✭ 52 (+23.81%)
Mutual labels:  bigdata
cds
Data syncing in golang for ClickHouse.
Stars: ✭ 839 (+1897.62%)
Mutual labels:  bigdata
datasphere-service
an open source dataworks platform
Stars: ✭ 20 (-52.38%)
Mutual labels:  bigdata
meetups-archivos
Ppts, códigos y videos de las meetups, data science days, videollamadas y workshops. Data Science Research es una organización sin fines de lucro que busca difundir, descentralizar y difundir los conocimientos en Ciencia de Datos e Inteligencia Artificial en el Perú, dando oportunidades a nuevos talentos mediante MeetUps, Workshops y Semilleros …
Stars: ✭ 60 (+42.86%)
Mutual labels:  bigdata
jupyterlab-sparkmonitor
JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook
Stars: ✭ 78 (+85.71%)
Mutual labels:  pyspark
check-engine
Data validation library for PySpark 3.0.0
Stars: ✭ 29 (-30.95%)
Mutual labels:  pyspark
v6.dooring.public
可视化大屏解决方案, 提供一套可视化编辑引擎, 助力个人或企业轻松定制自己的可视化大屏应用.
Stars: ✭ 323 (+669.05%)
Mutual labels:  bigdata
spark-utils
Basic framework utilities to quickly start writing production ready Apache Spark applications
Stars: ✭ 25 (-40.48%)
Mutual labels:  spark-streaming
learning notes
学习笔记
Stars: ✭ 18 (-57.14%)
Mutual labels:  bigdata
BigDataTools
tools for bigData
Stars: ✭ 36 (-14.29%)
Mutual labels:  bigdata
kuwala
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…
Stars: ✭ 474 (+1028.57%)
Mutual labels:  pyspark
cassandra.realtime
Different ways to process data into Cassandra in realtime with technologies such as Kafka, Spark, Akka, Flink
Stars: ✭ 25 (-40.48%)
Mutual labels:  spark-streaming
ai-deployment
关注AI模型上线、模型部署
Stars: ✭ 149 (+254.76%)
Mutual labels:  pyspark
SparkTwitterAnalysis
An Apache Spark standalone application using the Spark API in Scala. The application uses Simple Build Tool(SBT) for building the project.
Stars: ✭ 29 (-30.95%)
Mutual labels:  bigdata
Springboard-Data-Science-Immersive
No description or website provided.
Stars: ✭ 52 (+23.81%)
Mutual labels:  pyspark
awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 11,093 (+26311.9%)
Mutual labels:  bigdata
pyspark-cheatsheet
PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster
Stars: ✭ 115 (+173.81%)
Mutual labels:  pyspark
architect big data solutions with spark
code, labs and lectures for the course
Stars: ✭ 40 (-4.76%)
Mutual labels:  spark-streaming
sparklanes
A lightweight data processing framework for Apache Spark
Stars: ✭ 17 (-59.52%)
Mutual labels:  pyspark
vor
The new IoT Office Experience.
Stars: ✭ 44 (+4.76%)
Mutual labels:  iot-sensors
Azure-Databricks-NYC-Taxi-Workshop
An Azure Databricks workshop leveraging the New York Taxi and Limousine Commission Trip Records dataset
Stars: ✭ 71 (+69.05%)
Mutual labels:  pyspark
phrase-at-scale
Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages other than English
Stars: ✭ 115 (+173.81%)
Mutual labels:  pyspark
room-renting
用Python爬取安居客房源信息,并用高德地图进行可视化
Stars: ✭ 16 (-61.9%)
Mutual labels:  bigdata
pyspark-k8s-boilerplate
Boilerplate for PySpark on Cloud Kubernetes
Stars: ✭ 24 (-42.86%)
Mutual labels:  pyspark
wasp
WASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you.
Stars: ✭ 19 (-54.76%)
Mutual labels:  spark-streaming
ETL-Starter-Kit
📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.
Stars: ✭ 21 (-50%)
Mutual labels:  bigdata
flink-learn
Learning Flink : Flink CEP,Flink Core,Flink SQL
Stars: ✭ 70 (+66.67%)
Mutual labels:  bigdata
databricks-notebooks
Collection of Databricks and Jupyter Notebooks
Stars: ✭ 19 (-54.76%)
Mutual labels:  pyspark
dlsa
Distributed least squares approximation (dlsa) implemented with Apache Spark
Stars: ✭ 25 (-40.48%)
Mutual labels:  pyspark
taller SparkR
Taller SparkR para las Jornadas de Usuarios de R
Stars: ✭ 12 (-71.43%)
Mutual labels:  bigdata
hadoopoffice
HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
Stars: ✭ 56 (+33.33%)
Mutual labels:  bigdata
coolplayflink
Flink: Stateful Computations over Data Streams
Stars: ✭ 14 (-66.67%)
Mutual labels:  bigdata
bqv
The simplest tool to manage views of BigQuery.
Stars: ✭ 22 (-47.62%)
Mutual labels:  bigdata
learning-spark
Tidy up Spark and Hadoop tutorials.
Stars: ✭ 28 (-33.33%)
Mutual labels:  bigdata
python mozetl
ETL jobs for Firefox Telemetry
Stars: ✭ 25 (-40.48%)
Mutual labels:  pyspark
ODSC India 2018
My presentation at ODSC India 2018 about Deep Learning with Apache Spark
Stars: ✭ 26 (-38.1%)
Mutual labels:  pyspark
machine-learning-course
Machine Learning Course @ Santa Clara University
Stars: ✭ 17 (-59.52%)
Mutual labels:  pyspark
SparkProgrammingInScala
Apache Spark Course Material
Stars: ✭ 57 (+35.71%)
Mutual labels:  bigdata
bigdata-tech-index
Big Data Technology Index
Stars: ✭ 24 (-42.86%)
Mutual labels:  bigdata
1-60 of 300 similar projects