All Projects → Big Data → Similar Projects or Alternatives

868 Open source projects that are alternatives of or similar to Big Data

Intro To R
Stars: ✭ 71 (-23.66%)
Mutual labels:  rstudio, workshop
R-advantages-over-python
This repository enumerates all the reasons why R is better than python for DS
Stars: ✭ 59 (-36.56%)
Mutual labels:  dplyr, rstudio
Sparklyr
R interface for Apache Spark
Stars: ✭ 775 (+733.33%)
Mutual labels:  spark, dplyr
Hyperspace
An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
Stars: ✭ 246 (+164.52%)
Mutual labels:  spark, databases
Spark Workshop
Apache Spark™ and Scala Workshops
Stars: ✭ 224 (+140.86%)
Mutual labels:  spark, workshop
db.rstudio.com
Website dedicated to all things R and Databases
Stars: ✭ 13 (-86.02%)
Mutual labels:  dplyr, databases
Tidyheatmap
Draw heatmap simply using a tidy data frame
Stars: ✭ 151 (+62.37%)
Mutual labels:  dplyr, rstudio
rworkshops
Materials for R Workshops
Stars: ✭ 43 (-53.76%)
Mutual labels:  workshop, rstudio
Installations mac ubuntu windows
Installations for Data Science. Anaconda, RStudio, Spark, TensorFlow, AWS (Amazon Web Services).
Stars: ✭ 231 (+148.39%)
Mutual labels:  spark, rstudio
learning R
List of resources for learning R
Stars: ✭ 32 (-65.59%)
Mutual labels:  dplyr, rstudio
Moderndive book
Statistical Inference via Data Science: A ModernDive into R and the Tidyverse
Stars: ✭ 527 (+466.67%)
Mutual labels:  dplyr, rstudio
Delta Architecture
Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline
Stars: ✭ 43 (-53.76%)
Mutual labels:  spark, databases
Ds Cheatsheets
List of Data Science Cheatsheets to rule the world
Stars: ✭ 9,452 (+10063.44%)
Mutual labels:  spark
Hops Examples
Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops
Stars: ✭ 84 (-9.68%)
Mutual labels:  spark
Workshops
Workshops for The Things Network
Stars: ✭ 74 (-20.43%)
Mutual labels:  workshop
Lpa Detector
Optimize and improve the Label propagation algorithm
Stars: ✭ 75 (-19.35%)
Mutual labels:  spark
Ammonite Spark
Run spark calculations from Ammonite
Stars: ✭ 88 (-5.38%)
Mutual labels:  spark
Spark States
Custom state store providers for Apache Spark
Stars: ✭ 83 (-10.75%)
Mutual labels:  spark
Labs
Research on distributed system
Stars: ✭ 73 (-21.51%)
Mutual labels:  spark
Luigi Warehouse
A luigi powered analytics / warehouse stack
Stars: ✭ 72 (-22.58%)
Mutual labels:  spark
Spark Dependencies
Spark job for dependency links
Stars: ✭ 82 (-11.83%)
Mutual labels:  spark
Big Data Engineering Coursera Yandex
Big Data for Data Engineers Coursera Specialization from Yandex
Stars: ✭ 71 (-23.66%)
Mutual labels:  spark
Avocado
Strongly-typed MongoDB driver for Rust
Stars: ✭ 70 (-24.73%)
Mutual labels:  databases
Trackmd
Tools for tracking changes in Markdown format within RStudio
Stars: ✭ 89 (-4.3%)
Mutual labels:  rstudio
Wraprmd
RStudio addin for wrapping RMarkdown paragraphs
Stars: ✭ 87 (-6.45%)
Mutual labels:  rstudio
Lehar
Visualize data using relative ordering
Stars: ✭ 81 (-12.9%)
Mutual labels:  spark
Usersessionbehaviorofflineanalysis
四川大学拓思爱诺用户session行为数据离线分析项目
Stars: ✭ 69 (-25.81%)
Mutual labels:  spark
Go Craq
CRAQ (Chain Replication with Apportioned Queries) in Go
Stars: ✭ 75 (-19.35%)
Mutual labels:  databases
Flint
Webex Bot SDK for Node.js (deprecated in favor of https://github.com/webex/webex-bot-node-framework)
Stars: ✭ 85 (-8.6%)
Mutual labels:  spark
Dataspherestudio
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+1184.95%)
Mutual labels:  spark
P2p Internet Workshop
Building the Peer-to-Peer Internet workshop series
Stars: ✭ 88 (-5.38%)
Mutual labels:  workshop
Apache Spark Hands On
Educational notes,Hands on problems w/ solutions for hadoop ecosystem
Stars: ✭ 74 (-20.43%)
Mutual labels:  spark
Chaingear
The consensus computer driven database framework
Stars: ✭ 83 (-10.75%)
Mutual labels:  databases
Spark Twitter Stream Example
"Sentiment analysis" on a live Twitter feed with Apache Spark and Apache Bahir
Stars: ✭ 73 (-21.51%)
Mutual labels:  spark
Jcabi Jdbc
Fluent Wrapper of JDBC
Stars: ✭ 90 (-3.23%)
Mutual labels:  databases
Kamu Cli
Next generation tool for decentralized exchange and transformation of semi-structured data
Stars: ✭ 69 (-25.81%)
Mutual labels:  spark
Hadoop cookbook
Cookbook to install Hadoop 2.0+ using Chef
Stars: ✭ 82 (-11.83%)
Mutual labels:  spark
Spark Nlp Models
Models and Pipelines for the Spark NLP library
Stars: ✭ 88 (-5.38%)
Mutual labels:  spark
Shrtcts
Make Anything an RStudio Shortcut
Stars: ✭ 71 (-23.66%)
Mutual labels:  rstudio
Mleap
MLeap: Deploy ML Pipelines to Production
Stars: ✭ 1,232 (+1224.73%)
Mutual labels:  spark
Starter Academic
🎓 Easily create a beautiful academic résumé or educational website using Hugo, GitHub, and Netlify
Stars: ✭ 1,158 (+1145.16%)
Mutual labels:  rstudio
Spark On Kubernetes Helm
Spark on Kubernetes infrastructure Helm charts repo
Stars: ✭ 92 (-1.08%)
Mutual labels:  spark
Spark Gbtlr
Hybrid model of Gradient Boosting Trees and Logistic Regression (GBDT+LR) on Spark
Stars: ✭ 81 (-12.9%)
Mutual labels:  spark
Distkv
A light weight distributed key-value database system with table concept.
Stars: ✭ 69 (-25.81%)
Mutual labels:  databases
Fast Mrmr
An improved implementation of the classical feature selection method: minimum Redundancy and Maximum Relevance (mRMR).
Stars: ✭ 67 (-27.96%)
Mutual labels:  spark
Kontextfrei
Writing application logic for Spark jobs that can be unit-tested without a SparkContext
Stars: ✭ 67 (-27.96%)
Mutual labels:  spark
Spark python ml examples
Spark 2.0 Python Machine Learning examples
Stars: ✭ 87 (-6.45%)
Mutual labels:  spark
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-15.05%)
Mutual labels:  spark
Thingsboard
Open-source IoT Platform - Device management, data collection, processing and visualization.
Stars: ✭ 10,526 (+11218.28%)
Mutual labels:  spark
Rsparkling
RSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-30.11%)
Mutual labels:  spark
Docker Spark
🚢 Docker image for Apache Spark
Stars: ✭ 78 (-16.13%)
Mutual labels:  spark
Spark Bigquery
Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
Stars: ✭ 65 (-30.11%)
Mutual labels:  spark
Pydata Pandas Workshop
Material for my PyData Jupyter & Pandas Workshops, I'm also available for personal in-house trainings on request
Stars: ✭ 65 (-30.11%)
Mutual labels:  workshop
Udacity Data Engineering
Udacity Data Engineering Nano Degree (DEND)
Stars: ✭ 89 (-4.3%)
Mutual labels:  spark
Testing Workshop
A workshop for learning how to test JavaScript applications
Stars: ✭ 1,276 (+1272.04%)
Mutual labels:  workshop
Rsqlserver
SQL Server DBI for R, based on the jTDS driver
Stars: ✭ 76 (-18.28%)
Mutual labels:  dplyr
W2v
Word2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-31.18%)
Mutual labels:  spark
Pyspark Twitter Stream Mining
Real-time Machine Learning with Apache Spark on Twitter Public Stream
Stars: ✭ 64 (-31.18%)
Mutual labels:  spark
Home
ApacheCN 开源组织:公告、介绍、成员、活动、交流方式
Stars: ✭ 1,199 (+1189.25%)
Mutual labels:  spark
Xaringan
Presentation Ninja 幻灯忍者 · 写轮眼
Stars: ✭ 1,129 (+1113.98%)
Mutual labels:  rstudio
1-60 of 868 similar projects