All Projects → Data Ingestion Platform → Similar Projects or Alternatives

707 Open source projects that are alternatives of or similar to Data Ingestion Platform

Cleanframes
type-class based data cleansing library for Apache Spark SQL
Stars: ✭ 75 (+92.31%)
Mutual labels:  spark
Oap
Optimized Analytics Package for Spark* Platform
Stars: ✭ 343 (+779.49%)
Mutual labels:  spark
yuzhouwan
Code Library for My Blog
Stars: ✭ 39 (+0%)
Mutual labels:  spark
Learningapachespark
LearningApacheSpark
Stars: ✭ 155 (+297.44%)
Mutual labels:  spark
flink-connectors
Apache Flink connectors for Pravega.
Stars: ✭ 84 (+115.38%)
Mutual labels:  flink
Lpa Detector
Optimize and improve the Label propagation algorithm
Stars: ✭ 75 (+92.31%)
Mutual labels:  spark
Sparkctr
CTR prediction model based on spark(LR, GBDT, DNN)
Stars: ✭ 740 (+1797.44%)
Mutual labels:  spark
SFDCRules
Simple yet powerful Rule Engine for Salesforce - SFDCRules
Stars: ✭ 38 (-2.56%)
Mutual labels:  apex
Luigi Warehouse
A luigi powered analytics / warehouse stack
Stars: ✭ 72 (+84.62%)
Mutual labels:  spark
Scalnet
A Scala wrapper for Deeplearning4j, inspired by Keras. Scala + DL + Spark + GPUs
Stars: ✭ 342 (+776.92%)
Mutual labels:  spark
Usersessionbehaviorofflineanalysis
四川大学拓思爱诺用户session行为数据离线分析项目
Stars: ✭ 69 (+76.92%)
Mutual labels:  spark
stormnode
Node js node client for storm.dev
Stars: ✭ 11 (-71.79%)
Mutual labels:  storm
Kontextfrei
Writing application logic for Spark jobs that can be unit-tested without a SparkContext
Stars: ✭ 67 (+71.79%)
Mutual labels:  spark
Force.com Utility Library
Salesforce Utility
Stars: ✭ 9 (-76.92%)
Mutual labels:  apex
Rsparkling
RSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (+66.67%)
Mutual labels:  spark
hadoopoffice
HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
Stars: ✭ 56 (+43.59%)
Mutual labels:  flink
W2v
Word2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (+64.1%)
Mutual labels:  spark
Ytk Learn
Ytk-learn is a distributed machine learning library which implements most of popular machine learning algorithms(GBDT, GBRT, Mixture Logistic Regression, Gradient Boosting Soft Tree, Factorization Machines, Field-aware Factorization Machines, Logistic Regression, Softmax).
Stars: ✭ 337 (+764.1%)
Mutual labels:  spark
Pysparkgeoanalysis
🌐 Interactive Workshop on GeoAnalysis using PySpark
Stars: ✭ 63 (+61.54%)
Mutual labels:  spark
Archived-SANSA-Query
SANSA Query Layer
Stars: ✭ 31 (-20.51%)
Mutual labels:  flink
Sparkmonitor
Monitor Apache Spark from Jupyter Notebook
Stars: ✭ 154 (+294.87%)
Mutual labels:  spark
Sparkle
Haskell on Apache Spark.
Stars: ✭ 419 (+974.36%)
Mutual labels:  spark
docker-apex-stack
Utility scripts for creating an Oracle Application Express stack as a Docker container.
Stars: ✭ 67 (+71.79%)
Mutual labels:  apex
Quill
Compile-time Language Integrated Queries for Scala
Stars: ✭ 1,998 (+5023.08%)
Mutual labels:  spark
Waimak
Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.
Stars: ✭ 60 (+53.85%)
Mutual labels:  spark
flink-demo
Flink Demo
Stars: ✭ 39 (+0%)
Mutual labels:  flink
Zemberek Nlp Server
Zemberek Türkçe NLP Java Kütüphanesi üzerine REST Docker Sunucu
Stars: ✭ 60 (+53.85%)
Mutual labels:  spark
Pmd
An extensible multilanguage static code analyzer.
Stars: ✭ 3,667 (+9302.56%)
Mutual labels:  apex
Rumble
⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (+48.72%)
Mutual labels:  spark
xxhadoop
Data Analysis Using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. This is My Daily Notes/Code/Demo. Don't fork, Just star !
Stars: ✭ 37 (-5.13%)
Mutual labels:  storm
Weblogsanalysissystem
A big data platform for analyzing web access logs
Stars: ✭ 37 (-5.13%)
Mutual labels:  spark
Apex-Code-Conventions
Apex conventions and best practices for Salesforce Developers
Stars: ✭ 28 (-28.21%)
Mutual labels:  apex
Docker Hadoop
A Docker container with a full Hadoop cluster setup with Spark and Zeppelin
Stars: ✭ 54 (+38.46%)
Mutual labels:  spark
Cook
Fair job scheduler on Kubernetes and Mesos for batch workloads and Spark
Stars: ✭ 314 (+705.13%)
Mutual labels:  spark
Spark Submit Ui
This is a based on playframwork for submit spark app
Stars: ✭ 53 (+35.9%)
Mutual labels:  spark
ApexConfigs
Apex Legends configs for a competitve player
Stars: ✭ 52 (+33.33%)
Mutual labels:  apex
Spark Nkp
Natural Korean Processor for Apache Spark
Stars: ✭ 50 (+28.21%)
Mutual labels:  spark
Hail
Scalable genomic data analysis.
Stars: ✭ 706 (+1710.26%)
Mutual labels:  spark
Awesome Recommendation Engine
The purpose of this tiny project is to put things together with the know how that i learned from the course big data expert from formacionhadoop.com The idea is to show how to play with apache spark streaming, kafka,mongo, spark machine learning algorithms.
Stars: ✭ 47 (+20.51%)
Mutual labels:  spark
apex-tmLanguage
Salesforce Apex Language syntax grammar used for colorization
Stars: ✭ 27 (-30.77%)
Mutual labels:  apex
Spark Tda
SparkTDA is a package for Apache Spark providing Topological Data Analysis Functionalities.
Stars: ✭ 45 (+15.38%)
Mutual labels:  spark
Coolplayspark
酷玩 Spark: Spark 源代码解析、Spark 类库等
Stars: ✭ 3,318 (+8407.69%)
Mutual labels:  spark
Spark Examples
Spark examples
Stars: ✭ 41 (+5.13%)
Mutual labels:  spark
qs-hadoop
大数据生态圈学习
Stars: ✭ 18 (-53.85%)
Mutual labels:  storm
Azure Kusto Spark
Apache Spark Connector for Azure Kusto
Stars: ✭ 40 (+2.56%)
Mutual labels:  spark
Affiliationsecurity
HEDA Affiliation-Based Security for Salesforce
Stars: ✭ 8 (-79.49%)
Mutual labels:  apex
Flink Doc Zh
Apache Flink 中文文档
Stars: ✭ 242 (+520.51%)
Mutual labels:  flink
Storm-Kafka
Storm Kafka 流数据 处理系统
Stars: ✭ 20 (-48.72%)
Mutual labels:  storm
Flink Recommandsystem Demo
🚁🚀基于Flink实现的商品实时推荐系统。flink统计商品热度,放入redis缓存,分析日志信息,将画像标签和实时记录放入Hbase。在用户发起推荐请求后,根据用户画像重排序热度榜,并结合协同过滤和标签两个推荐模块为新生成的榜单的每一个产品添加关联产品,最后返回新的用户列表。
Stars: ✭ 3,115 (+7887.18%)
Mutual labels:  flink
Apex Recipes
A library of concise, meaningful examples of Apex code for common use cases following best practices.
Stars: ✭ 307 (+687.18%)
Mutual labels:  apex
Spark.jl
Julia binding for Apache Spark
Stars: ✭ 153 (+292.31%)
Mutual labels:  spark
spark-gradle-template
Apache Spark in your IDE with gradle
Stars: ✭ 39 (+0%)
Mutual labels:  spark
Powderkeg
Live-coding the cluster!
Stars: ✭ 152 (+289.74%)
Mutual labels:  spark
webmorph
Average and morph faces online http://webmorph.org/
Stars: ✭ 55 (+41.03%)
Mutual labels:  batch-processing
Snappydata
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
Stars: ✭ 995 (+2451.28%)
Mutual labels:  spark
Real Time Stream Processing Engine
This is an example of real time stream processing using Spark Streaming, Kafka & Elasticsearch.
Stars: ✭ 37 (-5.13%)
Mutual labels:  spark
Objectmerge
Open-source solution for merging Salesforce objects and their related objects.
Stars: ✭ 35 (-10.26%)
Mutual labels:  apex
Spark Flamegraph
Easy CPU Profiling for Apache Spark applications
Stars: ✭ 30 (-23.08%)
Mutual labels:  spark
Apex Test Tracker
Lightweight native continuous integration tool for Salesforce
Stars: ✭ 12 (-69.23%)
Mutual labels:  apex
Spark Tsne
Distributed t-SNE via Apache Spark
Stars: ✭ 151 (+287.18%)
Mutual labels:  spark
601-660 of 707 similar projects