All Projects → Spark Lucenerdd → Similar Projects or Alternatives

520 Open source projects that are alternatives of or similar to Spark Lucenerdd

Elassandra
Elassandra = Elasticsearch + Apache Cassandra
Stars: ✭ 1,610 (+1312.28%)
Mutual labels:  spark, lucene
splink
Implementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
Stars: ✭ 181 (+58.77%)
Mutual labels:  spark, deduplication
Roaringbitmap
A better compressed bitset in Java
Stars: ✭ 2,460 (+2057.89%)
Mutual labels:  spark, lucene
experiments
Code examples for my blog posts
Stars: ✭ 21 (-81.58%)
Mutual labels:  spark, lucene
Spark On K8s Operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Stars: ✭ 1,780 (+1461.4%)
Mutual labels:  spark
Spark Summit 2017 Sanfrancisco
spark summit 2017 SanFrancisco
Stars: ✭ 93 (-18.42%)
Mutual labels:  spark
Udacity Data Engineering
Udacity Data Engineering Nano Degree (DEND)
Stars: ✭ 89 (-21.93%)
Mutual labels:  spark
Spark python ml examples
Spark 2.0 Python Machine Learning examples
Stars: ✭ 87 (-23.68%)
Mutual labels:  spark
Bigdataclass
Two-day workshop that covers how to use R to interact databases and Spark
Stars: ✭ 110 (-3.51%)
Mutual labels:  spark
Spark Ffm
FFM (Field-Awared Factorization Machine) on Spark
Stars: ✭ 101 (-11.4%)
Mutual labels:  spark
Flint
Webex Bot SDK for Node.js (deprecated in favor of https://github.com/webex/webex-bot-node-framework)
Stars: ✭ 85 (-25.44%)
Mutual labels:  spark
Blog
基于SpringBoot+Thymeleaf+Mybatis+LayUi+Lucene的粗糙个人博客
Stars: ✭ 95 (-16.67%)
Mutual labels:  lucene
Seldon Server
Machine Learning Platform and Recommendation Engine built on Kubernetes
Stars: ✭ 1,435 (+1158.77%)
Mutual labels:  spark
Spark On Kubernetes Helm
Spark on Kubernetes infrastructure Helm charts repo
Stars: ✭ 92 (-19.3%)
Mutual labels:  spark
Waterdrop
Production Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+1528.07%)
Mutual labels:  spark
Ammonite Spark
Run spark calculations from Ammonite
Stars: ✭ 88 (-22.81%)
Mutual labels:  spark
Splash
Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (-7.89%)
Mutual labels:  spark
Laravel Spark Google2fa
Google Authenticator support for Laravel Spark
Stars: ✭ 86 (-24.56%)
Mutual labels:  spark
Python Bigdata
Data science and Big Data with Python
Stars: ✭ 112 (-1.75%)
Mutual labels:  spark
Springboot Templates
springboot和dubbo、netty的集成,redis mongodb的nosql模板, kafka rocketmq rabbit的MQ模板, solr solrcloud elasticsearch查询引擎
Stars: ✭ 100 (-12.28%)
Mutual labels:  lucene
Spark States
Custom state store providers for Apache Spark
Stars: ✭ 83 (-27.19%)
Mutual labels:  spark
Hadoop cookbook
Cookbook to install Hadoop 2.0+ using Chef
Stars: ✭ 82 (-28.07%)
Mutual labels:  spark
Mleap
MLeap: Deploy ML Pipelines to Production
Stars: ✭ 1,232 (+980.7%)
Mutual labels:  spark
Distributed Dataset
A distributed data processing framework in Haskell.
Stars: ✭ 108 (-5.26%)
Mutual labels:  spark
Almond
A Scala kernel for Jupyter
Stars: ✭ 1,354 (+1087.72%)
Mutual labels:  spark
Spark Gbtlr
Hybrid model of Gradient Boosting Trees and Logistic Regression (GBDT+LR) on Spark
Stars: ✭ 81 (-28.95%)
Mutual labels:  spark
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+1073.68%)
Mutual labels:  spark
Flink Learning
flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
Stars: ✭ 11,378 (+9880.7%)
Mutual labels:  spark
Repository
个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。
Stars: ✭ 92 (-19.3%)
Mutual labels:  spark
Lambda Arch
Applying Lambda Architecture with Spark, Kafka, and Cassandra.
Stars: ✭ 111 (-2.63%)
Mutual labels:  spark
Big Data
🔧 Use dplyr to analyze Big Data 🐘
Stars: ✭ 93 (-18.42%)
Mutual labels:  spark
Logigsk
A Linux based software package to control led's on Logitech G910, G810, G610 and G410.
Stars: ✭ 107 (-6.14%)
Mutual labels:  spark
Fingerprints
Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.
Stars: ✭ 91 (-20.18%)
Mutual labels:  deduplication
Ik Analyzer
支持Lucene5/6/7/8+版本, 长期维护。
Stars: ✭ 112 (-1.75%)
Mutual labels:  lucene
Jeeplatform
一款企业信息化开发基础平台,拟集成OA(办公自动化)、CMS(内容管理系统)等企业系统的通用业务功能 JeePlatform项目是一款以SpringBoot为核心框架,集ORM框架Mybatis,Web层框架SpringMVC和多种开源组件框架而成的一款通用基础平台,代码已经捐赠给开源中国社区
Stars: ✭ 1,285 (+1027.19%)
Mutual labels:  lucene
Sparktutorial
Source code for James Lee's Aparch Spark with Java course
Stars: ✭ 105 (-7.89%)
Mutual labels:  spark
Spark Nlp Models
Models and Pipelines for the Spark NLP library
Stars: ✭ 88 (-22.81%)
Mutual labels:  spark
Java learning practice
java 进阶之路:面试高频算法、akka、多线程、NIO、Netty、SpringBoot、Spark&&Flink 等
Stars: ✭ 110 (-3.51%)
Mutual labels:  spark
Solrplugins
Dice Solr Plugins from Simon Hughes Dice.com
Stars: ✭ 86 (-24.56%)
Mutual labels:  lucene
Spark Terasort
Spark Terasort
Stars: ✭ 101 (-11.4%)
Mutual labels:  spark
Cuesheet
A framework for writing Spark 2.x applications in a pretty way
Stars: ✭ 86 (-24.56%)
Mutual labels:  spark
Spark Mllib Twitter Sentiment Analysis
🌟 ✨ Analyze and visualize Twitter Sentiment on a world map using Spark MLlib
Stars: ✭ 113 (-0.88%)
Mutual labels:  spark
Hops Examples
Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops
Stars: ✭ 84 (-26.32%)
Mutual labels:  spark
Bigdata Notebook
Stars: ✭ 100 (-12.28%)
Mutual labels:  spark
Smartstore
Open Source ASP.NET Core Enterprise eCommerce Shopping Cart Solution
Stars: ✭ 82 (-28.07%)
Mutual labels:  lucene
Parquet Index
Spark SQL index for Parquet tables
Stars: ✭ 109 (-4.39%)
Mutual labels:  spark
Spark Dependencies
Spark job for dependency links
Stars: ✭ 82 (-28.07%)
Mutual labels:  spark
Bigdata Notes
大数据入门指南 ⭐
Stars: ✭ 10,991 (+9541.23%)
Mutual labels:  spark
Lehar
Visualize data using relative ordering
Stars: ✭ 81 (-28.95%)
Mutual labels:  spark
Archivespark
An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
Stars: ✭ 111 (-2.63%)
Mutual labels:  spark
Straw
A platform for real-time streaming search
Stars: ✭ 98 (-14.04%)
Mutual labels:  lucene
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-30.7%)
Mutual labels:  spark
Docker Spark
🚢 Docker image for Apache Spark
Stars: ✭ 78 (-31.58%)
Mutual labels:  spark
Home
ApacheCN 开源组织:公告、介绍、成员、活动、交流方式
Stars: ✭ 1,199 (+951.75%)
Mutual labels:  spark
Pyspark Cheatsheet
🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-5.26%)
Mutual labels:  spark
Logisland
Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-14.91%)
Mutual labels:  spark
Laravel Lucene Search
Laravel 4.2, 5.* package for full-text search over Eloquent models based on ZF2 Lucene.
Stars: ✭ 75 (-34.21%)
Mutual labels:  lucene
Spark Website
Apache Spark Website
Stars: ✭ 75 (-34.21%)
Mutual labels:  spark
Schemer
Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Stars: ✭ 97 (-14.91%)
Mutual labels:  spark
Spring Shiro Spark
Spring-Shiro-Spark是Spring-Boot Hibernate Spark Spark-SQL Shiro iView VueJs... ...的集成尝试
Stars: ✭ 114 (+0%)
Mutual labels:  spark
1-60 of 520 similar projects