584 open source projects that are alternatives to, or similar to, Roaringbitmap

Bootplus
A permission management framework based on SpringBoot + Shiro + MyBatisPlus
Stars: ✭ 88 (-96.42%)
Mutual labels:  druid
Anserini
A Lucene toolkit for replicable information retrieval research
Stars: ✭ 573 (-76.71%)
Mutual labels:  lucene
Powderkeg
Live-coding the cluster!
Stars: ✭ 152 (-93.82%)
Mutual labels:  spark
Fess
Fess is very powerful and easily deployable Enterprise Search Server.
Stars: ✭ 561 (-77.2%)
Mutual labels:  lucene
Spark Nlp Models
Models and Pipelines for the Spark NLP library
Stars: ✭ 88 (-96.42%)
Mutual labels:  spark
Spark Daria
Essential Spark extensions and helper methods ✨😲
Stars: ✭ 553 (-77.52%)
Mutual labels:  spark
Pyroaringbitmap
An efficient and lightweight ordered set of 32-bit integers.
Stars: ✭ 128 (-94.8%)
Mutual labels:  bitset
Lopq
Training of Locally Optimized Product Quantization (LOPQ) models for approximate nearest neighbor search of high dimensional data in Python and Spark.
Stars: ✭ 530 (-78.46%)
Mutual labels:  spark
Solrplugins
Dice Solr Plugins from Simon Hughes Dice.com
Stars: ✭ 86 (-96.5%)
Mutual labels:  lucene
Cdap
An open source framework for building data analytic applications.
Stars: ✭ 509 (-79.31%)
Mutual labels:  spark
Sparkstreaming
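💥 🚀 Wraps Spark Streaming to adjust the batch time dynamically (computation runs whenever data is available); 🚀 supports adding and removing topics at runtime; 🚀 wraps sparkstreaming 1.6 - kafka 010 to support SSL.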
Stars: ✭ 179 (-92.72%)
Mutual labels:  spark
Pointblank
Data validation and organization of metadata for data frames and database tables
Stars: ✭ 480 (-80.49%)
Mutual labels:  spark
Cuesheet
A framework for writing Spark 2.x applications in a pretty way
Stars: ✭ 86 (-96.5%)
Mutual labels:  spark
Javaewah
A compressed alternative to the Java BitSet class
Stars: ✭ 474 (-80.73%)
Mutual labels:  bitset
Feast
Feature Store for Machine Learning
Stars: ✭ 2,576 (+4.72%)
Mutual labels:  spark
Bdp Dataplatform
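A data platform for big data ecosystem solutions: a big data solution built on big data, data platforms, microservices, machine learning, e-commerce, automated operations, DevOps, container deployment platforms, and data platform collection, storage, computation, development, and application building.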
Stars: ✭ 456 (-81.46%)
Mutual labels:  spark
Hops Examples
Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops
Stars: ✭ 84 (-96.59%)
Mutual labels:  spark
Spark Ml Source Analysis
An analysis of the principles behind Spark ML algorithms and of their concrete source code implementations
Stars: ✭ 1,873 (-23.86%)
Mutual labels:  spark
Flint
A Time Series Library for Apache Spark
Stars: ✭ 878 (-64.31%)
Mutual labels:  spark
Lucene Solr
Apache Lucene and Solr open-source search software
Stars: ✭ 4,217 (+71.42%)
Mutual labels:  lucene
Smartstore
Open Source ASP.NET Core Enterprise eCommerce Shopping Cart Solution
Stars: ✭ 82 (-96.67%)
Mutual labels:  lucene
Turnilo
Business intelligence, data exploration and visualization web application for Druid, formerly known as Swiv and Pivot
Stars: ✭ 427 (-82.64%)
Mutual labels:  druid
Openuba
A robust and flexible open source User & Entity Behavior Analytics (UEBA) framework used for Security Analytics. Developed with luv by Data Scientists & Security Analysts from the Cyber Security Industry. [PRE-ALPHA]
Stars: ✭ 127 (-94.84%)
Mutual labels:  spark
Dji Firmware Tools
Tools for handling firmwares of DJI products, with focus on quadcopters.
Stars: ✭ 424 (-82.76%)
Mutual labels:  spark
Spark Dependencies
Spark job for dependency links
Stars: ✭ 82 (-96.67%)
Mutual labels:  spark
Featran
A Scala feature transformation library for data science and machine learning
Stars: ✭ 420 (-82.93%)
Mutual labels:  spark
Eclipse Instasearch
Eclipse plug-in for fast code search
Stars: ✭ 165 (-93.29%)
Mutual labels:  lucene
Listenbrainz Server
Server for the ListenBrainz project
Stars: ✭ 420 (-82.93%)
Mutual labels:  spark
Hibitset
Hierarchical bit set container
Stars: ✭ 81 (-96.71%)
Mutual labels:  bitset
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (-83.21%)
Mutual labels:  spark
Cape Python
Collaborate on privacy-preserving policy for data science projects in Pandas and Apache Spark
Stars: ✭ 125 (-94.92%)
Mutual labels:  spark
Marmaray
Generic Data Ingestion & Dispersal Library for Hadoop
Stars: ✭ 414 (-83.17%)
Mutual labels:  spark
Spark Gbtlr
Hybrid model of Gradient Boosting Trees and Logistic Regression (GBDT+LR) on Spark
Stars: ✭ 81 (-96.71%)
Mutual labels:  spark
Devops Python Tools
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (-83.5%)
Mutual labels:  spark
Aztk
AZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure
Stars: ✭ 152 (-93.82%)
Mutual labels:  spark
Bitvec
A crate for managing memory bit by bit
Stars: ✭ 411 (-83.29%)
Mutual labels:  bitset
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-96.79%)
Mutual labels:  spark
Awesome Elasticsearch
A curated list of the most important and useful resources about elasticsearch: articles, videos, blogs, tips and tricks, use cases. All about Elasticsearch!
Stars: ✭ 4,168 (+69.43%)
Mutual labels:  lucene
Spark Bigquery Connector
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
Stars: ✭ 126 (-94.88%)
Mutual labels:  spark
Iceberg
Iceberg is a table format for large, slow-moving tabular data
Stars: ✭ 393 (-84.02%)
Mutual labels:  spark
Home
ApacheCN open source organization: announcements, introduction, members, events, and contact channels
Stars: ✭ 1,199 (-51.26%)
Mutual labels:  spark
Hibernate Search
Hibernate Search: full-text search for domain model
Stars: ✭ 382 (-84.47%)
Mutual labels:  lucene
Kraps Rpc
An RPC framework leveraging Spark RPC module
Stars: ✭ 175 (-92.89%)
Mutual labels:  spark
Redash
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Stars: ✭ 20,147 (+718.98%)
Mutual labels:  spark
Spark Website
Apache Spark Website
Stars: ✭ 75 (-96.95%)
Mutual labels:  spark
Bigdl
Building Large-Scale AI Applications for Distributed Big Data
Stars: ✭ 3,813 (+55%)
Mutual labels:  spark
Nimrod
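Nimrod - a rapid development framework for enterprise-grade monolithic Java Web platform applications, built on Spring Boot and suited to small and medium-sized projects. The technology stack includes Spring Boot, Spring, Spring Web MVC, MyBatis, Thymeleaf, and others. It follows the Alibaba Java development guidelines to help build good coding habits. The overall design uses RBAC (Role-Based Access Control), has a strict permission control module, and supports developing the system and its modules separately. Finally, we hope this project is helpful to you. Nimrod developer discussion group: 547252502 (QQ group)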
Stars: ✭ 125 (-94.92%)
Mutual labels:  druid
Wedatasphere
WeDataSphere is a financial-grade, one-stop open-source suite for big data platforms. The source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!
Stars: ✭ 372 (-84.88%)
Mutual labels:  spark
Ds Cheatsheets
List of Data Science Cheatsheets to rule the world
Stars: ✭ 9,452 (+284.23%)
Mutual labels:  spark
Sparkmeasure
This is the development repository of SparkMeasure, a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics data.
Stars: ✭ 368 (-85.04%)
Mutual labels:  spark
Cc Pyspark
Process Common Crawl data with Python and Spark
Stars: ✭ 147 (-94.02%)
Mutual labels:  spark
Logigsk
A Linux-based software package to control LEDs on Logitech G910, G810, G610 and G410.
Stars: ✭ 107 (-95.65%)
Mutual labels:  spark
Tedsds
Apache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Stars: ✭ 14 (-99.43%)
Mutual labels:  spark
Live log analyzer spark
Spark application for analyzing Apache access logs and detecting anomalies, with an accompanying Medium article.
Stars: ✭ 14 (-99.43%)
Mutual labels:  spark
Apache Spark Hands On
Educational notes and hands-on problems with solutions for the Hadoop ecosystem
Stars: ✭ 74 (-96.99%)
Mutual labels:  spark
Spark Streaming With Kafka
Self-contained examples of Apache Spark streaming integrated with Apache Kafka.
Stars: ✭ 180 (-92.68%)
Mutual labels:  spark
Seconds Kill
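A flash-sale (seckill) system based on Springboot + Redis + Kafka, using optimistic locking + caching + rate limiting + asynchronous processing; TPS optimized from 500 to 3000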
Stars: ✭ 180 (-92.68%)
Mutual labels:  druid
Xsql
Unified SQL Analytics Engine Based on SparkSQL
Stars: ✭ 176 (-92.85%)
Mutual labels:  spark
Transmogrifai
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (-15.28%)
Mutual labels:  spark
Handyspark
HandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (-93.58%)
Mutual labels:  spark