🏅State-of-the-art learned data structure that enables fast lookup, predecessor, range searches and updates in arrays of billions of items using orders of magnitude less space than traditional indexes

Stars: ✭ 499 (+159.9%)

Mutual labels: big-data

Bigdata Playground

A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL

Stars: ✭ 177 (-7.81%)

Mutual labels: big-data

Fit Sne

Fast Fourier Transform-accelerated Interpolation-based t-SNE (FIt-SNE)

Stars: ✭ 485 (+152.6%)

Mutual labels: big-data

Setl

A simple Spark-powered ETL framework that just works 🍺

Stars: ✭ 79 (-58.85%)

Mutual labels: big-data

Hazelcast

Open-source distributed computation and storage platform

Stars: ✭ 4,662 (+2328.13%)

Mutual labels: big-data

Richdem

High-performance Terrain and Hydrology Analysis

Stars: ✭ 127 (-33.85%)

Mutual labels: big-data

Conjure Up

Deploying complex solutions, magically.

Stars: ✭ 454 (+136.46%)

Mutual labels: big-data

Spark Website

Apache Spark Website

Stars: ✭ 75 (-60.94%)

Mutual labels: big-data

Circosjs

d3 library to build circular graphs

Stars: ✭ 436 (+127.08%)

Mutual labels: big-data

Metamodel

Mirror of Apache Metamodel

Stars: ✭ 143 (-25.52%)

Mutual labels: big-data

Listenbrainz Server

Server for the ListenBrainz project

Stars: ✭ 420 (+118.75%)

Mutual labels: big-data

Labs

Research on distributed system

Stars: ✭ 73 (-61.98%)

Mutual labels: big-data

Opendata.cern.ch

Source code for the CERN Open Data portal

Stars: ✭ 411 (+114.06%)

Mutual labels: big-data

Hazelcast Nodejs Client

Hazelcast IMDG Node.js Client

Stars: ✭ 124 (-35.42%)

Mutual labels: big-data

Mockneat

MockNeat is a Java 8+ library that facilitates the generation of arbitrary data for your applications.

Stars: ✭ 410 (+113.54%)

Mutual labels: big-data

My Journey In The Data Science World

📢 Ready to learn or review your knowledge!

Stars: ✭ 1,175 (+511.98%)

Mutual labels: big-data

Kafka Connect Hdfs

Kafka Connect HDFS connector

Stars: ✭ 400 (+108.33%)

Mutual labels: big-data

Fluo

Apache Fluo

Stars: ✭ 159 (-17.19%)

Mutual labels: big-data

Ignite

Apache Ignite

Stars: ✭ 4,027 (+1997.4%)

Mutual labels: big-data

Appdocs

Application Performance Optimization Summary

Stars: ✭ 1,169 (+508.85%)

Mutual labels: big-data

Hive

Apache Hive

Stars: ✭ 4,031 (+1999.48%)

Mutual labels: big-data

Scala Spark Tutorial

Project for James' Apache Spark with Scala course

Stars: ✭ 121 (-36.98%)

Mutual labels: big-data

Carbondata

Mirror of Apache CarbonData

Stars: ✭ 1,158 (+503.13%)

Mutual labels: big-data

Gun

An open source cybersecurity protocol for syncing decentralized graph data.

Stars: ✭ 15,172 (+7802.08%)

Mutual labels: big-data

Flume

Mirror of Apache Flume

Stars: ✭ 2,200 (+1045.83%)

Mutual labels: big-data

Attic Predictionio

PredictionIO, a machine learning server for developers and ML engineers.

Stars: ✭ 12,522 (+6421.88%)

Mutual labels: big-data

Fili

Easily make RESTful web services for time series reporting with Big Data analytics engines like Druid and SQL Databases.

Stars: ✭ 151 (-21.35%)

Mutual labels: big-data

Open Source Handbook

⭐️ Open source projects for all skill levels

Stars: ✭ 131 (-31.77%)

Mutual labels: big-data

Samza Hello Samza

Mirror of Apache Samza

Stars: ✭ 99 (-48.44%)

Mutual labels: big-data

Dataflowjavasdk

Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.

Stars: ✭ 854 (+344.79%)

Mutual labels: big-data

121-180 of 369 similar projects

first

‹

›