472 open source projects that are alternatives to or similar to awesome-bigdata

A project to help people stand in line at the market as little as possible

Stars: ✭ 95 (-99.14%)

Mutual labels: bigdata

🦖 Streaming-Serverless Framework for Low-latency Edge Computing applications, running atop QUIC protocol, engaging 5G technology.

Stars: ✭ 279 (-97.48%)

Mutual labels: stream-processing

zdh web

Big data collection and extraction platform

Stars: ✭ 292 (-97.37%)

Mutual labels: bigdata

Flink Sql Cookbook

The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are completely self-contained and can be run in Ververica Platform as is.

Stars: ✭ 189 (-98.3%)

Mutual labels: stream-processing

Biglasso

biglasso: Extending Lasso Model Fitting to Big Data in R

Stars: ✭ 87 (-99.22%)

Mutual labels: bigdata

kerala

Distributed KV Streams

Stars: ✭ 16 (-99.86%)

Mutual labels: stream-processing

Clustering4Ever

C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.

Stars: ✭ 126 (-98.86%)

Mutual labels: bigdata

Akka-Streams-custom-stream-processing-examples

Demos of how to do custom stream processing using the Akka Streams GraphStages API

Stars: ✭ 13 (-99.88%)

Mutual labels: stream-processing

Bigdata File Viewer

A cross-platform (Windows, Mac, Linux) desktop application for viewing common big data binary formats such as Parquet, ORC, and Avro. Supports the local file system, HDFS, AWS S3, Azure Blob Storage, etc.

Stars: ✭ 86 (-99.22%)

Mutual labels: bigdata

flink-connectors

Apache Flink connectors for Pravega.

Stars: ✭ 84 (-99.24%)

Mutual labels: stream-processing

plexus

Plexus - Interactive Emotion Visualization based on Social Media

Stars: ✭ 27 (-99.76%)

Mutual labels: visualize-data

Javainterview

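The most complete collection of Java technical knowledge points, plus Java source code analysis. Doing my part to contribute to open source.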

Stars: ✭ 154 (-98.61%)

Mutual labels: bigdata

Media Stream Library Js

JavaScript library to handle media streams on the command line (Node.js) and in the browser.

Stars: ✭ 192 (-98.27%)

Mutual labels: stream-processing

ottla

An opinionated Clojure framework for writing Kafka machines

Stars: ✭ 14 (-99.87%)

Mutual labels: stream-processing

Athena Cli

Presto-like CLI tool for AWS Athena

Stars: ✭ 85 (-99.23%)

Mutual labels: bigdata

PersonNotes

A personal notes hub: technical notes recorded in a quick-and-dirty style .. 📚☕️⌨️🎧

Stars: ✭ 61 (-99.45%)

Mutual labels: bigdata

stream-registry

Stream Discovery and Stream Orchestration

Stars: ✭ 105 (-99.05%)

Mutual labels: stream-processing

Uproot4

ROOT I/O in pure Python and NumPy.

Stars: ✭ 80 (-99.28%)

Mutual labels: bigdata

storm-ml

An online learning algorithm library for Storm

Stars: ✭ 18 (-99.84%)

Mutual labels: stream-processing

cdc

A library for performing Content-Defined Chunking (CDC) on data streams.

Stars: ✭ 18 (-99.84%)

Mutual labels: data-stream

Apache Spark Hands On

Educational notes and hands-on problems with solutions for the Hadoop ecosystem

Stars: ✭ 74 (-99.33%)

Mutual labels: bigdata

talaria

TalariaDB is a distributed, highly available, and low latency time-series database for Presto

Stars: ✭ 148 (-98.67%)

Mutual labels: stream-processing

intersect

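Thoughts on an interview question: finding the intersection of 60 million data packets and 3 million data packets in an environment with 50M of memory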

Stars: ✭ 54 (-99.51%)

Mutual labels: bigdata

Gspread Pandas

A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.

Stars: ✭ 226 (-97.96%)

Mutual labels: data-analytics

Logrange

High performance data aggregating storage

Stars: ✭ 181 (-98.37%)

Mutual labels: stream-processing

Hstream

The streaming database built for IoT data storage and real-time processing in the 5G Era

Stars: ✭ 166 (-98.5%)

Mutual labels: stream-processing

Koolreport

This is an Open Source PHP Reporting Framework which you can use to write perfect data reports or to construct awesome dashboards using PHP

Stars: ✭ 204 (-98.16%)

Mutual labels: data-analytics

go-rivers

Collection of stream processing / multiplexing / networking libs in Go

Stars: ✭ 35 (-99.68%)

Mutual labels: stream-processing

Ranalyticshhe

Repository for Online Classes

Stars: ✭ 183 (-98.35%)

Mutual labels: data-analytics

Optimus

🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark

Stars: ✭ 986 (-91.11%)

Mutual labels: bigdata

Countly Sdk Web

Countly Product Analytics SDK for websites and web applications

Stars: ✭ 165 (-98.51%)

Mutual labels: data-analytics

datapackage-m

Power Query M functions for working with Tabular Data Packages (Frictionless Data) in Power BI and Excel

Stars: ✭ 26 (-99.77%)

Mutual labels: data-analytics

Big Dipper

A block explorer for Cosmos

Stars: ✭ 119 (-98.93%)

Mutual labels: data-analytics

Aws Auto Terminate Idle Emr

AWS Auto Terminate Idle AWS EMR Clusters Framework is an AWS-based solution that uses AWS CloudWatch and AWS Lambda, with a Python script built on Boto3, to terminate AWS EMR clusters that have been idle for a specified period of time.

Stars: ✭ 21 (-99.81%)

Mutual labels: bigdata

Superset

Apache Superset is a Data Visualization and Data Exploration Platform

Stars: ✭ 42,634 (+284.33%)

Mutual labels: data-analytics

jhdf

A pure Java HDF5 library

Stars: ✭ 83 (-99.25%)

Mutual labels: bigdata

Danfojs

danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.

Stars: ✭ 1,304 (-88.24%)

Mutual labels: data-analytics

Spark Streaming Monitoring With Lightning

Plot live stats as graphs from an Apache Spark application using Lightning-viz

Stars: ✭ 15 (-99.86%)

Mutual labels: bigdata

Basketball analytics

Repository containing various scripts and other work with basketball statistics

Stars: ✭ 88 (-99.21%)

Mutual labels: data-analytics

young-examples

Sample code for typical application scenarios encountered in Java learning and projects

Stars: ✭ 21 (-99.81%)

Mutual labels: bigdata

Mobius

C# and F# language binding and extensions to Apache Spark

Stars: ✭ 929 (-91.63%)

Mutual labels: bigdata

Trck

Query engine for TrailDB

Stars: ✭ 48 (-99.57%)

Mutual labels: data-analytics

kafka-workers

Kafka Workers is a client library which unifies records consuming from Kafka and processing them by user-defined WorkerTasks.

Stars: ✭ 30 (-99.73%)

Mutual labels: stream-processing

Arcon

Runtime for Writing Streaming Applications in Rust.

Stars: ✭ 44 (-99.6%)

Mutual labels: data-analytics

Hadoop For Geoevent

ArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.

Stars: ✭ 5 (-99.95%)

Mutual labels: bigdata

Insights

Open Source Self-Hosted Business Intelligence Platform

Stars: ✭ 917 (-91.73%)

Mutual labels: data-analytics

twitter-archive-reader

Full featured TypeScript Twitter archive reader and browser

Stars: ✭ 43 (-99.61%)

Mutual labels: bigdata

Datasheets

Read data from, write data to, and modify the formatting of Google Sheets

Stars: ✭ 593 (-94.65%)

Mutual labels: data-analytics

Kube Batch

A batch scheduler for Kubernetes for high-performance workloads, e.g. AI/ML, BigData, HPC

Stars: ✭ 804 (-92.75%)

Mutual labels: bigdata

Dataanalysisinaction

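(Completed) Detailed notes for the Geek Time course "Data Analysis in Action: 45 Lectures", including Markdown, images, mind maps, code, and data. The code can be read and tested directly!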

Stars: ✭ 482 (-95.65%)

Mutual labels: data-analytics

awesome-bigquery-views

Useful SQL queries for Blockchain ETL datasets in BigQuery.

Stars: ✭ 325 (-97.07%)

Mutual labels: data-analytics

Fero

light, fast, scalable, streaming microservices made easy

Stars: ✭ 175 (-98.42%)

Mutual labels: stream-processing

gostream

Stream Processing Library for Go

Stars: ✭ 51 (-99.54%)

Mutual labels: stream-processing

cnosdb

An Open Source Distributed Time Series Database with high performance, high compression ratio and high usability.

Stars: ✭ 858 (-92.27%)

Mutual labels: distributed-database

ramen

A stream processing language and compiler for small-scale monitoring

Stars: ✭ 14 (-99.87%)

Mutual labels: stream-processing

learning-spark

Tidy up Spark and Hadoop tutorials.

Stars: ✭ 28 (-99.75%)

Mutual labels: bigdata

anovos

Anovos - An Open Source Library for Scalable feature engineering Using Apache-Spark

Stars: ✭ 77 (-99.31%)

Mutual labels: bigdata

download-using-streaming-response-body

An example for streaming large files in chunks using StreamingResponseBody in Spring MVC

Stars: ✭ 62 (-99.44%)

Mutual labels: streaming-data

Athenacli

AthenaCLI is a CLI tool for AWS Athena service that can do auto-completion and syntax highlighting.

Stars: ✭ 151 (-98.64%)

Mutual labels: bigdata

Akka Stream Contrib

Add-ons to Akka Stream

Stars: ✭ 173 (-98.44%)

Mutual labels: stream-processing

301-360 of 472 similar projects
