All Projects → Thrill → Similar Projects or Alternatives

525 Open source projects that are alternatives of or similar to Thrill

Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-71.59%)
Mutual labels:  big-data, distributed-computing
Hazelcast
Open-source distributed computation and storage platform
Stars: ✭ 4,662 (+782.95%)
Mutual labels:  big-data, distributed-computing
pyspark-algorithms
PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Stars: ✭ 72 (-86.36%)
Mutual labels:  big-data, distributed-computing
Selinon
An advanced distributed task flow management on top of Celery
Stars: ✭ 237 (-55.11%)
Mutual labels:  big-data, distributed-computing
Moosefs
MooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)
Stars: ✭ 1,025 (+94.13%)
Mutual labels:  big-data, distributed-computing
Metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (-31.63%)
Mutual labels:  big-data, distributed-computing
Nakedtensor
Bare bone examples of machine learning in TensorFlow
Stars: ✭ 2,443 (+362.69%)
Mutual labels:  big-data, distributed-computing
Geni
A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-71.21%)
Mutual labels:  big-data, distributed-computing
dislib
The Distributed Computing library for python implemented using PyCOMPSs programming model for HPC.
Stars: ✭ 39 (-92.61%)
Mutual labels:  big-data, distributed-computing
nebula
A distributed block-based data storage and compute engine
Stars: ✭ 127 (-75.95%)
Mutual labels:  big-data, distributed-computing
Sylph
Stream computing platform for bigdata
Stars: ✭ 362 (-31.44%)
Mutual labels:  big-data
Protoactor Go
Proto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin
Stars: ✭ 3,934 (+645.08%)
Mutual labels:  distributed-computing
Circosjs
d3 library to build circular graphs
Stars: ✭ 436 (-17.42%)
Mutual labels:  big-data
Stream Framework
Stream Framework is a Python library, which allows you to build news feed, activity streams and notification systems using Cassandra and/or Redis. The authors of Stream-Framework also provide a cloud service for feed technology:
Stars: ✭ 4,576 (+766.67%)
Mutual labels:  big-data
Bigtop
Mirror of Apache Bigtop
Stars: ✭ 356 (-32.58%)
Mutual labels:  big-data
Listenbrainz Server
Server for the ListenBrainz project
Stars: ✭ 420 (-20.45%)
Mutual labels:  big-data
Devops Roadmap
DevOps methodology & roadmap for a devops developer in 2019. Interesting books to learn new technologies.
Stars: ✭ 349 (-33.9%)
Mutual labels:  big-data
Stroom
Stroom is a highly scalable data storage, processing and analysis platform.
Stars: ✭ 344 (-34.85%)
Mutual labels:  big-data
Opendata.cern.ch
Source code for the CERN Open Data portal
Stars: ✭ 411 (-22.16%)
Mutual labels:  big-data
Ozone
Scalable, redundant, and distributed object store for Apache Hadoop
Stars: ✭ 330 (-37.5%)
Mutual labels:  big-data
Paracel
Distributed training framework with parameter server
Stars: ✭ 335 (-36.55%)
Mutual labels:  distributed-computing
Onlinestats.jl
Single-pass algorithms for statistics
Stars: ✭ 507 (-3.98%)
Mutual labels:  big-data
Easylambda
distributed dataflows with functional list operations for data processing with C++14
Stars: ✭ 475 (-10.04%)
Mutual labels:  distributed-computing
Mockneat
MockNeat is a Java 8+ library that facilitates the generation of arbitrary data for your applications.
Stars: ✭ 410 (-22.35%)
Mutual labels:  big-data
Platon Go
Golang implementation of the PlatON protocol
Stars: ✭ 331 (-37.31%)
Mutual labels:  distributed-computing
Datafuse
Datafuse is a free Cloud-Native Analytics DBMS(Inspired by ClickHouse) implemented in Rust
Stars: ✭ 327 (-38.07%)
Mutual labels:  distributed-computing
Couler
Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apache Airflow.
Stars: ✭ 405 (-23.3%)
Mutual labels:  distributed-computing
Tez
Apache Tez
Stars: ✭ 313 (-40.72%)
Mutual labels:  big-data
Data Science Ipython Notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+4075.76%)
Mutual labels:  big-data
Sparkler
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Stars: ✭ 362 (-31.44%)
Mutual labels:  big-data
Pgm Index
🏅State-of-the-art learned data structure that enables fast lookup, predecessor, range searches and updates in arrays of billions of items using orders of magnitude less space than traditional indexes
Stars: ✭ 499 (-5.49%)
Mutual labels:  big-data
Diplomat
A HTTP Ruby API for Consul
Stars: ✭ 358 (-32.2%)
Mutual labels:  distributed-computing
Cortx
CORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (-19.32%)
Mutual labels:  big-data
Vespa
The open big data serving engine. https://vespa.ai
Stars: ✭ 3,747 (+609.66%)
Mutual labels:  big-data
Awesome Distributed Systems
Awesome list of distributed systems resources
Stars: ✭ 512 (-3.03%)
Mutual labels:  distributed-computing
Attic Apex Core
Mirror of Apache Apex core
Stars: ✭ 346 (-34.47%)
Mutual labels:  big-data
Datascience Ai Machinelearning Resources
Alex Castrounis' curated set of resources for artificial intelligence (AI), machine learning, data science, internet of things (IoT), and more.
Stars: ✭ 414 (-21.59%)
Mutual labels:  big-data
Parquet Cpp
Apache Parquet
Stars: ✭ 339 (-35.8%)
Mutual labels:  big-data
Fit Sne
Fast Fourier Transform-accelerated Interpolation-based t-SNE (FIt-SNE)
Stars: ✭ 485 (-8.14%)
Mutual labels:  big-data
Grouparoo
🦘 The Grouparoo Monorepo - open source customer data sync framework
Stars: ✭ 334 (-36.74%)
Mutual labels:  big-data
Cogcomp Nlp
CogComp's Natural Language Processing libraries and Demos:
Stars: ✭ 410 (-22.35%)
Mutual labels:  big-data
Beeva Best Practices
Best Practices and Style Guides in BEEVA
Stars: ✭ 335 (-36.55%)
Mutual labels:  big-data
Arkime
Arkime (formerly Moloch) is an open source, large scale, full packet capturing, indexing, and database system.
Stars: ✭ 4,994 (+845.83%)
Mutual labels:  big-data
Sleuth
A Go library for master-less peer-to-peer autodiscovery and RPC between HTTP services
Stars: ✭ 331 (-37.31%)
Mutual labels:  distributed-computing
Decentralized Internet
A SDK/library for decentralized web and distributing computing projects
Stars: ✭ 406 (-23.11%)
Mutual labels:  big-data
Fishnet
Distributed Stockfish analysis for lichess.org
Stars: ✭ 306 (-42.05%)
Mutual labels:  distributed-computing
Redislite
Redis in a python module.
Stars: ✭ 464 (-12.12%)
Mutual labels:  big-data
Awesome Federated Computing
📚 👓 A collection of research papers, codes, tutorials and blogs on Federated Computing/Learning.
Stars: ✭ 314 (-40.53%)
Mutual labels:  distributed-computing
Kafka Connect Hdfs
Kafka Connect HDFS connector
Stars: ✭ 400 (-24.24%)
Mutual labels:  big-data
Uproot3
ROOT I/O in pure Python and NumPy.
Stars: ✭ 312 (-40.91%)
Mutual labels:  big-data
Magellan
Geo Spatial Data Analytics on Spark
Stars: ✭ 507 (-3.98%)
Mutual labels:  big-data
Delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Stars: ✭ 3,903 (+639.2%)
Mutual labels:  big-data
Orc
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
Stars: ✭ 389 (-26.33%)
Mutual labels:  big-data
Mist
Serverless proxy for Spark cluster
Stars: ✭ 309 (-41.48%)
Mutual labels:  big-data
Fluid
Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud
Stars: ✭ 265 (-49.81%)
Mutual labels:  big-data
Ignite
Apache Ignite
Stars: ✭ 4,027 (+662.69%)
Mutual labels:  big-data
Helix
Mirror of Apache Helix
Stars: ✭ 304 (-42.42%)
Mutual labels:  big-data
Morpheus
Morpheus brings the leading graph query language, Cypher, onto the leading distributed processing platform, Spark.
Stars: ✭ 303 (-42.61%)
Mutual labels:  big-data
Courses
Quiz & Assignment of Coursera
Stars: ✭ 454 (-14.02%)
Mutual labels:  big-data
Bigdl
Building Large-Scale AI Applications for Distributed Big Data
Stars: ✭ 3,813 (+622.16%)
Mutual labels:  big-data
1-60 of 525 similar projects