Awesome Bigdata
A curated list of awesome big data frameworks, resources and other awesomeness.
Stars: ✭ 10,478 (+395.65%)
Svm kernel
x86_64 AMD kernel optimized for performance & hypervisor usage
Stars: ✭ 32 (-98.49%)
Transpyle
HPC-oriented transpiler for C, C++, Cython, Fortran, OpenCL and Python.
Stars: ✭ 90 (-95.74%)
Wfl
A Simple Way of Creating Job Workflows in Go running in Processes, Containers, Tasks, Pods, or Jobs
Stars: ✭ 30 (-98.58%)
Lambda Arch
Applying Lambda Architecture with Spark, Kafka, and Cassandra.
Stars: ✭ 111 (-94.75%)
Aws Auto Terminate Idle Emr
AWS Auto Terminate Idle AWS EMR Clusters Framework is an AWS-based solution that uses AWS CloudWatch and an AWS Lambda function, written as a Python script with Boto3, to terminate AWS EMR clusters that have been idle for a specified period of time.
Stars: ✭ 21 (-99.01%)
Ignite Book Code Samples
All code samples, scripts and more in-depth examples for the book "High Performance in-memory computing with Apache Ignite". Please use the repository "the-apache-ignite-book" for Ignite version 2.6 or above.
Stars: ✭ 86 (-95.93%)
Simgrid
MIRROR of the SimGrid framework, for the simulation of distributed applications (Clouds, HPC, Grids, IoT and others). Most of the dev occurs on FramaGit.
Stars: ✭ 106 (-94.99%)
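Bigdata Interview
🎯 🌟 [Big data interview questions] A collection of big data interview questions gathered from around the web, together with the author's own answer summaries. Currently covers interview knowledge summaries for the Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper frameworks.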
Stars: ✭ 857 (-59.46%)
Training Material
A collection of code examples as well as presentations for training purposes
Stars: ✭ 85 (-95.98%)
Mobius
C# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (-56.05%)
Onemkl
oneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-94.23%)
Hadoop For Geoevent
ArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-99.76%)
Athena Cli
Presto-like CLI tool for AWS Athena
Stars: ✭ 85 (-95.98%)
Ondemand
Supercomputing. Seamlessly. Open, Interactive HPC Via the Web
Stars: ✭ 40 (-98.11%)
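Coding Now
Study notes, plus eBooks and video resources the author has worked through, and a collection of blogs, websites, and tools the author considers worthwhile. Covers the major big data components, Python machine learning and data analysis, Linux, operating systems, algorithms, networking, and more.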
Stars: ✭ 750 (-64.52%)
Uproot4
ROOT I/O in pure Python and NumPy.
Stars: ✭ 80 (-96.22%)
Spark Movie Lens
An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (-64.76%)
Futhark
💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (-22.37%)
Su2
SU2: An Open-Source Suite for Multiphysics Simulation and Design
Stars: ✭ 731 (-65.42%)
Wlm Operator
Singularity implementation of k8s operator for interacting with SLURM.
Stars: ✭ 78 (-96.31%)
Vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualize and explore big tabular data at a billion rows per second 🚀
Stars: ✭ 6,793 (+221.33%)
Splash
Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (-95.03%)
Mfem
Lightweight, general, scalable C++ library for finite element methods
Stars: ✭ 667 (-68.45%)
Cleanframes
type-class based data cleansing library for Apache Spark SQL
Stars: ✭ 75 (-96.45%)
Cromwell
Scientific workflow engine designed for simplicity & scalability. Trivially transition between one-off use cases and massive-scale production environments
Stars: ✭ 655 (-69.02%)
Core
parallel finite element unstructured meshes
Stars: ✭ 124 (-94.13%)
Bigartm
Fast topic modeling platform
Stars: ✭ 563 (-73.37%)
Apache Spark Hands On
Educational notes and hands-on problems with solutions for the Hadoop ecosystem
Stars: ✭ 74 (-96.5%)
Easylambda
distributed dataflows with functional list operations for data processing with C++14
Stars: ✭ 475 (-77.53%)
Flux Core
core services for the Flux resource management framework
Stars: ✭ 73 (-96.55%)
Bigslice
A serverless cluster computing system for the Go programming language
Stars: ✭ 469 (-77.81%)
Flinkstreamsql
Extends the real-time SQL of open-source Flink; mainly implements joins between streams and dimension tables, and supports all native Flink SQL syntax
Stars: ✭ 1,682 (-20.44%)
Bigdataie
A compilation of big data blogs, written-test questions, tutorials, projects, and interview experiences
Stars: ✭ 445 (-78.95%)
Parenchyma
An extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-96.64%)
Circosjs
d3 library to build circular graphs
Stars: ✭ 436 (-79.38%)
Singularity Cri
The Singularity implementation of the Kubernetes Container Runtime Interface
Stars: ✭ 97 (-95.41%)
Cortx
CORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (-79.85%)
Countly Sdk Cordova
Countly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-96.74%)
Armadillo Code
Armadillo: fast C++ library for linear algebra & scientific computing - http://arma.sourceforge.net
Stars: ✭ 388 (-81.65%)
Liteflow
liteflow is a distributed task-flow scheduling system built around task versions
Stars: ✭ 112 (-94.7%)
Arrayfire Python
Python bindings for ArrayFire: A general purpose GPU library.
Stars: ✭ 358 (-83.07%)
Geopm
Global Extensible Open Power Manager
Stars: ✭ 57 (-97.3%)
Datawave
DataWave is an ingest/query framework that leverages Apache Accumulo to provide fast, secure data access.
Stars: ✭ 347 (-83.59%)
Nextflow
A DSL for data-driven computational pipelines
Stars: ✭ 1,337 (-36.75%)
Api.rss
RSS as RESTful. This service allows you to transform an RSS feed into an awesome API.
Stars: ✭ 340 (-83.92%)
Cbrain
CBRAIN is a flexible Ruby on Rails framework for accessing and processing large data on high-performance computing infrastructures.
Stars: ✭ 51 (-97.59%)
Arrayfire
ArrayFire: a general purpose GPU library.
Stars: ✭ 3,693 (+74.69%)
Pymapd
Python client for OmniSci GPU-accelerated SQL engine and analytics platform
Stars: ✭ 109 (-94.84%)
Jean Zay Doc
Collaborative documentation for and from Jean Zay users. Official Jean Zay documentation is here: http://www.idris.fr/eng/jean-zay/
Stars: ✭ 45 (-97.87%)
Hadoopcryptoledger
Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Stars: ✭ 126 (-94.04%)
Seissol
A scientific software for the numerical simulation of seismic wave phenomena and earthquake dynamics
Stars: ✭ 123 (-94.18%)
Genie
Distributed Big Data Orchestration Service
Stars: ✭ 1,544 (-26.96%)
Daudit
🌲 Configuration flaws detector for Hadoop, MongoDB, MySQL, and more!
Stars: ✭ 108 (-94.89%)
Off
OFF, Open source Finite volume Fluid dynamics code
Stars: ✭ 93 (-95.6%)
Reddit sse stream
A Server Side Event stream to deliver Reddit comments and submissions in near real-time to a client.
Stars: ✭ 39 (-98.16%)