All Projects → dcfjs → dcf

dcfjs / dcf

Licence: other
Yet another distributed compute framework

Programming Languages

typescript
32286 projects
javascript
184084 projects - #8 most used programming language

Projects that are alternatives of or similar to dcf

meesee
Task queue, Long lived workers for work based parallelization, with processes and Redis as back-end. For distributed computing.
Stars: ✭ 14 (-70.83%)
Mutual labels:  distributed-computing
microcore
.NET Core framework for inter-service communication
Stars: ✭ 24 (-50%)
Mutual labels:  distributed-computing
ParallelUtilities.jl
Fast and easy parallel mapreduce on HPC clusters
Stars: ✭ 28 (-41.67%)
Mutual labels:  distributed-computing
pyspark-algorithms
PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Stars: ✭ 72 (+50%)
Mutual labels:  distributed-computing
SlurmClusterManager.jl
julia package for running code on slurm clusters
Stars: ✭ 27 (-43.75%)
Mutual labels:  distributed-computing
protoactor-python
Proto Actor - Ultra fast distributed actors
Stars: ✭ 78 (+62.5%)
Mutual labels:  distributed-computing
paleo
An analytical performance modeling tool for deep neural networks.
Stars: ✭ 76 (+58.33%)
Mutual labels:  distributed-computing
hydra-hpp
Hydra Hot Potato Player (game)
Stars: ✭ 12 (-75%)
Mutual labels:  distributed-computing
python-json-socket
JSON messaging based socket interface with multi-threaded server and client
Stars: ✭ 52 (+8.33%)
Mutual labels:  distributed-computing
raven-distribution-framework
Decentralized Computing Backend for Artificial Intelligence, Web3, Metaverse, and Gaming Application
Stars: ✭ 31 (-35.42%)
Mutual labels:  distributed-computing
open-stream-processing-benchmark
This repository contains the code base for the Open Stream Processing Benchmark.
Stars: ✭ 37 (-22.92%)
Mutual labels:  distributed-computing
tasq
A simple task queue implementation to enqeue jobs on local or remote processes.
Stars: ✭ 83 (+72.92%)
Mutual labels:  distributed-computing
ShadowClone
Unleash the power of cloud
Stars: ✭ 224 (+366.67%)
Mutual labels:  distributed-computing
SANSA-Stack
Big Data RDF Processing and Analytics Stack built on Apache Spark and Apache Jena http://sansa-stack.github.io/SANSA-Stack/
Stars: ✭ 130 (+170.83%)
Mutual labels:  distributed-computing
gordo
An API-first distributed deployment system of deep learning models using timeseries data to predict the behaviour of systems
Stars: ✭ 25 (-47.92%)
Mutual labels:  distributed-computing
dislib
The Distributed Computing library for python implemented using PyCOMPSs programming model for HPC.
Stars: ✭ 39 (-18.75%)
Mutual labels:  distributed-computing
wrench
WRENCH: Cyberinfrastructure Simulation Workbench
Stars: ✭ 25 (-47.92%)
Mutual labels:  distributed-computing
pycondor
Build and submit workflows to HTCondor in Python
Stars: ✭ 23 (-52.08%)
Mutual labels:  distributed-computing
machinaris
An easy-to-use WebUI for crypto plotting and farming. Offers Plotman, MadMax, Chiadog, Bladebit, Farmr, and Forktools in a Docker container. Supports Chia, MMX, Chives, Flax, HDDCoin, and BPX among others.
Stars: ✭ 324 (+575%)
Mutual labels:  distributed-computing
rce
Distributed, workflow-driven integration environment
Stars: ✭ 42 (-12.5%)
Mutual labels:  distributed-computing

Distributed Computing Framework for Node.js

Early development stage: this project was still under early development, many necessery feature was not done yet, use it on your own risk.

Document

API Reference

A node.js version of Spark, without hadoop or jvm.

You should read tutorial first, then you can learn Spark but use this project instead.

Async API & deferred API

Any api that requires a RDD and generate a result is async, like count, take, max ... Any api that creates a RDD is deferred API, which is not async, so you can chain them like this:

await dcc
  .parallelize([1, 2, 3, 4, 5, 6, 7, 8, 9, 10])
  .map(v => v + 1)
  .filter(v => v % 2 === 0)
  .take(10); // take is not deferred api but async

Milestones

0.1.x: Basic

  • local master.
  • rdd & partition creation & release.
  • map & reduce
  • repartition & reduceByKey
  • disk storage partitions
  • cache
  • file loader & saver
  • export module to npm
  • decompresser & compresser
  • use debug module for information/error
  • provide a progress bar.
  • sampler
  • sort
  • object hash(for key) method
  • storage MEMORY_OR_DISK, and use it in sort
  • storage MEMORY_SER,storage in memory but off v8 heap.
  • config default partition count.

0.2.x: Remote mode

  • distributed master
  • runtime sandbox
  • plugin system
  • remote dependency management
  • aliyun oss loader
  • hdfs loader

How to use

Install from npm(shell only)

npm install -g dcf
#or
yarn global add dcf

Then you can use command: dcf-shell

Install from npm(as dependency)

npm install --save dcf
#or
yarn add dcf

Then you can use dcf with javascript or typescript.

Run samples & cli

download this repo, install dependencies

npm install
# or
yarn

Run samples:

npm run ts-node src/samples/tutorial-0.ts
npm run ts-node src/samples/repartition.ts

Run interactive cli:

npm start
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].