
EmanuelOverflow / LSTM-TensorSpark

License: MIT
Implementation of an LSTM with TensorFlow, distributed on Apache Spark

Programming Languages

Python

Projects that are alternatives to or similar to LSTM-TensorSpark

PFL-Non-IID
The origin of the Non-IID phenomenon is user personalization: users generate Non-IID (Not Independent and Identically Distributed) data. Many approaches have been proposed to tackle Non-IID issues in the federated learning setting; personalized federated learning, in contrast, may take advantage…
Stars: ✭ 58 (+45%)
Mutual labels:  distributed-computing, rnn
Image-Captioning-with-Beam-Search
Generating image captions using Xception Network and Beam Search in Keras
Stars: ✭ 18 (-55%)
Mutual labels:  rnn
MetaTraderForecast
RNN based Forecasting App for Meta Trader and similar trading platforms
Stars: ✭ 103 (+157.5%)
Mutual labels:  rnn
paleo
An analytical performance modeling tool for deep neural networks.
Stars: ✭ 76 (+90%)
Mutual labels:  distributed-computing
name2gender
Extrapolate gender from first names using Naïve-Bayes and PyTorch Char-RNN
Stars: ✭ 24 (-40%)
Mutual labels:  rnn
text-rnn-tensorflow
Tutorial: Multi-layer Recurrent Neural Networks (LSTM, RNN) for text models in Python using TensorFlow.
Stars: ✭ 22 (-45%)
Mutual labels:  rnn
fahclient
Dockerized Folding@home client with NVIDIA GPU support to help battle COVID-19
Stars: ✭ 38 (-5%)
Mutual labels:  distributed-computing
address-net
A package to structure Australian addresses
Stars: ✭ 171 (+327.5%)
Mutual labels:  rnn
pyspark-algorithms
PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Stars: ✭ 72 (+80%)
Mutual labels:  distributed-computing
Clockwork-RNN
This repository is a reproduction of the clockwork RNN paper.
Stars: ✭ 20 (-50%)
Mutual labels:  rnn
mesos-pinspider
A framework called "pinspider" on Apache Mesos to fetch basic user information from a user's Pinterest page.
Stars: ✭ 18 (-55%)
Mutual labels:  distributed-computing
neural-namer
Fantasy name generator in TensorFlow
Stars: ✭ 65 (+62.5%)
Mutual labels:  rnn
meesee
Task queue with long-lived workers for work-based parallelization, using processes and Redis as the back-end. For distributed computing.
Stars: ✭ 14 (-65%)
Mutual labels:  distributed-computing
Theano-MPI
MPI Parallel framework for training deep learning models built in Theano
Stars: ✭ 55 (+37.5%)
Mutual labels:  distributed-computing
open-stream-processing-benchmark
This repository contains the code base for the Open Stream Processing Benchmark.
Stars: ✭ 37 (-7.5%)
Mutual labels:  distributed-computing
tf-attend-infer-repeat
TensorFlow-based implementation of "Attend, Infer, Repeat" paper (Eslami et al., 2016, arXiv:1603.08575).
Stars: ✭ 44 (+10%)
Mutual labels:  rnn
Orleans.CosmosDB
Orleans providers for Azure Cosmos DB
Stars: ✭ 36 (-10%)
Mutual labels:  distributed-computing
dislib
The distributed computing library for Python, implemented using the PyCOMPSs programming model for HPC.
Stars: ✭ 39 (-2.5%)
Mutual labels:  distributed-computing
udacity-cvnd-projects
My solutions to the projects assigned for the Udacity Computer Vision Nanodegree
Stars: ✭ 36 (-10%)
Mutual labels:  rnn
catacomb
The simplest machine learning library for launching UIs, running evaluations, and comparing model performance.
Stars: ✭ 13 (-67.5%)
Mutual labels:  rnn

LSTM-TensorSpark

Implementation of an LSTM with TensorFlow, distributed on Apache Spark

Two different implementations are provided:

  • Distributed on Spark;
  • Standalone.

Detailed explanation here: Distributed implementation of an LSTM on Spark and TensorFlow

Developed for academic purposes.
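
The gist of the distributed variant is data parallelism: the training set is split into Spark partitions and a model replica is trained on each shard. The sketch below only illustrates that pattern with PySpark's mapPartitions; it is not the repository's actual code, and every name in it is hypothetical.

import numpy as np
from pyspark import SparkContext

def train_lstm_on_shard(rows):
    """Hypothetical per-partition trainer. A real implementation would
    build a TensorFlow LSTM graph here and run a training loop over the
    shard; this stub returns placeholder 'parameters' so the control
    flow is runnable end to end."""
    data = np.array(list(rows), dtype=np.float32)
    if data.size == 0:
        return []                   # empty partition: contribute nothing
    return [data.mean(axis=0)]      # placeholder for learned parameters

if __name__ == "__main__":
    sc = SparkContext(appName="lstm-tensorspark-sketch")
    # Toy stand-in for iris.data: 100 feature vectors of length 4.
    rdd = sc.parallelize(np.random.rand(100, 4).tolist(), numSlices=4)
    # One replica per partition (cf. the --partitions flag below); the
    # driver combines the per-shard results.
    shard_params = rdd.mapPartitions(train_lstm_on_shard).collect()
    print("combined parameters:", np.mean(shard_params, axis=0))
    sc.stop()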

Dependencies

The distributed model requires:

  • Python 2.6+
  • PySpark
  • TensorFlow 1.0+
  • NumPy
  • argparse
  • tqdm

The standalone model requires:

  • Python 2.6+
  • TensorFlow 1.0+
  • NumPy
  • argparse
  • tqdm
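
If they are not already available, the dependencies can typically be installed with pip (a suggested command, not taken from the project docs; the project targets the TensorFlow 1.x line, hence the pin):

pip install pyspark "tensorflow<2" numpy tqdm

argparse is part of the standard library from Python 2.7 onward, so it usually needs no separate install.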

Usage

Example using Spark:

From the src directory:

spark-submit rnn.py --training_path ../dataset/iris.data --labels_path ../dataset/labels.data --output_path train_dir_iris --partitions 4
usage: rnn.py [-h] [--master MASTER] [--spark_exec_memory SPARK_EXEC_MEMORY]
              [--partitions PARTITIONS] [--epochs EPOCHS]
              [--hidden_units HIDDEN_UNITS] [--batch_size BATCH_SIZE]
              [--num_classes NUM_CLASSES] [--in_features IN_FEATURES]
              [--evaluate_every EVALUATE_EVERY]
              [--learning_rate LEARNING_RATE] [--training_path TRAINING_PATH]
              [--labels_path LABELS_PATH] [--output_path OUTPUT_PATH]
              [--mode MODE] [--checkpoint_path CHECKPOINT_PATH]
optional arguments:
  -h, --help            show this help message and exit
  --master MASTER       Host or master node location (can be node name)
  --spark_exec_memory SPARK_EXEC_MEMORY
                        Spark executor memory
  --partitions PARTITIONS
                        Number of distributed partitions
  --epochs EPOCHS       Number of epochs
  --hidden_units HIDDEN_UNITS
                        List of hidden units per layer (separated by commas)
  --batch_size BATCH_SIZE
                        Mini batch size
  --num_classes NUM_CLASSES
                        Number of classes in dataset
  --in_features IN_FEATURES
                        Number of input features
  --evaluate_every EVALUATE_EVERY
                        Number of steps between evaluations
  --learning_rate LEARNING_RATE
                        Learning rate
  --training_path TRAINING_PATH
                        Path to training set
  --labels_path LABELS_PATH
                        Path to the training labels
  --output_path OUTPUT_PATH
                        Path to store the network state
  --mode MODE           Execution mode
  --checkpoint_path CHECKPOINT_PATH
                        Directory where to save network model and logs
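
Hyperparameters can be tuned from the command line. For instance (the values below are purely illustrative, not recommended settings), a two-layer network is selected by passing a comma-separated list to --hidden_units:

spark-submit rnn.py --training_path ../dataset/iris.data --labels_path ../dataset/labels.data --output_path train_dir_iris --partitions 4 --hidden_units 32,16 --epochs 100 --learning_rate 0.01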

Example without Spark:

From the src directory:

python lstm-no-spark.py --training_path ../dataset/iris.data --labels_path ../dataset/labels.data --output_path train_dir_iris

usage: lstm-no-spark.py [-h] [--epochs EPOCHS] [--hidden_units HIDDEN_UNITS]
              [--batch_size BATCH_SIZE] [--num_classes NUM_CLASSES]
              [--in_features IN_FEATURES] [--evaluate_every EVALUATE_EVERY]
              [--learning_rate LEARNING_RATE] [--training_path TRAINING_PATH]
              [--labels_path LABELS_PATH] [--output_path OUTPUT_PATH]
              [--mode MODE] [--checkpoint_path CHECKPOINT_PATH]
optional arguments:
  -h, --help            show this help message and exit
  --epochs EPOCHS       Number of epochs
  --hidden_units HIDDEN_UNITS
                        List of hidden units per layer (separated by commas)
  --batch_size BATCH_SIZE
                        Mini batch size
  --num_classes NUM_CLASSES
                        Number of classes in dataset
  --in_features IN_FEATURES
                        Number of input features
  --evaluate_every EVALUATE_EVERY
                        Number of steps between evaluations
  --learning_rate LEARNING_RATE
                        Learning rate
  --training_path TRAINING_PATH
                        Path to training set
  --labels_path LABELS_PATH
                        Path to the training labels
  --output_path OUTPUT_PATH
                        Path to store the network state
  --mode MODE           Execution mode
  --checkpoint_path CHECKPOINT_PATH
                        Directory where to save network model and logs