All Projects → machine-learning-data-pipeline → Similar Projects or Alternatives

363 Open source projects that are alternatives of or similar to machine-learning-data-pipeline

Moose
Multiphysics Object Oriented Simulation Environment
Stars: ✭ 652 (+2863.64%)
Mutual labels:  parallel
Texar Pytorch
Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 636 (+2790.91%)
Mutual labels:  data-processing
datajob
Build and deploy a serverless data pipeline on AWS with no effort.
Stars: ✭ 101 (+359.09%)
Mutual labels:  data-pipeline
Awesome Web Scraping
List of libraries, tools and APIs for web scraping and data processing.
Stars: ✭ 4,510 (+20400%)
Mutual labels:  data-processing
Adaptive
📈 Adaptive: parallel active learning of mathematical functions
Stars: ✭ 646 (+2836.36%)
Mutual labels:  parallel
Xidel
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Stars: ✭ 335 (+1422.73%)
Mutual labels:  data-processing
Metasync
Asynchronous Programming Library for JavaScript & Node.js
Stars: ✭ 164 (+645.45%)
Mutual labels:  parallel
Dali
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Stars: ✭ 3,624 (+16372.73%)
Mutual labels:  data-processing
Ems
Extended Memory Semantics - Persistent shared object memory and parallelism for Node.js and Python
Stars: ✭ 552 (+2409.09%)
Mutual labels:  parallel
Rapidtables
Super fast list of dicts to pre-formatted tables conversion library for Python 2/3
Stars: ✭ 292 (+1227.27%)
Mutual labels:  data-processing
pyfuncol
Functional collections extension functions for Python
Stars: ✭ 32 (+45.45%)
Mutual labels:  parallel
baleen3
Baleen 3 is a data processing tool based on the Annot8 framework
Stars: ✭ 15 (-31.82%)
Mutual labels:  data-processing
Proxypool
给爬虫使用的代理IP池
Stars: ✭ 508 (+2209.09%)
Mutual labels:  parallel
pulserl
Apache Pulsar client library for Erlang/Elixir
Stars: ✭ 15 (-31.82%)
Mutual labels:  data-processing
Raytracer
Ray tracer with phong lighting, reflections, refractions, normal mapping, procedural textures, super sampling, and depth of field.
Stars: ✭ 155 (+604.55%)
Mutual labels:  parallel
meta-schema
Little DSL to make data processing sane with clojure.spec and spec-tools
Stars: ✭ 25 (+13.64%)
Mutual labels:  data-processing
Asyncro
⛵️ Beautiful Array utilities for ESnext async/await ~
Stars: ✭ 487 (+2113.64%)
Mutual labels:  parallel
bonobo-sqlalchemy
PREVIEW - SQL databases in Bonobo, using sqlalchemy
Stars: ✭ 23 (+4.55%)
Mutual labels:  data-processing
WAND-PIC
WAND-PIC
Stars: ✭ 20 (-9.09%)
Mutual labels:  parallel
Speech-Recognition
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (-4.55%)
Mutual labels:  data-processing
Easylambda
distributed dataflows with functional list operations for data processing with C++14
Stars: ✭ 475 (+2059.09%)
Mutual labels:  parallel
traceml
Engine for ML/Data tracking, visualization, dashboards, and model UI for Polyaxon.
Stars: ✭ 445 (+1922.73%)
Mutual labels:  data-processing
Oneflow
LargeScale Multiphysics Scientific Simulation Environment-OneFLOW CFD
Stars: ✭ 150 (+581.82%)
Mutual labels:  parallel
Anatomy-of-System-Engineering
System Engineering Memory Map
Stars: ✭ 17 (-22.73%)
Mutual labels:  data-processing
Optuna
A hyperparameter optimization framework
Stars: ✭ 5,679 (+25713.64%)
Mutual labels:  parallel
parallel-corpora-tools
Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.
Stars: ✭ 35 (+59.09%)
Mutual labels:  data-processing
pareach
a tiny function that "parallelizes" work in NodeJS
Stars: ✭ 19 (-13.64%)
Mutual labels:  parallel
Processor
Ontology-driven Linked Data processor and server for SPARQL backends. Apache License.
Stars: ✭ 54 (+145.45%)
Mutual labels:  data-processing
Libmesh
libMesh github repository
Stars: ✭ 450 (+1945.45%)
Mutual labels:  parallel
AMIDD
Introduction to Applied Mathematics and Informatics in Drug Discovery (AMIDD)
Stars: ✭ 13 (-40.91%)
Mutual labels:  computing
Pytest Parallel
A pytest plugin for parallel and concurrent testing
Stars: ✭ 146 (+563.64%)
Mutual labels:  parallel
SciCompforChemists
Scientific Computing for Chemists text for teaching basic computing skills to chemistry students using Python, Jupyter notebooks, and the SciPy stack. This text makes use of a variety of packages including NumPy, SciPy, matplotlib, pandas, seaborn, NMRglue, SymPy, scikit-image, and scikit-learn.
Stars: ✭ 65 (+195.45%)
Mutual labels:  computing
Machma
Easy parallel execution of commands with live feedback
Stars: ✭ 438 (+1890.91%)
Mutual labels:  parallel
ideas4
An Additional 100 Ideas for Computing https://samsquire.github.io/ideas4/
Stars: ✭ 26 (+18.18%)
Mutual labels:  computing
await
28Kb, small memory footprint, single binary that run list of commands in parallel and waits for their termination
Stars: ✭ 73 (+231.82%)
Mutual labels:  parallel
flux
Flux, Your Gateway to a Decentralized World. https://home.runonflux.io https://api.runonflux.io https://docs.runonflux.io https://source.runonflux.io https://wiki.runonflux.io
Stars: ✭ 150 (+581.82%)
Mutual labels:  computing
Rush
A cross-platform command-line tool for executing jobs in parallel
Stars: ✭ 421 (+1813.64%)
Mutual labels:  parallel
foofah
Foofah: programming-by-example data transformation program synthesizer
Stars: ✭ 24 (+9.09%)
Mutual labels:  data-preparation
Veros
The versatile ocean simulator, in pure Python, powered by Bohrium.
Stars: ✭ 136 (+518.18%)
Mutual labels:  parallel
bumblebee
🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
Stars: ✭ 120 (+445.45%)
Mutual labels:  data-preparation
Cloe
Cloe programming language
Stars: ✭ 398 (+1709.09%)
Mutual labels:  parallel
Data Engineering Howto
A list of useful resources to learn Data Engineering from scratch
Stars: ✭ 2,056 (+9245.45%)
Mutual labels:  data-pipeline
HPC
A collection of various resources, examples, and executables for the general NREL HPC user community's benefit. Use the following website for accessing documentation.
Stars: ✭ 64 (+190.91%)
Mutual labels:  computing
pipeline
OONI data processing pipeline
Stars: ✭ 36 (+63.64%)
Mutual labels:  data-pipeline
Blackbox
A Python module for parallel optimization of expensive black-box functions
Stars: ✭ 378 (+1618.18%)
Mutual labels:  parallel
network-pipeline
Network traffic data pipeline for real-time predictions and building datasets for deep neural networks
Stars: ✭ 36 (+63.64%)
Mutual labels:  data-pipeline
Fpart
Sort files and pack them into partitions
Stars: ✭ 127 (+477.27%)
Mutual labels:  parallel
richflow
A Node.js and JavaScript synchronous data pipeline processing, data sharing and stream processing library. Actionable & Transformable Pipeline data processing.
Stars: ✭ 17 (-22.73%)
Mutual labels:  data-pipeline
Asyncenumerable
Defines IAsyncEnumerable, IAsyncEnumerator, ForEachAsync(), ParallelForEachAsync(), and other useful stuff to use with async-await
Stars: ✭ 367 (+1568.18%)
Mutual labels:  parallel
jobAnalytics and search
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (+13.64%)
Mutual labels:  data-pipeline
optimus
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+6040.91%)
Mutual labels:  data-preparation
Msrsync
Multi-stream rsync wrapper
Stars: ✭ 328 (+1390.91%)
Mutual labels:  parallel
sst-core
SST Structural Simulation Toolkit Parallel Discrete Event Core and Services
Stars: ✭ 82 (+272.73%)
Mutual labels:  parallel
sciblox
sciblox - Easier Data Science and Machine Learning
Stars: ✭ 48 (+118.18%)
Mutual labels:  data-preprocessing
raptor
General, high performance algebraic multigrid solver
Stars: ✭ 50 (+127.27%)
Mutual labels:  parallel
PTTmineR
Parallel Searching and Crawling Data from PTT 🚀
Stars: ✭ 31 (+40.91%)
Mutual labels:  parallel
etl
[READ-ONLY] PHP - ETL (Extract Transform Load) data processing library
Stars: ✭ 279 (+1168.18%)
Mutual labels:  data-processing
Transducers.jl
Efficient transducers for Julia
Stars: ✭ 226 (+927.27%)
Mutual labels:  parallel
Ray Tracing Iow Rust
Ray Tracing in One Weekend written in Rust
Stars: ✭ 57 (+159.09%)
Mutual labels:  parallel
python-appium-framework
Complete Python Appium framework in 360 degree
Stars: ✭ 43 (+95.45%)
Mutual labels:  parallel
301-360 of 363 similar projects