eBay / Accelerator

Licence: apache-2.0

The Accelerator is a tool for fast and reproducible processing of large amounts of data.

Labels

data-science big-data data-mining high-performance-computing reproducibility data-engineering parallel-processing

The Accelerator is a tool for fast and reproducible processing of large amounts of data. Extensive documentation is available here:

Reference Manual
Home Page
PyPI

Installation

pip install accelerator

After installation, try "ax --help".
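
The post-install check can also be scripted. The following is a minimal sketch, not part of the Accelerator itself: the helper name check_cli is hypothetical, and treating a zero exit status from "ax --help" as success is an assumption. It looks for the command on PATH and, if found, runs it with --help.

```python
import shutil
import subprocess


def check_cli(command="ax"):
    """Return (ok, detail) for a command expected to be on PATH.

    Hypothetical helper for illustration; not part of the Accelerator.
    """
    path = shutil.which(command)
    if path is None:
        return False, f"'{command}' not found on PATH; is the package installed in this environment?"
    # Run "<command> --help" and capture its output instead of printing it directly.
    result = subprocess.run(
        [path, "--help"], capture_output=True, text=True, timeout=30
    )
    # Assumption: a zero exit status means the help text was printed successfully.
    return result.returncode == 0, result.stdout or result.stderr


if __name__ == "__main__":
    ok, detail = check_cli("ax")
    print("ok" if ok else "problem")
    print(detail)
```

If the command is reported as missing even though pip succeeded, a likely cause is that pip installed into a different Python environment than the one on the current PATH.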

Supported Environments

The Accelerator has been built, tested, and run on:

Ubuntu 16.04, 18.04, 20.04
Debian 9, 10
FreeBSD 11.3, 12.1

but is not limited to these systems or versions.

Windows is not supported, but WSL should work.
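
As a rough illustration of the platform notes above, the sketch below classifies the current system using Python's standard platform module. The function name support_note is hypothetical, and detecting WSL by looking for "microsoft" in the kernel release string is a common heuristic assumed here, not something the Accelerator documents.

```python
import platform


def support_note(system=None, release=None):
    """Return a short note on how the given system relates to the tested list.

    Hypothetical helper; defaults to the values reported for the current machine.
    """
    system = platform.system() if system is None else system
    release = platform.release() if release is None else release
    if system == "Windows":
        return "Native Windows is not supported; WSL should work."
    # Assumption: WSL kernels include "microsoft" in their release string.
    if system == "Linux" and "microsoft" in release.lower():
        return "Running under WSL, which should work."
    if system in ("Linux", "FreeBSD"):
        return f"{system} is among the tested system families."
    return f"{system} is not on the tested list, but may still work."


if __name__ == "__main__":
    print(support_note())
```

The function only distinguishes operating system families; it does not check specific distribution versions such as Ubuntu 20.04 or Debian 10.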

License

Copyright 2017-2018 eBay Inc.
Modifications copyright (c) 2018-2021 Carl Drougge
Modifications copyright (c) 2019-2021 Anders Berkeman

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

https://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
