All Projects → eBay → Accelerator

eBay / Accelerator

Licence: apache-2.0
The Accelerator is a tool for fast and reproducible processing of large amounts of data.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Accelerator

Geni
A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (+10.95%)
Mutual labels:  data-science, big-data, data-engineering, high-performance-computing
Vizuka
Explore high-dimensional datasets and how your algo handles specific regions.
Stars: ✭ 100 (-27.01%)
Mutual labels:  data-science, big-data, data-mining
Targets
Function-oriented Make-like declarative workflows for R
Stars: ✭ 293 (+113.87%)
Mutual labels:  data-science, reproducibility, high-performance-computing
Just Dashboard
📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (+1002.92%)
Mutual labels:  data-science, big-data, data-engineering
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-42.34%)
Mutual labels:  data-science, big-data, data-engineering
Dataflowjavasdk
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+523.36%)
Mutual labels:  data-science, big-data, data-mining
Drake Examples
Example workflows for the drake R package
Stars: ✭ 57 (-58.39%)
Mutual labels:  data-science, reproducibility, high-performance-computing
Drake
An R-focused pipeline toolkit for reproducibility and high-performance computing
Stars: ✭ 1,301 (+849.64%)
Mutual labels:  data-science, reproducibility, high-performance-computing
Tennis Crystal Ball
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-21.9%)
Mutual labels:  data-science, big-data
Superset
Apache Superset is a Data Visualization and Data Exploration Platform
Stars: ✭ 42,634 (+31019.71%)
Mutual labels:  data-science, data-engineering
Pythondata
repo for code published on pythondata.com
Stars: ✭ 113 (-17.52%)
Mutual labels:  data-science, big-data
D6t Python
Accelerate data science
Stars: ✭ 118 (-13.87%)
Mutual labels:  data-science, data-engineering
Steppy
Lightweight, Python library for fast and reproducible experimentation 🔬
Stars: ✭ 119 (-13.14%)
Mutual labels:  data-science, reproducibility
Spark Alchemy
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Stars: ✭ 122 (-10.95%)
Mutual labels:  data-science, data-engineering
Graph sampling
Graph Sampling is a python package containing various approaches which samples the original graph according to different sample sizes.
Stars: ✭ 99 (-27.74%)
Mutual labels:  big-data, data-mining
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-20.44%)
Mutual labels:  data-science, big-data
Papers Literature Ml Dl Rl Ai
Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning
Stars: ✭ 1,341 (+878.83%)
Mutual labels:  data-science, data-mining
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+876.64%)
Mutual labels:  data-science, big-data
Butterfree
A tool for building feature stores.
Stars: ✭ 126 (-8.03%)
Mutual labels:  data-science, data-engineering
Aws Data Wrangler
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+1640.88%)
Mutual labels:  data-science, data-engineering

The Accelerator is a tool for fast and reproducible processing of large amounts of data. Extensive documentation is available here:

Reference Manual
Home Page
PyPI

pip install accelerator
After installation try "ax --help".

Supported Environments

The Accelerator project has been built, tested, and runs on:

  • Ubuntu 16.04, 18.04, 20.04
  • Debian 9, 10
  • FreeBSD 11.3, 12.1

but is not limited to these systems or versions.

Windows is not supported, but WSL should work.

License

Copyright 2017-2018 eBay Inc.
Modifications copyright (c) 2018-2021 Carl Drougge
Modifications copyright (c) 2019-2021 Anders Berkeman

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

https://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].