All Projects → Accelerator → Similar Projects or Alternatives

1688 Open source projects that are alternatives of or similar to Accelerator

Geni
A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (+10.95%)
Drake Examples
Example workflows for the drake R package
Stars: ✭ 57 (-58.39%)
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-42.34%)
Drake
An R-focused pipeline toolkit for reproducibility and high-performance computing
Stars: ✭ 1,301 (+849.64%)
Just Dashboard
📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (+1002.92%)
Dataflowjavasdk
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+523.36%)
Mutual labels:  data-science, big-data, data-mining
Targets
Function-oriented Make-like declarative workflows for R
Stars: ✭ 293 (+113.87%)
Vizuka
Explore high-dimensional datasets and how your algo handles specific regions.
Stars: ✭ 100 (-27.01%)
Mutual labels:  data-science, big-data, data-mining
Pretzel
Javascript full-stack framework for Big Data visualisation and analysis
Stars: ✭ 26 (-81.02%)
Mutual labels:  data-science, big-data
Clevercsv
CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
Stars: ✭ 887 (+547.45%)
Mutual labels:  data-science, data-mining
Griffon Vm
Griffon Data Science Virtual Machine
Stars: ✭ 128 (-6.57%)
Mutual labels:  data-science, big-data
Biolitmap
Code for the paper "BIOLITMAP: a web-based geolocated and temporal visualization of the evolution of bioinformatics publications" in Oxford Bioinformatics.
Stars: ✭ 18 (-86.86%)
Mutual labels:  data-science, data-mining
Spring2017 proffosterprovost
Introduction to Data Science
Stars: ✭ 18 (-86.86%)
Mutual labels:  data-science, data-mining
Data Science On Gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+530.66%)
Mutual labels:  data-science, data-engineering
Awesome Fraud Detection Papers
A curated list of data mining papers about fraud detection.
Stars: ✭ 843 (+515.33%)
Mutual labels:  data-science, data-mining
Tadw
An implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Stars: ✭ 43 (-68.61%)
Mutual labels:  data-science, data-mining
Attaca
Robust, distributed version control for large files.
Stars: ✭ 41 (-70.07%)
Mutual labels:  data-science, big-data
Batchtools
Tools for computation on batch systems
Stars: ✭ 127 (-7.3%)
Etherscan Ml
Python Data Science and Machine Learning Library for the Ethereum and ERC-20 Blockchain
Stars: ✭ 55 (-59.85%)
Mutual labels:  data-science, data-mining
Steppy Toolkit
Curated set of transformers that make your work with steppy faster and more effective 🔭
Stars: ✭ 21 (-84.67%)
Mutual labels:  data-science, reproducibility
Datumbox Framework
Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.
Stars: ✭ 1,063 (+675.91%)
Mutual labels:  data-science, big-data
Verticapy
VerticaPy is a Python library that exposes sci-kit like functionality to conduct data science projects on data stored in Vertica, thus taking advantage Vertica’s speed and built-in analytics and machine learning capabilities.
Stars: ✭ 59 (-56.93%)
Mutual labels:  data-science, big-data
Linkedingiveaway
👨🏽‍🏫You can learn about anything over here. What Giveaways I do and why it's important in today's modern world. Are you interested in Giveaway's?🔋
Stars: ✭ 67 (-51.09%)
Mutual labels:  data-science, data-mining
Sayn
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Stars: ✭ 79 (-42.34%)
Mutual labels:  data-science, data-engineering
Dex
Dex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (+803.65%)
Mutual labels:  data-science, data-mining
Pipelinex
PipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more
Stars: ✭ 127 (-7.3%)
Mutual labels:  data-science, data-engineering
Pyclustering
pyclustring is a Python, C++ data mining library.
Stars: ✭ 806 (+488.32%)
Mutual labels:  data-science, data-mining
Prefect
The easiest way to automate your data
Stars: ✭ 7,956 (+5707.3%)
Mutual labels:  data-science, data-engineering
Model Describer
model-describer : Making machine learning interpretable to humans
Stars: ✭ 22 (-83.94%)
Mutual labels:  data-science, data-mining
Cookbook 2nd
IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (+413.87%)
Mutual labels:  data-science, data-mining
Applied Ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Stars: ✭ 17,824 (+12910.22%)
Mutual labels:  data-science, data-engineering
Autodl
Automated Deep Learning without ANY human intervention. 1'st Solution for AutoDL [email protected]
Stars: ✭ 854 (+523.36%)
Mutual labels:  data-science, big-data
Feast
Feature Store for Machine Learning
Stars: ✭ 2,576 (+1780.29%)
Mutual labels:  big-data, data-engineering
Sciblog support
Support content for my blog
Stars: ✭ 694 (+406.57%)
Mutual labels:  data-science, big-data
Mldm
потоковый курс "Машинное обучение и анализ данных (Machine Learning and Data Mining)" на факультете ВМК МГУ имени М.В. Ломоносова
Stars: ✭ 35 (-74.45%)
Mutual labels:  data-science, data-mining
Dvc
🦉Data Version Control | Git for Data & Models | ML Experiments Management
Stars: ✭ 9,004 (+6472.26%)
Mutual labels:  data-science, reproducibility
Php Ml
PHP-ML - Machine Learning library for PHP
Stars: ✭ 7,900 (+5666.42%)
Mutual labels:  data-science, data-mining
Open Solution Value Prediction
Open solution to the Santander Value Prediction Challenge 🐠
Stars: ✭ 34 (-75.18%)
Mutual labels:  data-science, reproducibility
Pycm
Multi-class confusion matrix library in Python
Stars: ✭ 1,076 (+685.4%)
Mutual labels:  data-science, data-mining
Vvedenie Mashinnoe Obuchenie
📝 Подборка ресурсов по машинному обучению
Stars: ✭ 1,282 (+835.77%)
Mutual labels:  data-science, data-mining
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+876.64%)
Mutual labels:  data-science, big-data
Graph sampling
Graph Sampling is a python package containing various approaches which samples the original graph according to different sample sizes.
Stars: ✭ 99 (-27.74%)
Mutual labels:  big-data, data-mining
Rsparkling
RSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-52.55%)
Mutual labels:  data-science, big-data
Pyspark Example Project
Example project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+362.04%)
Mutual labels:  data-science, data-engineering
Tsv Utils
eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.
Stars: ✭ 1,215 (+786.86%)
Mutual labels:  data-science, data-mining
Dataengineeringproject
Example end to end data engineering project.
Stars: ✭ 82 (-40.15%)
Mutual labels:  big-data, data-engineering
Tsrepr
TSrepr: R package for time series representations
Stars: ✭ 75 (-45.26%)
Mutual labels:  data-science, data-mining
Butterfree
A tool for building feature stores.
Stars: ✭ 126 (-8.03%)
Mutual labels:  data-science, data-engineering
Papers Literature Ml Dl Rl Ai
Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning
Stars: ✭ 1,341 (+878.83%)
Mutual labels:  data-science, data-mining
Cookbook
The Data Engineering Cookbook
Stars: ✭ 9,829 (+7074.45%)
Mutual labels:  big-data, data-engineering
Superset
Apache Superset is a Data Visualization and Data Exploration Platform
Stars: ✭ 42,634 (+31019.71%)
Mutual labels:  data-science, data-engineering
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-20.44%)
Mutual labels:  data-science, big-data
Pythondata
repo for code published on pythondata.com
Stars: ✭ 113 (-17.52%)
Mutual labels:  data-science, big-data
Tennis Crystal Ball
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-21.9%)
Mutual labels:  data-science, big-data
Aws Data Wrangler
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+1640.88%)
Mutual labels:  data-science, data-engineering
Rightmove webscraper.py
Python class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
Stars: ✭ 125 (-8.76%)
Mutual labels:  data-science, data-mining
Dataproofer
A proofreader for your data
Stars: ✭ 628 (+358.39%)
Mutual labels:  data-science, data-mining
Data Science Career
Career Resources for Data Science, Machine Learning, Big Data and Business Analytics Career Repository
Stars: ✭ 630 (+359.85%)
Mutual labels:  data-science, big-data
My Journey In The Data Science World
📢 Ready to learn or review your knowledge!
Stars: ✭ 1,175 (+757.66%)
Mutual labels:  data-science, big-data
Steppy
Lightweight, Python library for fast and reproducible experimentation 🔬
Stars: ✭ 119 (-13.14%)
Mutual labels:  data-science, reproducibility
1-60 of 1688 similar projects