Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → anovos → anovos

anovos / anovos

Licence: other

Anovos - An Open Source Library for Scalable feature engineering Using Apache-Spark

Programming Languages

75241 projects

Jupyter Notebook

11667 projects

139335 projects - #7 most used programming language

56736 projects

77523 projects

30231 projects

Labels

visualization data-science machine-learning scale bigdata pyspark feature-engineering transformation feature-recommendation

Projects that are alternatives of or similar to anovos

A collection of tutorials on Hadoop, MapReduce, Spark, Docker

Stars: ✭ 34 (-55.84%)

Mutual labels: bigdata, pyspark

🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark

Stars: ✭ 986 (+1180.52%)

Mutual labels: bigdata, pyspark

data processing course

Some class materials for a data processing course using PySpark

Stars: ✭ 50 (-35.06%)

Mutual labels: bigdata, pyspark

Spark Py Notebooks

Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks

Stars: ✭ 1,338 (+1637.66%)

Mutual labels: bigdata, pyspark

Spark-and-Kafka IoT-Data-Processing-and-Analytics

Final Project for IoT: Big Data Processing and Analytics class. Analyzing U.S nationwide temperature from IoT sensors in real-time

Stars: ✭ 42 (-45.45%)

Mutual labels: bigdata, pyspark

🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

Stars: ✭ 1,351 (+1654.55%)

Mutual labels: bigdata, pyspark

Library for multi-dimensional, multi-sensor, uni/multivariate time series data analysis, unsupervised feature selection, unsupervised deep anomaly detection, and prototype of explainable AI for anomaly detector

Stars: ✭ 80 (+3.9%)

Mutual labels: feature-engineering

大数据采集,抽取平台

Stars: ✭ 292 (+279.22%)

Mutual labels: bigdata

featuretoolsOnSpark

A simplified version of featuretools for Spark

Stars: ✭ 24 (-68.83%)

Mutual labels: feature-engineering

traefik-ondemand-service

Traefik ondemand service for the traefik ondemand plugin

Stars: ✭ 35 (-54.55%)

Mutual labels: scale

PubMed-Best-Match

Machine-learning based pipeline relying on LambdaMART currently used in PubMed for relevance (Best Match) searches

Stars: ✭ 36 (-53.25%)

Mutual labels: feature-engineering

Measuring the performance of popular streaming engines with Yahoo's Streaming Benchmark

Stars: ✭ 52 (-32.47%)

Mutual labels: bigdata

λ A selfhostable serverless function runtime. Inspired by zeit now.

Stars: ✭ 15 (-80.52%)

Mutual labels: scale

SQL Parsers for BigData, built with antlr4.

Stars: ✭ 135 (+75.32%)

Mutual labels: bigdata

A declarative transformation language for GraphQL 🍸

Stars: ✭ 27 (-64.94%)

Mutual labels: transformation

163-bigdate-note

bigdata note

Stars: ✭ 38 (-50.65%)

Mutual labels: bigdata

flask-spark-docker

Just a boilerplate for PySpark and Flask

Stars: ✭ 32 (-58.44%)

Mutual labels: pyspark

GreyCat - Data Analytics, Temporal data, What-if, Live machine learning

Stars: ✭ 104 (+35.06%)

Mutual labels: bigdata

exemplary-ml-pipeline

Exemplary, annotated machine learning pipeline for any tabular data problem.

Stars: ✭ 23 (-70.13%)

Mutual labels: feature-engineering

the-apache-ignite-book

All code samples, scripts and more in-depth examples for The Apache Ignite Book. Include Apache Ignite 2.6 or above

Stars: ✭ 65 (-15.58%)

Mutual labels: bigdata

View All Similar Projects ➔

Anovos

Anovos is an open source library for feature engineering at scale. Built by data scientists & ML Engineers for the data science community, it provides all the capabilities required for data ingestion, data analysis, data drift & data stability analysis, feature recommendation and feature composition. In addition, it automatically produces easily interpretable professional data reports that help users understand the nature of data at first sight and further enable data scientists to identify and engineer features.

Leveraging the power of Apache Spark behind the scenes, Anovos improves data scientists' productivity and helps them build more resilient and better performing models.

Quick Start

The easiest way to try out Anovos and explore its capabilities is through the provided examples that you can run via Docker without the need to install anything on your local machine.

# Launch an anovos-examples Docker container
sudo docker run -p 8888:8888 anovos/anovos-examples-3.2.2:latest

To reach the Jupyter environment, open the link to http://127.0.0.1:8888/?token... generated by the Jupyter NotebookApp.

If you're not familiar with Anovos or feature engineering, the Getting Started with Anovos guide is a good place to begin your journey. You can find it in the /guides folder within the Jupyter environment.

For more detailed instructions on how to install Docker and how to troubleshoot potential issues, see the examples README.

Using Anovos

Requirements

To use Anovos, you need compatible versions of Apache Spark, Java and Python.

Currently, we officially support the following combinations:

Apache Spark 2.4.x on Java 8 with Python 3.7.x
Apache Spark 3.1.x on Java 11 with Python 3.9.x
Apache Spark 3.2.x on Java 11 with Python 3.10.x

To see what we're currently testing, see this configuration.

Installation

You can install the latest release of Anovos directly through PyPI:

pip install anovos

Documentation

We provide a comprehensive documentation at docs.anovos.ai that includes user guides as well as a detailed API documentation.

For usage examples, see the provided interactive guides and Jupyter notebooks as well as the Spark demo.

Overview

Roadmap

Anovos has designed for to support any feature engineering tasks in a scalable form. To see what's planned for the upcoming releases, see our roadmap.

Development Version

To try out the latest additions to Anovos, you can install it directly from GitHub:

pip install git+https://github.com/anovos/anovos.git

Please note that this version is frequently updated and might not be fully compatible with the documentation available at docs.anovos.ai.

Contribute

We're always happy to discuss and accept improvements to Anovos. To get started, please refer to our Contributing to Anovos page in the documentation.

To start coding, clone this repository, install both the regular and development requirements, and set up the pre-commit hooks:

git clone https://github.com/anovos/anovos.git
cd anovos/
pip install -r requirements.txt
pip install -r dev_requirements.txt
pre-commit install

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 77

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (2) 🔗