All Projects → Feast → Similar Projects or Alternatives

1234 Open source projects that are alternatives of or similar to Feast

Transmogrifai
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (-19.1%)
Mutual labels:  features, spark, ml, feature-engineering
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-96.93%)
Mutual labels:  spark, big-data, data-engineering
Geni
A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-94.1%)
Mutual labels:  spark, big-data, data-engineering
Mmlspark
Simple and Distributed Machine Learning
Stars: ✭ 2,899 (+12.54%)
Mutual labels:  spark, ml, big-data
Succinct
Enabling queries on compressed data.
Stars: ✭ 257 (-90.02%)
Mutual labels:  spark, big-data
deepchecks
Test Suites for Validating ML Models & Data. Deepchecks is a Python package for comprehensively validating your machine learning models and data with minimal effort.
Stars: ✭ 1,595 (-38.08%)
Mutual labels:  ml, mlops
Polyaxon
Machine Learning Platform for Kubernetes (MLOps tools for experimentation and automation)
Stars: ✭ 2,966 (+15.14%)
Mutual labels:  ml, mlops
Spark Alchemy
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Stars: ✭ 122 (-95.26%)
Mutual labels:  spark, data-engineering
Listenbrainz Server
Server for the ListenBrainz project
Stars: ✭ 420 (-83.7%)
Mutual labels:  spark, big-data
Awesome Mlops
A curated list of references for MLOps
Stars: ✭ 7,119 (+176.36%)
Mutual labels:  ml, mlops
Magellan
Geo Spatial Data Analytics on Spark
Stars: ✭ 507 (-80.32%)
Mutual labels:  spark, big-data
oomstore
Lightweight and Fast Feature Store Powered by Go (and Rust).
Stars: ✭ 76 (-97.05%)
Mutual labels:  ml, mlops
SynapseML
Simple and Distributed Machine Learning
Stars: ✭ 3,355 (+30.24%)
Mutual labels:  big-data, ml
beneath
Beneath is a serverless real-time data platform ⚡️
Stars: ✭ 65 (-97.48%)
Mutual labels:  data-engineering, mlops
vertex-ai-samples
Sample code and notebooks for Vertex AI, the end-to-end machine learning platform on Google Cloud
Stars: ✭ 270 (-89.52%)
Mutual labels:  ml, mlops
Metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (-85.99%)
Mutual labels:  spark, big-data
Bigdl
Building Large-Scale AI Applications for Distributed Big Data
Stars: ✭ 3,813 (+48.02%)
Mutual labels:  spark, big-data
Data Science Ipython Notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+755.9%)
Mutual labels:  spark, big-data
Around Dataengineering
A Data Engineering & Machine Learning Knowledge Hub
Stars: ✭ 257 (-90.02%)
Mutual labels:  spark, data-engineering
Spark Movie Lens
An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (-71.08%)
Mutual labels:  spark, big-data
Just Dashboard
📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (-41.34%)
Mutual labels:  big-data, data-engineering
Autodl
Automated Deep Learning without ANY human intervention. 1'st Solution for AutoDL [email protected]
Stars: ✭ 854 (-66.85%)
Mutual labels:  big-data, feature-engineering
Docker Spark Cluster
A Spark cluster setup running on Docker containers
Stars: ✭ 57 (-97.79%)
Mutual labels:  spark, big-data
Sparkjni
A heterogeneous Apache Spark framework.
Stars: ✭ 11 (-99.57%)
Mutual labels:  spark, big-data
Waimak
Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.
Stars: ✭ 60 (-97.67%)
Mutual labels:  spark, data-engineering
Rsparkling
RSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-97.48%)
Mutual labels:  spark, big-data
incubator-liminal
Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful experiment to an automated pipeline of model training, validation, deployment and inference in production. Liminal provides a Domain Specific Language to build ML workflows on top of Apache Airflow.
Stars: ✭ 117 (-95.46%)
Mutual labels:  big-data, ml
hamilton
A scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (-76.24%)
gallia-core
A schema-aware Scala library for data transformation
Stars: ✭ 44 (-98.29%)
neptune-client
📒 Experiment tracking tool and model registry
Stars: ✭ 348 (-86.49%)
Mutual labels:  ml, mlops
awesome-AI-kubernetes
❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++ etc
Stars: ✭ 95 (-96.31%)
Mutual labels:  big-data, ml
yt-channels-DS-AI-ML-CS
A comprehensive list of 180+ YouTube Channels for Data Science, Data Engineering, Machine Learning, Deep learning, Computer Science, programming, software engineering, etc.
Stars: ✭ 1,038 (-59.7%)
Mutual labels:  ml, data-engineering
Sk Dist
Distributed scikit-learn meta-estimators in PySpark
Stars: ✭ 260 (-89.91%)
Mutual labels:  spark, ml
cli
Polyaxon Core Client & CLI to streamline MLOps
Stars: ✭ 18 (-99.3%)
Mutual labels:  ml, mlops
Sparkler
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Stars: ✭ 362 (-85.95%)
Mutual labels:  spark, big-data
Delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Stars: ✭ 3,903 (+51.51%)
Mutual labels:  spark, big-data
Great expectations
Always know what to expect from your data.
Stars: ✭ 5,808 (+125.47%)
Mutual labels:  data-engineering, mlops
Hub
Dataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+55.4%)
Mutual labels:  ml, mlops
Metaflow
🚀 Build and manage real-life data science projects with ease!
Stars: ✭ 5,108 (+98.29%)
Mutual labels:  ml, mlops
Featran
A Scala feature transformation library for data science and machine learning
Stars: ✭ 420 (-83.7%)
Mutual labels:  spark, ml
Pointblank
Data validation and organization of metadata for data frames and database tables
Stars: ✭ 480 (-81.37%)
Mutual labels:  spark, data-engineering
big-data-engineering-indonesia
A curated list of big data engineering tools, resources and communities.
Stars: ✭ 26 (-98.99%)
Mutual labels:  big-data, data-engineering
Hyperparameter hunter
Easy hyperparameter optimization and automatic result saving across machine learning algorithms and libraries
Stars: ✭ 648 (-74.84%)
Mutual labels:  ml, feature-engineering
Pyspark Example Project
Example project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (-75.43%)
Mutual labels:  spark, data-engineering
Goodreads etl pipeline
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (-69.22%)
Mutual labels:  spark, data-engineering
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+119.57%)
Mutual labels:  spark, big-data
Spark Tda
SparkTDA is a package for Apache Spark providing Topological Data Analysis Functionalities.
Stars: ✭ 45 (-98.25%)
Mutual labels:  spark, ml
Spark
Apache Spark - A unified analytics engine for large-scale data processing
Stars: ✭ 31,618 (+1127.41%)
Mutual labels:  spark, big-data
Spark Doc Zh
Apache Spark 官方文档中文版
Stars: ✭ 1,126 (-56.29%)
Mutual labels:  spark, big-data
Zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Stars: ✭ 5,513 (+114.01%)
Mutual labels:  spark, big-data
Cookbook
The Data Engineering Cookbook
Stars: ✭ 9,829 (+281.56%)
Mutual labels:  big-data, data-engineering
Spark Website
Apache Spark Website
Stars: ✭ 75 (-97.09%)
Mutual labels:  spark, big-data
Home
ApacheCN 开源组织:公告、介绍、成员、活动、交流方式
Stars: ✭ 1,199 (-53.45%)
Mutual labels:  spark, ml
Labs
Research on distributed system
Stars: ✭ 73 (-97.17%)
Mutual labels:  spark, big-data
Dataengineeringproject
Example end to end data engineering project.
Stars: ✭ 82 (-96.82%)
Mutual labels:  big-data, data-engineering
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (-48.06%)
Mutual labels:  spark, big-data
Bigdata Notes
大数据入门指南 ⭐
Stars: ✭ 10,991 (+326.67%)
Mutual labels:  spark, big-data
VickyBytes
Subscribe to this GitHub repo to access the latest tech talks, tech demos, learning materials & modules, and developer community updates!
Stars: ✭ 48 (-98.14%)
Mutual labels:  ml, mlops
leetspeek
Open and collaborative content from leet hackers!
Stars: ✭ 11 (-99.57%)
Mutual labels:  big-data, ml
Sparklearning
Learning Apache spark,including code and data .Most part can run local.
Stars: ✭ 558 (-78.34%)
Mutual labels:  spark, ml
1-60 of 1234 similar projects