TransmogrifaiTransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (+73.81%)
FeastFeature Store for Machine Learning
Stars: ✭ 2,576 (+114.85%)
FeatranA Scala feature transformation library for data science and machine learning
Stars: ✭ 420 (-64.97%)
MmlsparkSimple and Distributed Machine Learning
Stars: ✭ 2,899 (+141.78%)
Spark TdaSparkTDA is a package for Apache Spark providing Topological Data Analysis Functionalities.
Stars: ✭ 45 (-96.25%)
awesome-AI-kubernetes❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++ etc
Stars: ✭ 95 (-92.08%)
Sk DistDistributed scikit-learn meta-estimators in PySpark
Stars: ✭ 260 (-78.32%)
SparklearningLearning Apache spark,including code and data .Most part can run local.
Stars: ✭ 558 (-53.46%)
Pyspark ExamplesCode examples on Apache Spark using python
Stars: ✭ 58 (-95.16%)
Awesome PulsarA curated list of Pulsar tools, integrations and resources.
Stars: ✭ 57 (-95.25%)
Zemberek Nlp ServerZemberek Türkçe NLP Java Kütüphanesi üzerine REST Docker Sunucu
Stars: ✭ 60 (-95%)
KontextfreiWriting application logic for Spark jobs that can be unit-tested without a SparkContext
Stars: ✭ 67 (-94.41%)
PycmMulti-class confusion matrix library in Python
Stars: ✭ 1,076 (-10.26%)
Docker HadoopA Docker container with a full Hadoop cluster setup with Spark and Zeppelin
Stars: ✭ 54 (-95.5%)
Utils4sscala、spark使用过程中,各种测试用例以及相关资料整理
Stars: ✭ 1,070 (-10.76%)
Ds CheatsheetsList of Data Science Cheatsheets to rule the world
Stars: ✭ 9,452 (+688.32%)
RsparklingRSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-94.58%)
BullseyeA functional language frontend for the Dart VM.
Stars: ✭ 53 (-95.58%)
Spark NkpNatural Korean Processor for Apache Spark
Stars: ✭ 50 (-95.83%)
W2vWord2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-94.66%)
Awesome Recommendation EngineThe purpose of this tiny project is to put things together with the know how that i learned from the course big data expert from formacionhadoop.com The idea is to show how to play with apache spark streaming, kafka,mongo, spark machine learning algorithms.
Stars: ✭ 47 (-96.08%)
DarwinEvolutionary Algorithms Framework
Stars: ✭ 72 (-93.99%)
Yogaiwork in progress
Stars: ✭ 59 (-95.08%)
Apache Spark Hands OnEducational notes,Hands on problems w/ solutions for hadoop ecosystem
Stars: ✭ 74 (-93.83%)
Rumble⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (-95.16%)
MlflowOpen source platform for the machine learning lifecycle
Stars: ✭ 10,898 (+808.92%)
Fast MrmrAn improved implementation of the classical feature selection method: minimum Redundancy and Maximum Relevance (mRMR).
Stars: ✭ 67 (-94.41%)
Pulsar SparkWhen Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-95.41%)
Lpa DetectorOptimize and improve the Label propagation algorithm
Stars: ✭ 75 (-93.74%)
Ml With Android 11A repository demonstrating all that's new in Android 11 for ML and how you could try it out for your own use-cases
Stars: ✭ 54 (-95.5%)
ThingsboardOpen-source IoT Platform - Device management, data collection, processing and visualization.
Stars: ✭ 10,526 (+777.9%)
Spark Submit UiThis is a based on playframwork for submit spark app
Stars: ✭ 53 (-95.58%)
Spark BigqueryGoogle BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
Stars: ✭ 65 (-94.58%)
LabsResearch on distributed system
Stars: ✭ 73 (-93.91%)
ImlКурс "Введение в машинное обучение" (ВМК, МГУ имени М.В. Ломоносова)
Stars: ✭ 46 (-96.16%)
Caffe2Caffe2 is a lightweight, modular, and scalable deep learning framework.
Stars: ✭ 8,409 (+601.33%)
NetworkmlMachine learning plugins for network traffic
Stars: ✭ 73 (-93.91%)
Pysparkgeoanalysis🌐 Interactive Workshop on GeoAnalysis using PySpark
Stars: ✭ 63 (-94.75%)
GgnetGG.Net Data Visualization
Stars: ✭ 45 (-96.25%)
Delta ArchitectureStreaming data changes to a Data Lake with Debezium and Delta Lake pipeline
Stars: ✭ 43 (-96.41%)
LudwigData-centric declarative deep learning framework
Stars: ✭ 8,018 (+568.72%)
Openai Api DotnetA C#/.NET SDK for accessing the OpenAI GPT-3 API
Stars: ✭ 41 (-96.58%)
Kamu CliNext generation tool for decentralized exchange and transformation of semi-structured data
Stars: ✭ 69 (-94.25%)
RoffildlibraryLibrary for MQL5 (MetaTrader) with Python, Java, Apache Spark, AWS
Stars: ✭ 63 (-94.75%)