
VinceShieh / Spark Ffm

Licence: apache-2.0
FFM (Field-aware Factorization Machine) on Spark

Programming language: Scala

Spark-FFM

A Spark-based implementation of Field-aware Factorization Machines (FFM). See http://www.csie.ntu.edu.tw/~cjlin/papers/ffm.pdf

To fit an FFM, each line of the input data should be formatted as

label field1:feat1:val1 field2:feat2:val2

That is, the format extends the LIBSVM data format by adding field information to each feature.
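As an illustration of the format above, here is a minimal sketch of a line parser. The names FfmNode and FfmLine are hypothetical and are not part of this project's API, and the sketch assumes that field and feature identifiers are integer indices, which the description above does not state.

```scala
// Hypothetical container for one "field:feature:value" triple (not the project's type).
final case class FfmNode(field: Int, feature: Int, value: Double)

object FfmLine {
  /** Parses "label field:feat:val field:feat:val ..." into (label, nodes). */
  def parse(line: String): (Double, Array[FfmNode]) = {
    val tokens = line.trim.split("\\s+")
    val label = tokens.head.toDouble
    val nodes = tokens.tail.map { tok =>
      val parts = tok.split(":")
      // Assumption: every token has exactly three colon-separated parts.
      require(parts.length == 3, s"expected field:feature:value, got '$tok'")
      FfmNode(parts(0).toInt, parts(1).toInt, parts(2).toDouble)
    }
    (label, nodes)
  }
}
```

For example, FfmLine.parse("1 0:3:1.0 1:7:0.5") yields the label 1.0 and two nodes, the second being field 1, feature 7, value 0.5. On an RDD of text lines the same function could be applied with map.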

Currently, we support the parallel SGD and parallel AdaGrad optimization methods, as they are more efficient when dealing with large datasets.
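To make the difference between the two optimizers concrete, the sketch below shows the standard per-coordinate update rules for plain SGD and AdaGrad on a single machine. It is not taken from this project's code: the object name Updates, the parameter names, and the eps smoothing constant are assumptions, and how the project distributes or combines updates across Spark partitions is not described here and is left out.

```scala
object Updates {
  /** Plain SGD step: w <- w - eta * g, applied in place. */
  def sgdStep(w: Array[Double], g: Array[Double], eta: Double): Unit = {
    var i = 0
    while (i < w.length) {
      w(i) -= eta * g(i)
      i += 1
    }
  }

  /**
   * AdaGrad step: acc accumulates squared gradients per coordinate,
   * and each coordinate's step is divided by sqrt(acc). Applied in place.
   * eps is an assumed small constant that avoids division by zero.
   */
  def adagradStep(
      w: Array[Double],
      g: Array[Double],
      acc: Array[Double],
      eta: Double,
      eps: Double = 1e-8): Unit = {
    var i = 0
    while (i < w.length) {
      acc(i) += g(i) * g(i)
      w(i) -= eta * g(i) / (math.sqrt(acc(i)) + eps)
      i += 1
    }
  }
}
```

With AdaGrad, coordinates that receive large or frequent gradients get smaller effective step sizes over time, which tends to suit sparse features.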

In addition, users can choose whether the FFMModel includes a global bias and one-way interactions.
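The effect of these two options can be seen in the FFM scoring function: the pairwise field-aware term is always present, while the global bias and the one-way (linear) terms are optional. The sketch below is an illustration only. FfmScore, Node, and the flag names useBias and useOneWay are hypothetical, and the layout latent(feature)(field) for the latent vectors is an assumption rather than the project's actual storage.

```scala
object FfmScore {
  // Hypothetical container, not the project's FFMModel.
  final case class Node(field: Int, feature: Int, value: Double)

  /**
   * Scores one example.
   * latent(feature)(field) is the latent vector a feature uses
   * when interacting with a feature from the given field.
   */
  def predict(
      nodes: Seq[Node],
      latent: Array[Array[Array[Double]]],
      w0: Double,
      w: Array[Double],
      useBias: Boolean,
      useOneWay: Boolean): Double = {
    // Optional global bias.
    var score = if (useBias) w0 else 0.0
    // Optional one-way (linear) terms.
    if (useOneWay) score += nodes.map(n => w(n.feature) * n.value).sum
    // Pairwise field-aware interactions over all i < j.
    for (i <- nodes.indices; j <- (i + 1) until nodes.length) {
      val a = nodes(i)
      val b = nodes(j)
      val va = latent(a.feature)(b.field)
      val vb = latent(b.feature)(a.field)
      val dot = va.zip(vb).map { case (x, y) => x * y }.sum
      score += dot * a.value * b.value
    }
    score
  }
}
```

Calling predict with both flags set to false returns only the pairwise interaction term, which corresponds to a model without global bias and one-way interactions.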

Contact & Feedback

If you encounter bugs, feel free to submit an issue or pull request.
