VinceShieh / Spark Ffm
Licence: apache-2.0
FFM (Field-Awared Factorization Machine) on Spark
Stars: ✭ 101
Programming Languages
scala
5932 projects
Labels
Projects that are alternatives of or similar to Spark Ffm
Hops Examples
Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops
Stars: ✭ 84 (-16.83%)
Mutual labels: spark
Spark On Kubernetes Helm
Spark on Kubernetes infrastructure Helm charts repo
Stars: ✭ 92 (-8.91%)
Mutual labels: spark
Schemer
Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Stars: ✭ 97 (-3.96%)
Mutual labels: spark
Cuesheet
A framework for writing Spark 2.x applications in a pretty way
Stars: ✭ 86 (-14.85%)
Mutual labels: spark
Spark Summit 2017 Sanfrancisco
spark summit 2017 SanFrancisco
Stars: ✭ 93 (-7.92%)
Mutual labels: spark
Udacity Data Engineering
Udacity Data Engineering Nano Degree (DEND)
Stars: ✭ 89 (-11.88%)
Mutual labels: spark
Relation extraction
Relation Extraction using Deep learning(CNN)
Stars: ✭ 96 (-4.95%)
Mutual labels: spark
Laravel Spark Google2fa
Google Authenticator support for Laravel Spark
Stars: ✭ 86 (-14.85%)
Mutual labels: spark
Spark Nlp Models
Models and Pipelines for the Spark NLP library
Stars: ✭ 88 (-12.87%)
Mutual labels: spark
Flint
Webex Bot SDK for Node.js (deprecated in favor of https://github.com/webex/webex-bot-node-framework)
Stars: ✭ 85 (-15.84%)
Mutual labels: spark
Logisland
Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-3.96%)
Mutual labels: spark
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+1224.75%)
Mutual labels: spark
Spark-FFM
A Spark-based implementation of Field-Awared Factorization Machine. See http://www.csie.ntu.edu.tw/~cjlin/papers/ffm.pdf
The data should be formatted as
label field1:feat1:val1 field2:feat2:val2
to fit FFM, that is to extends LIBSVM data format by adding field information to each feature.
Currently, we support paralleledSGD and paralledAdagrad optimization methods, as they are more efficient in dealing with large dataset.
Besides, user can also choose to have FFMModel with/without global bias and one-way interactions.
Contact & Feedback
If you encounter bugs, feel free to submit an issue or pull request.
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].