Movie Recommendation EngineMovie Recommender based on the MovieLens Dataset (ml-100k) using item-item collaborative filtering.
Stars: ✭ 21 (-51.16%)
DeeprecommenderDeep learning for recommender systems
Stars: ✭ 1,593 (+3604.65%)
recommenderNReco Recommender is a .NET port of Apache Mahout CF java engine (standalone, non-Hadoop version)
Stars: ✭ 35 (-18.6%)
raptorA lightweight product recommendation system (Item Based Collaborative Filtering) developed in Haskell.
Stars: ✭ 34 (-20.93%)
CornacA Comparative Framework for Multimodal Recommender Systems
Stars: ✭ 308 (+616.28%)
Movie Recommender SystemBasic Movie Recommendation Web Application using user-item collaborative filtering.
Stars: ✭ 85 (+97.67%)
RecommenderA C library for product recommendations/suggestions using collaborative filtering (CF)
Stars: ✭ 238 (+453.49%)
XlearningAI on Hadoop
Stars: ✭ 1,709 (+3874.42%)
GafferA large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (+3718.6%)
Parquet RsApache Parquet implementation in Rust
Stars: ✭ 144 (+234.88%)
ShifuAn end-to-end machine learning and data mining framework on Hadoop
Stars: ✭ 207 (+381.4%)
docker-hadoopDocker image for main Apache Hadoop components (Yarn/Hdfs)
Stars: ✭ 59 (+37.21%)
NutchApache Nutch is an extensible and scalable web crawler
Stars: ✭ 2,277 (+5195.35%)
SpydraEphemeral Hadoop clusters using Google Compute Platform
Stars: ✭ 128 (+197.67%)
HadoopcryptoledgerHadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Stars: ✭ 126 (+193.02%)
DynamometerA tool for scale and performance testing of HDFS with a specific focus on the NameNode.
Stars: ✭ 122 (+183.72%)
slopeonePHP implementation of the Weighted Slope One rating-based collaborative filtering scheme.
Stars: ✭ 85 (+97.67%)
Hadoop Attack LibraryA collection of pentest tools and resources targeting Hadoop environments
Stars: ✭ 228 (+430.23%)
Bigdata PlaygroundA complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (+311.63%)
IbisA pandas-like deferred expression system, with first-class SQL support
Stars: ✭ 1,630 (+3690.7%)
AsakusafwAsakusa Framework
Stars: ✭ 114 (+165.12%)
Big WhaleSpark、Flink等离线任务的调度以及实时任务的监控
Stars: ✭ 163 (+279.07%)
Parquet GoGo package to read and write parquet files. parquet is a file format to store nested data structures in a flat columnar data format. It can be used in the Hadoop ecosystem and with tools such as Presto and AWS Athena.
Stars: ✭ 114 (+165.12%)
HadoopApache Hadoop
Stars: ✭ 12,177 (+28218.6%)
Eel SdkBig Data Toolkit for the JVM
Stars: ✭ 140 (+225.58%)
phoenixApache Phoenix / Hbase Spring Boot Microservices
Stars: ✭ 23 (-46.51%)
Airflow PipelineAn Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Stars: ✭ 128 (+197.67%)
Awesome Learning实践源码库:https://github.com/jast90/bigdata 。 微信搜索Jast关注公众号,获取最新技术分享😯。
Stars: ✭ 197 (+358.14%)
Griffon VmGriffon Data Science Virtual Machine
Stars: ✭ 128 (+197.67%)
Devops Bash Tools550+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Kafka, Docker, APIs, Hadoop, SQL, PostgreSQL, MySQL, Hive, Impala, Travis CI, Jenkins, Concourse, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, .tmux.conf, .psqlrc ...
Stars: ✭ 226 (+425.58%)
Parquet4sRead and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
Stars: ✭ 125 (+190.7%)
Hive Jdbc Uber JarHive JDBC "uber" or "standalone" jar based on the latest Apache Hive version
Stars: ✭ 188 (+337.21%)
Hdfs ShellHDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS
Stars: ✭ 117 (+172.09%)
ambari-hdp-dockerDockerfiles and Docker Compose for HDP 2.6 with Blueprints
Stars: ✭ 23 (-46.51%)
DataxDataX is an open source universal ETL tool that support Cassandra, ClickHouse, DBF, Hive, InfluxDB, Kudu, MySQL, Oracle, Presto(Trino), PostgreSQL, SQL Server
Stars: ✭ 116 (+169.77%)
Deeplearning4jSuite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…
Stars: ✭ 12,277 (+28451.16%)
LuigiLuigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Stars: ✭ 15,226 (+35309.3%)
Xlearning Xdmlextremely distributed machine learning
Stars: ✭ 113 (+162.79%)
Avro Hadoop StarterExample MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
Stars: ✭ 110 (+155.81%)
LR-GCCFRevisiting Graph based Collaborative Filtering: A Linear Residual Graph Convolutional Network Approach, AAAI2020
Stars: ✭ 99 (+130.23%)
Hadoop ConnectorsLibraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
Stars: ✭ 218 (+406.98%)
PrestoThe official home of the Presto distributed SQL query engine for big data
Stars: ✭ 12,957 (+30032.56%)
WaterdropProduction Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+4216.28%)
Haproxy Configs80+ HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Elasticsearch, SolrCloud, HBase, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, Hue, ZooKeeper, SSH, RabbitMQ, Redis, Riak, Cloudera, OpenTSDB, InfluxDB, Prometheus, Kibana, Graphite, Rancher etc.
Stars: ✭ 106 (+146.51%)
Hadoop CommonMirror of Apache Hadoop common
Stars: ✭ 155 (+260.47%)
SparkrdmaRDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (+400%)
Movie recommend基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
Stars: ✭ 2,092 (+4765.12%)