connected-componentMap Reduce Implementation of Connected Component on Apache Spark
Stars: ✭ 68 (+70%)
Mutual labels: apache-spark, graphx
Awesome Community DetectionA curated list of community detection research papers with implementations.
Stars: ✭ 1,874 (+4585%)
Mutual labels: community-detection, bigclam
streamsx.kafkaRepository for integration with Apache Kafka
Stars: ✭ 13 (-67.5%)
Mutual labels: apache-spark
LabelPropagationA NetworkX implementation of Label Propagation from a "Near Linear Time Algorithm to Detect Community Structures in Large-Scale Networks" (Physical Review E 2008).
Stars: ✭ 101 (+152.5%)
Mutual labels: community-detection
SANSA-StackBig Data RDF Processing and Analytics Stack built on Apache Spark and Apache Jena http://sansa-stack.github.io/SANSA-Stack/
Stars: ✭ 130 (+225%)
Mutual labels: apache-spark
SimP-GCNImplementation of the WSDM 2021 paper "Node Similarity Preserving Graph Convolutional Networks"
Stars: ✭ 43 (+7.5%)
Mutual labels: graph-mining
net.jgp.books.spark.ch07Spark in Action, 2nd edition - chapter 7 - Ingestion from files
Stars: ✭ 13 (-67.5%)
Mutual labels: apache-spark
osm-parquetizerA converter for the OSM PBFs to Parquet files
Stars: ✭ 71 (+77.5%)
Mutual labels: apache-spark
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-2.5%)
Mutual labels: apache-spark
sparklygraphsOld repo for R interface for GraphFrames
Stars: ✭ 13 (-67.5%)
Mutual labels: apache-spark
awesome-toolscurated list of awesome tools and libraries for specific domains
Stars: ✭ 31 (-22.5%)
Mutual labels: apache-spark
PLSCPaddle Large Scale Classification Tools,supports ArcFace, CosFace, PartialFC, Data Parallel + Model Parallel. Model includes ResNet, ViT, DeiT, FaceViT.
Stars: ✭ 113 (+182.5%)
Mutual labels: large-scale
SparkoraPowerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟
Stars: ✭ 51 (+27.5%)
Mutual labels: apache-spark
learn-by-examplesReal-world Spark pipelines examples
Stars: ✭ 84 (+110%)
Mutual labels: apache-spark
learning-hadoop-and-sparkCompanion to Learning Hadoop and Learning Spark courses on Linked In Learning
Stars: ✭ 146 (+265%)
Mutual labels: apache-spark
geosparkbring sf to spark in production
Stars: ✭ 53 (+32.5%)
Mutual labels: apache-spark
M-NMFAn implementation of "Community Preserving Network Embedding" (AAAI 2017)
Stars: ✭ 119 (+197.5%)
Mutual labels: community-detection