Sparks: No description or website provided.
spark-transformers: A library for exporting Apache Spark MLlib models for use in any Java application with no other dependencies.
fb scraper: FBLYZE is a Facebook scraping and analysis system.
spark-operator: Operator for managing Spark clusters on Kubernetes and OpenShift.
sparklanes: A lightweight data processing framework for Apache Spark
Ginger: Opinionated RESTful Routing powered by Spark
platys-modern-data-platform: Support for dynamically generating modern data platforms with services such as Kafka, Spark, StreamSets, HDFS, ....
books: A collection of online books for data science, computer science and coding!
spark-hats: Nested array transformation helper extensions for Apache Spark
jobAnalytics and search: JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
GeoTriples: Publishing Big Geospatial data as Linked Open Geospatial Data
ros hadoop: Hadoop splittable InputFormat for ROS. Process rosbag with Hadoop, Spark, and other HDFS-compatible systems.
hse spark course: Repository of course materials for the HSE continuing professional education program (https://cs.hse.ru/dpo/) and for Apache Spark courses
datatile: A library for managing, validating, summarizing, and visualizing data.
dstlr: scalable knowledge graph construction from unstructured text
gallia-core: A schema-aware Scala library for data transformation
rdf2x: RDF2X converts big RDF datasets to the relational database model, CSV, JSON and ElasticSearch.
pytest-spark: pytest plugin to run the tests with support of pyspark
spark-utils: Basic framework utilities to quickly start writing production ready Apache Spark applications
SynapseML: Simple and Distributed Machine Learning
sbt-spark: Simple SBT plugin to configure Spark applications
tekniq: A framework designed around Kotlin providing Restful HTTP Client, JDBC DSL, Loading Cache, Configurations, Validations, and more
learning: Walkthrough notebooks for Deep Learning, Machine Learning, Reinforcement Learning, Spark, Statistics, Algorithms, Scala, Python
piglet: A compiler for Pig Latin to Spark and Flink.
cassandra.realtime: Different ways to process data into Cassandra in realtime with technologies such as Kafka, Spark, Akka, Flink
Spark-Scala-EKS: Spark Scala docker container sample for AWS testing - EKS & S3
emma: A quotation-based Scala DSL for scalable data analysis.
xskipper: An Extensible Data Skipping Framework
cobrix: A COBOL parser and Mainframe/EBCDIC data source for Apache Spark
SparkTwitterAnalysis: An Apache Spark standalone application using the Spark API in Scala. The project is built with Simple Build Tool (SBT).
MLBD: Materials for "Machine Learning on Big Data" course
spark-fm: A parallel implementation of factorization machines based on Spark