Utils4s: A collection of test cases and related reference material gathered while using Scala and Spark
Stars: ✭ 1,070 (+900%)
Almond: A Scala kernel for Jupyter
Stars: ✭ 1,354 (+1165.42%)
Ds Cheatsheets: List of Data Science Cheatsheets to rule the world
Stars: ✭ 9,452 (+8733.64%)
Ammonite Spark: Run Spark calculations from Ammonite
Stars: ✭ 88 (-17.76%)
Delta Architecture: Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline
Stars: ✭ 43 (-59.81%)
Apache Spark Hands On: Educational notes and hands-on problems with solutions for the Hadoop ecosystem
Stars: ✭ 74 (-30.84%)
Gatk: Official code repository for GATK versions 4 and up
Stars: ✭ 1,002 (+836.45%)
Pixiedust: Python helper library for Jupyter Notebooks
Stars: ✭ 998 (+832.71%)
Snappydata: Project SnappyData - memory-optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
Stars: ✭ 995 (+829.91%)
Kamu Cli: Next generation tool for decentralized exchange and transformation of semi-structured data
Stars: ✭ 69 (-35.51%)
Schemer: Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Stars: ✭ 97 (-9.35%)
Sparkmagic: Jupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (+791.59%)
Cuesheet: A framework for writing Spark 2.x applications in a pretty way
Stars: ✭ 86 (-19.63%)
Data Algorithms Book: MapReduce, Spark, Java, and Scala for Data Algorithms Book
Stars: ✭ 949 (+786.92%)
Fast Mrmr: An improved implementation of the classical feature selection method: minimum Redundancy and Maximum Relevance (mRMR).
Stars: ✭ 67 (-37.38%)
Spark: Apache Spark - A unified analytics engine for large-scale data processing
Stars: ✭ 31,618 (+29449.53%)
Sparktutorial: Source code for James Lee's Apache Spark with Java course
Stars: ✭ 105 (-1.87%)
Flint: A Time Series Library for Apache Spark
Stars: ✭ 878 (+720.56%)
Thingsboard: Open-source IoT Platform - Device management, data collection, processing and visualization.
Stars: ✭ 10,526 (+9737.38%)
Live log analyzer spark: Spark application for analyzing Apache access logs and detecting anomalies, with an accompanying Medium article.
Stars: ✭ 14 (-86.92%)
Hops Examples: Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops
Stars: ✭ 84 (-21.5%)
Sparkling Titanic: Training models with Apache Spark and PySpark for the Titanic Kaggle competition
Stars: ✭ 12 (-88.79%)
Spark Bigquery: Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
Stars: ✭ 65 (-39.25%)
Mare: MaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.
Stars: ✭ 11 (-89.72%)
Spark Py Notebooks: Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+1150.47%)
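Bigdata Interview: 🎯 🌟 [Big data interview questions] A collection of big-data interview questions gathered from around the web, together with the author's own summarized answers. Currently covers interview topics for the Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper frameworks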
Stars: ✭ 857 (+700.93%)
Tiledb Vcf: Efficient variant-call data storage and retrieval library using the TileDB storage library.
Stars: ✭ 26 (-75.7%)
Hadoop cookbook: Cookbook to install Hadoop 2.0+ using Chef
Stars: ✭ 82 (-23.36%)
Mobius: C# and F# language bindings and extensions to Apache Spark
Stars: ✭ 929 (+768.22%)
Kylo: Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.
Stars: ✭ 916 (+756.07%)
Silex: something to help you spark
Stars: ✭ 61 (-42.99%)
Mleap: MLeap - Deploy ML Pipelines to Production
Stars: ✭ 1,232 (+1051.4%)
Pyspark Examples: Code examples on Apache Spark using Python
Stars: ✭ 58 (-45.79%)
Spark On K8s Operator: Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Stars: ✭ 1,780 (+1563.55%)
Splash: Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (-1.87%)
Big Data: 🔧 Use dplyr to analyze Big Data 🐘
Stars: ✭ 93 (-13.08%)
Setl: A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-26.17%)