Spark Tda: SparkTDA is a package for Apache Spark providing topological data analysis functionality.
Stars: ✭ 45 (-83.75%)
Big Whale: Scheduling of offline tasks such as Spark and Flink, and monitoring of real-time tasks
Stars: ✭ 163 (-41.16%)
spark-demos: Collection of different demo applications using Apache Spark
Stars: ✭ 15 (-94.58%)
thrift2-hbase: thrift2-hbase component for Hyperf.
Stars: ✭ 14 (-94.95%)
Optimus: 🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+255.96%)
Vue Info Card: Simple and beautiful card component with an elegant spark line, for VueJS.
Stars: ✭ 159 (-42.6%)
openverse-catalog: Identifies and collects data on CC-licensed content across web crawl data and public APIs.
Stars: ✭ 27 (-90.25%)
Spark Workshop: Apache Spark™ and Scala Workshops
Stars: ✭ 224 (-19.13%)
Vagrant Projects: Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR
Stars: ✭ 34 (-87.73%)
Spark Flamegraph: Easy CPU Profiling for Apache Spark applications
Stars: ✭ 30 (-89.17%)
aaocp: A big data project that analyzes user behavior logs
Stars: ✭ 53 (-80.87%)
Pucket: Bucketing and partitioning system for Parquet
Stars: ✭ 29 (-89.53%)
Geni: A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-45.13%)
tpch-spark: TPC-H queries in Apache Spark SQL using native DataFrames API
Stars: ✭ 63 (-77.26%)
Sparkmonitor: Monitor Apache Spark from Jupyter Notebook
Stars: ✭ 154 (-44.4%)
Tedsds: Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Stars: ✭ 14 (-94.95%)
Urhox: Urho3D extension library
Stars: ✭ 13 (-95.31%)
Spark.jl: Julia binding for Apache Spark
Stars: ✭ 153 (-44.77%)
Mlfeature: Feature engineering toolkit for Spark MLlib.
Stars: ✭ 12 (-95.67%)
spark-data-sources: Developing Spark External Data Sources using the V2 API
Stars: ✭ 36 (-87%)
Spark Tsne: Distributed t-SNE via Apache Spark
Stars: ✭ 151 (-45.49%)
Big Data: 🔧 Use dplyr to analyze Big Data 🐘
Stars: ✭ 93 (-66.43%)
wasp: WASP is a framework for building complex real-time big data applications. It relies on a kind of Kappa/Lambda architecture, mainly leveraging Kafka and Spark. If you need to ingest huge amounts of heterogeneous data and analyze it through complex pipelines, this is the framework for you.
Stars: ✭ 19 (-93.14%)
Spark Swagger: Spark (http://sparkjava.com/) support for Swagger (https://swagger.io/)
Stars: ✭ 25 (-90.97%)
Benchm Ml: A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
Stars: ✭ 1,835 (+562.45%)
Chronicler: Scala toolchain for InfluxDB
Stars: ✭ 24 (-91.34%)
frovedis: Framework of vectorized and distributed data analytics
Stars: ✭ 59 (-78.7%)
Digitrecognizer: Java Convolutional Neural Network example for Handwritten Digit Recognition
Stars: ✭ 23 (-91.7%)
Spark With Python: Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-45.85%)
mango: Core utility library & data connectors designed for simpler usage in Scala
Stars: ✭ 41 (-85.2%)
Ruby Spark: Ruby wrapper for Apache Spark
Stars: ✭ 221 (-20.22%)
xxhadoop: Data analysis using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. These are my daily notes/code/demos. Don't fork, just star!
Stars: ✭ 37 (-86.64%)
Angel: A Flexible and Powerful Parameter Server for large-scale machine learning
Stars: ✭ 6,458 (+2231.41%)
Nd4j: Fast, Scientific and Numerical Computing for the JVM (NDArrays)
Stars: ✭ 1,742 (+528.88%)
Spark Movie Lens: An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (+168.95%)
Sagemaker Spark: A Spark library for Amazon SageMaker.
Stars: ✭ 219 (-20.94%)
Ammonite Spark: Run Spark calculations from Ammonite
Stars: ✭ 88 (-68.23%)
Spark Nlp Models: Models and Pipelines for the Spark NLP library
Stars: ✭ 88 (-68.23%)
Spark Excel: A Spark plugin for reading Excel files via Apache POI
Stars: ✭ 216 (-22.02%)
Rasterframes: Geospatial Raster support for Spark DataFrames
Stars: ✭ 142 (-48.74%)
Datavec: ETL Library for Machine Learning - data pipelines, data munging and wrangling
Stars: ✭ 272 (-1.81%)
Docker Spark Cluster: A simple Spark standalone cluster for your testing environment
Stars: ✭ 261 (-5.78%)
Succinct: Enabling queries on compressed data.
Stars: ✭ 257 (-7.22%)