Cleanframes: Type-class based data cleansing library for Apache Spark SQL
Stars: ✭ 75 (+92.31%)
Oap: Optimized Analytics Package for Spark* Platform
Stars: ✭ 343 (+779.49%)
yuzhouwan: Code Library for My Blog
Stars: ✭ 39 (+0%)
Lpa Detector: Optimize and improve the label propagation algorithm
Stars: ✭ 75 (+92.31%)
Sparkctr: CTR prediction model based on Spark (LR, GBDT, DNN)
Stars: ✭ 740 (+1797.44%)
SFDCRules: Simple yet powerful rule engine for Salesforce
Stars: ✭ 38 (-2.56%)
Luigi Warehouse: A Luigi-powered analytics / warehouse stack
Stars: ✭ 72 (+84.62%)
Scalnet: A Scala wrapper for Deeplearning4j, inspired by Keras. Scala + DL + Spark + GPUs
Stars: ✭ 342 (+776.92%)
stormnode: Node.js client for storm.dev
Stars: ✭ 11 (-71.79%)
Kontextfrei: Writing application logic for Spark jobs that can be unit-tested without a SparkContext
Stars: ✭ 67 (+71.79%)
Rsparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (+66.67%)
hadoopoffice: Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
Stars: ✭ 56 (+43.59%)
W2v: Word2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (+64.1%)
Ytk Learn: Ytk-learn is a distributed machine learning library that implements most popular machine learning algorithms (GBDT, GBRT, Mixture Logistic Regression, Gradient Boosting Soft Tree, Factorization Machines, Field-aware Factorization Machines, Logistic Regression, Softmax).
Stars: ✭ 337 (+764.1%)
Pysparkgeoanalysis: 🌐 Interactive Workshop on GeoAnalysis using PySpark
Stars: ✭ 63 (+61.54%)
Sparkmonitor: Monitor Apache Spark from Jupyter Notebook
Stars: ✭ 154 (+294.87%)
Sparkle: Haskell on Apache Spark.
Stars: ✭ 419 (+974.36%)
docker-apex-stack: Utility scripts for creating an Oracle Application Express stack as a Docker container.
Stars: ✭ 67 (+71.79%)
Quill: Compile-time Language Integrated Queries for Scala
Stars: ✭ 1,998 (+5023.08%)
Waimak: An open-source framework that makes it easier to create complex data flows in Apache Spark.
Stars: ✭ 60 (+53.85%)
Zemberek Nlp Server: REST Docker server built on the Zemberek Turkish NLP Java library
Stars: ✭ 60 (+53.85%)
Pmd: An extensible multilanguage static code analyzer.
Stars: ✭ 3,667 (+9302.56%)
Rumble: ⛈️ Rumble 1.11.0 "Banyan Tree" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (+48.72%)
xxhadoop: Data analysis using Hadoop/Spark/Storm/ElasticSearch/machine learning, etc. These are my daily notes, code, and demos. Don't fork, just star!
Stars: ✭ 37 (-5.13%)
Docker Hadoop: A Docker container with a full Hadoop cluster setup with Spark and Zeppelin
Stars: ✭ 54 (+38.46%)
Cook: Fair job scheduler on Kubernetes and Mesos for batch workloads and Spark
Stars: ✭ 314 (+705.13%)
Spark Submit Ui: A Play Framework-based UI for submitting Spark applications
Stars: ✭ 53 (+35.9%)
ApexConfigs: Apex Legends configs for a competitive player
Stars: ✭ 52 (+33.33%)
Spark Nkp: Natural Korean Processor for Apache Spark
Stars: ✭ 50 (+28.21%)
Hail: Scalable genomic data analysis.
Stars: ✭ 706 (+1710.26%)
Awesome Recommendation Engine: A tiny project that brings together what I learned in the Big Data Expert course from formacionhadoop.com. The idea is to show how to work with Apache Spark Streaming, Kafka, MongoDB, and Spark machine learning algorithms.
Stars: ✭ 47 (+20.51%)
apex-tmLanguage: Salesforce Apex Language syntax grammar used for colorization
Stars: ✭ 27 (-30.77%)
Spark Tda: SparkTDA is a package for Apache Spark providing topological data analysis functionality.
Stars: ✭ 45 (+15.38%)
Coolplayspark: Cool Play Spark: Spark source code analysis, Spark libraries, and more
Stars: ✭ 3,318 (+8407.69%)
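Flink Recommandsystem Demo: 🚁🚀 A real-time product recommendation system built on Flink. Flink computes product popularity and stores it in a Redis cache, analyzes log data, and writes user-profile tags and real-time records to HBase. When a user requests recommendations, the popularity ranking is re-sorted according to the user profile, and the collaborative-filtering and tag-based recommendation modules attach related products to each item in the new ranking before the final list is returned to the user.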
Stars: ✭ 3,115 (+7887.18%)
Apex Recipes: A library of concise, meaningful examples of Apex code for common use cases following best practices.
Stars: ✭ 307 (+687.18%)
Spark.jl: Julia binding for Apache Spark
Stars: ✭ 153 (+292.31%)
Powderkeg: Live-coding the cluster!
Stars: ✭ 152 (+289.74%)
webmorph: Average and morph faces online http://webmorph.org/
Stars: ✭ 55 (+41.03%)
Snappydata: Project SnappyData, a memory-optimized analytics database based on Apache Spark™ and Apache Geode™. Stream, transact, analyze, and predict in one cluster
Stars: ✭ 995 (+2451.28%)
Objectmerge: Open-source solution for merging Salesforce objects and their related objects.
Stars: ✭ 35 (-10.26%)
Spark Flamegraph: Easy CPU profiling for Apache Spark applications
Stars: ✭ 30 (-23.08%)
Apex Test Tracker: Lightweight native continuous integration tool for Salesforce
Stars: ✭ 12 (-69.23%)
Spark Tsne: Distributed t-SNE via Apache Spark
Stars: ✭ 151 (+287.18%)