3-D-Scene-Graph3D scene graph generator implemented in Pytorch.
Stars: ✭ 52 (+100%)
ElassandraElassandra = Elasticsearch + Apache Cassandra
Stars: ✭ 1,610 (+6092.31%)
objectiv-analyticsPowerful product analytics for data teams, with full control over data & models.
Stars: ✭ 399 (+1434.62%)
Spark LucenerddSpark RDD with Lucene's query and entity linkage capabilities
Stars: ✭ 114 (+338.46%)
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (+50%)
Js SparkRealtime calculation distributed system. AKA distributed lodash
Stars: ✭ 187 (+619.23%)
ElephasDistributed Deep learning with Keras & Spark
Stars: ✭ 1,521 (+5750%)
anovosAnovos - An Open Source Library for Scalable feature engineering Using Apache-Spark
Stars: ✭ 77 (+196.15%)
AzuredatabricksbestpracticesVersion 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs
Stars: ✭ 186 (+615.38%)
BigdataclassTwo-day workshop that covers how to use R to interact databases and Spark
Stars: ✭ 110 (+323.08%)
focallossFocal Loss of multi-classification in tensorflow
Stars: ✭ 75 (+188.46%)
Kotlin Spark ApiThis projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
Stars: ✭ 183 (+603.85%)
primrosePrimrose modeling framework for simple production models
Stars: ✭ 33 (+26.92%)
Seldon ServerMachine Learning Platform and Recommendation Engine built on Kubernetes
Stars: ✭ 1,435 (+5419.23%)
Spark On K8s OperatorKubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Stars: ✭ 1,780 (+6746.15%)
xgboost-smote-detect-fraudCan we predict accurately on the skewed data? What are the sampling techniques that can be used. Which models/techniques can be used in this scenario? Find the answers in this code pattern!
Stars: ✭ 59 (+126.92%)
SplashSplash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (+303.85%)
dlsaDistributed least squares approximation (dlsa) implemented with Apache Spark
Stars: ✭ 25 (-3.85%)
Spark FfmFFM (Field-Awared Factorization Machine) on Spark
Stars: ✭ 101 (+288.46%)
fedora-primeSimple program to switch between intel and nvidia gpu
Stars: ✭ 24 (-7.69%)
DeepPixelAn open-source Python package for making computer vision and image processing simpler
Stars: ✭ 21 (-19.23%)
LogislandScalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (+273.08%)
kuwalaKuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…
Stars: ✭ 474 (+1723.08%)
SpeechEnhancementCombining Weighted Multi-resolution STFT Loss and Distance Fusion to Optimize Speech Enhancement Generative Adversarial Networks
Stars: ✭ 49 (+88.46%)
RFDA-PyTorchOfficial Code for 'Recursive Fusion and Deformable Spatiotemporal Attention for Video Compression Artifact Reduction' - ACM Multimedia2021 (ACMMM2021) Accepted Paper Task: Video Quality Enhancement / Video Compression Artifact Reduction
Stars: ✭ 44 (+69.23%)
RoaringbitmapA better compressed bitset in Java
Stars: ✭ 2,460 (+9361.54%)
Repository个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。
Stars: ✭ 92 (+253.85%)
Big Data🔧 Use dplyr to analyze Big Data 🐘
Stars: ✭ 93 (+257.69%)
awesome-open-mlopsThe Fuzzy Labs guide to the universe of open source MLOps
Stars: ✭ 304 (+1069.23%)
splinkImplementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
Stars: ✭ 181 (+596.15%)
DataScienceTutorials.jlA set of tutorials to show how to use Julia for data science (DataFrames, MLJ, ...)
Stars: ✭ 94 (+261.54%)
FlintWebex Bot SDK for Node.js (deprecated in favor of https://github.com/webex/webex-bot-node-framework)
Stars: ✭ 85 (+226.92%)
Groundbreaking-PapersML Research paper summaries, annotated papers and implementation walkthroughs
Stars: ✭ 90 (+246.15%)
Spark StatesCustom state store providers for Apache Spark
Stars: ✭ 83 (+219.23%)
wildebeestFile processing pipelines
Stars: ✭ 86 (+230.77%)
Sparkstreaming💥 🚀 封装sparkstreaming动态调节batch time(有数据就执行计算);🚀 支持运行过程中增删topic;🚀 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。
Stars: ✭ 179 (+588.46%)
XsqlUnified SQL Analytics Engine Based on SparkSQL
Stars: ✭ 176 (+576.92%)
fastapi-templateCompletely Scalable FastAPI based template for Machine Learning, Deep Learning and any other software project which wants to use Fast API as an API framework.
Stars: ✭ 156 (+500%)