Opaque
An encrypted data analytics platform
Stars: ✭ 129 (-7.19%)
Waterdrop
Production Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+1235.25%)
Scala Samples
Pieces of Scala code that explain Scala syntax and related topics, such as what you can do with it
Stars: ✭ 125 (-10.07%)
Bigdataclass
Two-day workshop that covers how to use R to interact with databases and Spark
Stars: ✭ 110 (-20.86%)
Arflow
The official PyTorch implementation of the paper "Learning by Analogy: Reliable Supervision from Transformations for Unsupervised Optical Flow Estimation".
Stars: ✭ 134 (-3.6%)
Paysage
Unsupervised learning and generative models in Python/PyTorch.
Stars: ✭ 109 (-21.58%)
Logdeep
Log anomaly detection toolkit including DeepLog
Stars: ✭ 125 (-10.07%)
Diff2vec
Reference implementation of Diffusion2Vec (Complenet 2018) built on Gensim and NetworkX.
Stars: ✭ 108 (-22.3%)
Pyspark Cheatsheet
🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-22.3%)
Csi
CSI: Novelty Detection via Contrastive Learning on Distributionally Shifted Instances (NeurIPS 2020)
Stars: ✭ 123 (-11.51%)
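Flink Learning
Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink introduction, concepts, principles, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, and more, plus case studies of large-scale Flink production projects (PV/UV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). Support for the author's column "Big Data Real-Time Computing Engine: Flink in Practice and Performance Optimization" is welcome.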
Stars: ✭ 11,378 (+8085.61%)
Logigsk
A Linux-based software package to control LEDs on Logitech G910, G810, G610 and G410.
Stars: ✭ 107 (-23.02%)
Cleanlab
The standard package for machine learning with noisy labels, finding mislabeled data, and uncertainty quantification. Works with most datasets and models.
Stars: ✭ 2,526 (+1717.27%)
Spark On K8s Operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Stars: ✭ 1,780 (+1180.58%)
Gaffer
A large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (+1081.29%)
Sparktutorial
Source code for James Lee's Apache Spark with Java course
Stars: ✭ 105 (-24.46%)
Sfmlearner
An unsupervised learning framework for depth and ego-motion estimation from monocular videos
Stars: ✭ 1,661 (+1094.96%)
Splash
Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (-24.46%)
Linkedrw
A simple CLI to create your resume and personal website based on your LinkedIn profile or a JSON file
Stars: ✭ 104 (-25.18%)
Zparkio
Boilerplate framework to use Spark and ZIO together.
Stars: ✭ 121 (-12.95%)
Keras Oneclassanomalydetection
[5 FPS - 150 FPS] Learning Deep Features for One-Class Classification (AnomalyDetection). Supports Raspberry Pi 3. Converts to TensorFlow, ONNX, Caffe, PyTorch. Implemented in Python with OpenVINO/TensorFlow Lite.
Stars: ✭ 102 (-26.62%)
Airflow Pipeline
An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Stars: ✭ 128 (-7.91%)
Ddflow
DDFlow: Learning Optical Flow with Unlabeled Data Distillation
Stars: ✭ 101 (-27.34%)
Quicksql
A Flexible, Fast, Federated (3F) SQL Analysis Middleware for Multiple Data Sources
Stars: ✭ 1,821 (+1210.07%)
Vizuka
Explore high-dimensional datasets and how your algorithm handles specific regions.
Stars: ✭ 100 (-28.06%)
Kinesis Sql
Kinesis Connector for Structured Streaming
Stars: ✭ 120 (-13.67%)
Almond
A Scala kernel for Jupyter
Stars: ✭ 1,354 (+874.1%)
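Spring Boot Quick
🌿 Quick-learning examples based on Spring Boot, integrating open-source frameworks the author has encountered, such as rabbitmq (delayed queues), Kafka, jpa, redis, oauth2, swagger, jsp, docker, spring-batch, exception handling, log output, multi-module development, multi-environment packaging, cache, web crawlers, jwt, GraphQL, dubbo, zookeeper, Async, and more 📌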
Stars: ✭ 1,819 (+1208.63%)
Awesome Transfer Learning
Best transfer learning and domain adaptation resources (papers, tutorials, datasets, etc.)
Stars: ✭ 1,349 (+870.5%)
Calc
Convolutional Autoencoder for Loop Closure
Stars: ✭ 119 (-14.39%)
And
Official PyTorch implementation of the ICML'19 paper: Unsupervised Deep Learning by Neighbourhood Discovery
Stars: ✭ 133 (-4.32%)
Linkedin Api Php Client
LinkedIn API PHP SDK with OAuth 2 support. Can be used for social sign-in or sharing on LinkedIn. Has good usage examples.
Stars: ✭ 88 (-36.69%)
Text Summarizer
Python Framework for Extractive Text Summarization
Stars: ✭ 96 (-30.94%)
Cube.js
📊 Cube - Open-Source Analytics API for Building Data Apps
Stars: ✭ 11,983 (+8520.86%)
Repository
Personal study knowledge base covering data warehouse modeling, real-time computing, big data, Java, algorithms, and more.
Stars: ✭ 92 (-33.81%)
Spring Shiro Spark
Spring-Shiro-Spark is an attempt to integrate Spring-Boot Hibernate Spark Spark-SQL Shiro iView VueJs... ...
Stars: ✭ 114 (-17.99%)
Ammonite Spark
Run Spark calculations from Ammonite
Stars: ✭ 88 (-36.69%)
Big Data
🔧 Use dplyr to analyze Big Data 🐘
Stars: ✭ 93 (-33.09%)
Foremast
Foremast adds application resiliency to Kubernetes by leveraging machine-learned patterns of application health to keep applications healthy and stable
Stars: ✭ 115 (-17.27%)
Daily Coding Problem
Series of problems 💯 and solutions ✅ asked by the Daily Coding Problem 👨🎓 website.
Stars: ✭ 90 (-35.25%)
Horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Stars: ✭ 11,943 (+8492.09%)
Spark Lucenerdd
Spark RDD with Lucene's query and entity linkage capabilities
Stars: ✭ 114 (-17.99%)
Pytorch cpp
Deep Learning sample programs using PyTorch in C++
Stars: ✭ 114 (-17.99%)
Spark Nlp Models
Models and Pipelines for the Spark NLP library
Stars: ✭ 88 (-36.69%)
Bayesloop
Probabilistic programming framework that facilitates objective model selection for time-varying parameter models.
Stars: ✭ 87 (-37.41%)
Abris
Avro SerDe for Apache Spark structured APIs.
Stars: ✭ 130 (-6.47%)
Cape Python
Collaborate on privacy-preserving policy for data science projects in Pandas and Apache Spark
Stars: ✭ 125 (-10.07%)
Dex Test Parser
Find all test methods in an Android instrumentation APK
Stars: ✭ 87 (-37.41%)
Hadoopcryptoledger
Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Stars: ✭ 126 (-9.35%)
Python Bigdata
Data science and Big Data with Python
Stars: ✭ 112 (-19.42%)