Freestyle
A cohesive & pragmatic framework of FP-centric Scala libraries
Stars: ✭ 627 (+351.08%)
Csi
CSI: Novelty Detection via Contrastive Learning on Distributionally Shifted Instances (NeurIPS 2020)
Stars: ✭ 123 (-11.51%)
Adtk
A Python toolkit for rule-based/unsupervised anomaly detection in time series
Stars: ✭ 615 (+342.45%)
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+3969.06%)
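Flink Learning
Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, principles, hands-on practice, performance tuning, and source-code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, plus case studies of large production Flink projects (PV/UV, log storage, real-time deduplication of ten-billion-scale data, monitoring and alerting). Support for my column "Big Data Real-Time Computing Engine: Flink in Practice and Performance Optimization" is welcome.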
Stars: ✭ 11,378 (+8085.61%)
Alibi Detect
Algorithms for outlier and adversarial instance detection, concept drift and metrics.
Stars: ✭ 604 (+334.53%)
Midas
Anomaly Detection on Dynamic (time-evolving) Graphs in a Real-time and Streaming manner. Detects intrusions (DoS and DDoS attacks), fraud, and fake rating anomalies.
Stars: ✭ 591 (+325.18%)
Mongo Spark
The MongoDB Spark Connector
Stars: ✭ 588 (+323.02%)
Logigsk
A Linux-based software package to control LEDs on Logitech G910, G810, G610 and G410.
Stars: ✭ 107 (-23.02%)
Alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
Stars: ✭ 5,379 (+3769.78%)
Fast Mrmr
An improved implementation of the classical feature selection method: minimum Redundancy and Maximum Relevance (mRMR).
Stars: ✭ 67 (-51.8%)
Sparklearning
Learning Apache Spark, including code and data. Most parts can run locally.
Stars: ✭ 558 (+301.44%)
Cleanlab
The standard package for machine learning with noisy labels, finding mislabeled data, and uncertainty quantification. Works with most datasets and models.
Stars: ✭ 2,526 (+1717.27%)
Loghub
A large collection of system log datasets for AI-powered log analytics
Stars: ✭ 551 (+296.4%)
Sine
A PyTorch Implementation of "SINE: Scalable Incomplete Network Embedding" (ICDM 2018).
Stars: ✭ 67 (-51.8%)
Justenoughscalaforspark
A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.
Stars: ✭ 538 (+287.05%)
Spark On K8s Operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Stars: ✭ 1,780 (+1180.58%)
Lopq
Training of Locally Optimized Product Quantization (LOPQ) models for approximate nearest neighbor search of high dimensional data in Python and Spark.
Stars: ✭ 530 (+281.29%)
Thingsboard
Open-source IoT Platform - Device management, data collection, processing and visualization.
Stars: ✭ 10,526 (+7472.66%)
Hidt
Official repository for the paper "High-Resolution Daytime Translation Without Domain Labels" (CVPR2020, Oral)
Stars: ✭ 513 (+269.06%)
Gaffer
A large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (+1081.29%)
Operative Framework
Operative framework is an OSINT investigation framework. You can interact with multiple targets, execute multiple modules, create links between targets, export reports to PDF, add notes to targets or results, interact with a RESTful API, and write your own modules.
Stars: ✭ 511 (+267.63%)
Coursera Ml Py
Python programming assignments for Machine Learning by Prof. Andrew Ng on Coursera
Stars: ✭ 1,140 (+720.14%)
Magellan
Geo Spatial Data Analytics on Spark
Stars: ✭ 507 (+264.75%)
Sparktutorial
Source code for James Lee's Apache Spark with Java course
Stars: ✭ 105 (-24.46%)
Pointblank
Data validation and organization of metadata for data frames and database tables
Stars: ✭ 480 (+245.32%)
Spark Bigquery
Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
Stars: ✭ 65 (-53.24%)
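Pdf
Programming e-books, e-books, and programming books, covering C, C#, Docker, Elasticsearch, Git, Hadoop, HeadFirst, Java, Javascript, jvm, Kafka, Linux, Maven, MongoDB, MyBatis, MySQL, Netty, Nginx, Python, RabbitMQ, Redis, Scala, Solr, Spark, Spring, SpringBoot, SpringCloud, TCPIP, Tomcat, Zookeeper, artificial intelligence, big data, concurrent programming, databases, data mining, new interview questions, architecture design, algorithms, computer science, design patterns, software testing, refactoring and optimization, and more categories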
Stars: ✭ 12,009 (+8539.57%)
Sfmlearner
An unsupervised learning framework for depth and ego-motion estimation from monocular videos
Stars: ✭ 1,661 (+1094.96%)
W2v
Word2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-53.96%)
Urhox
Urho3D extension library
Stars: ✭ 13 (-90.65%)
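Bdp Dataplatform
Data platform for big data ecosystem solutions: a big data solution built on big data, data platforms, microservices, machine learning, e-commerce, automated operations, DevOps, a container deployment platform, and data platform collection, storage, computing, development, and application building.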
Stars: ✭ 456 (+228.06%)
Splash
Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (-24.46%)
Wdbgark
WinDBG Anti-RootKit Extension
Stars: ✭ 450 (+223.74%)
Neuralhmm
Code for the paper on unsupervised learning of Neural Hidden Markov Models
Stars: ✭ 64 (-53.96%)
Bigdataie
A collection of big data blogs, written-test questions, tutorials, projects, and interview experiences
Stars: ✭ 445 (+220.14%)
God Of Bigdata
Focused on big data learning and interviews; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...
Stars: ✭ 6,008 (+4222.3%)
Enlightengan
[IEEE TIP'2021] "EnlightenGAN: Deep Light Enhancement without Paired Supervision" by Yifan Jiang, Xinyu Gong, Ding Liu, Yu Cheng, Chen Fang, Xiaohui Shen, Jianchao Yang, Pan Zhou, Zhangyang Wang
Stars: ✭ 434 (+212.23%)
Linkedrw
A simple CLI to create your resume and personal website based on your LinkedIn profile or a JSON file
Stars: ✭ 104 (-25.18%)
Roffildlibrary
Library for MQL5 (MetaTrader) with Python, Java, Apache Spark, AWS
Stars: ✭ 63 (-54.68%)
Dji Firmware Tools
Tools for handling firmware of DJI products, with a focus on quadcopters.
Stars: ✭ 424 (+205.04%)
Zparkio
Boilerplate framework to use Spark and ZIO together.
Stars: ✭ 121 (-12.95%)
Bio corex
A flexible version of CorEx developed for bio-data challenges that handles missing data, continuous/discrete variables, multi-CPU, overlapping structure, and includes visualizations
Stars: ✭ 112 (-19.42%)
Flint
Webex Bot SDK for Node.js (deprecated in favor of https://github.com/webex/webex-bot-node-framework)
Stars: ✭ 85 (-38.85%)
Sparkling Titanic
Training models with Apache Spark and PySpark for the Titanic Kaggle competition
Stars: ✭ 12 (-91.37%)
Mlfeature
Feature engineering toolkit for Spark MLlib.
Stars: ✭ 12 (-91.37%)
Hops Examples
Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops
Stars: ✭ 84 (-39.57%)
Mare
MaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.
Stars: ✭ 11 (-92.09%)
Sparkjni
A heterogeneous Apache Spark framework.
Stars: ✭ 11 (-92.09%)
Splitbrainauto
Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction. In CVPR, 2017.
Stars: ✭ 137 (-1.44%)