Apache Spark Hands OnEducational notes,Hands on problems w/ solutions for hadoop ecosystem
Stars: ✭ 74 (-47.14%)
CamusMirror of Linkedin's Camus
Stars: ✭ 81 (-42.14%)
Repository个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。
Stars: ✭ 92 (-34.29%)
AzopsThis container image can be used to deploy ARM templates at Tenant, Management Group, Subscription and Resource Group scope and export current Azure configuration hierarchy in Git repository.
Stars: ✭ 109 (-22.14%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+855.71%)
Flinkstreamsql基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法
Stars: ✭ 1,682 (+1101.43%)
LogislandScalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-30.71%)
RecommendersBest Practices on Recommendation Systems
Stars: ✭ 11,818 (+8341.43%)
StormkafkamonDumps state of Storm Kafka consumers
Stars: ✭ 99 (-29.29%)
TwitchrecoverTwitch VOD tool which recovers all VODs including those that are sub only or deleted.
Stars: ✭ 123 (-12.14%)
Oauth2 AzureAzure AD provider for the OAuth 2.0 Client.
Stars: ✭ 140 (+0%)
Reddit sse streamA Server Side Event stream to deliver Reddit comments and submissions in near real-time to a client.
Stars: ✭ 39 (-72.14%)
Hls.jsHLS.js is a JavaScript library that plays HLS in browsers with support for MSE.
Stars: ✭ 10,791 (+7607.86%)
Azure SignalrAzure SignalR Service SDK for .NET
Stars: ✭ 137 (-2.14%)
Go Kafka ExampleGolang Kafka consumer and producer example
Stars: ✭ 108 (-22.86%)
Vsts Work Item MigratorWiMigrator is a command line tool for migrating work items between VSTS/TFS projects
Stars: ✭ 124 (-11.43%)
Components ContribCommunity driven, reusable components for distributed apps
Stars: ✭ 131 (-6.43%)
Pyspark Cheatsheet🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-22.86%)
HnswlibJava library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (-22.86%)
EchartsApache ECharts is a powerful, interactive charting and data visualization library for browser
Stars: ✭ 49,119 (+34985%)
JiosaavnapiAn unofficial API for JioSaavn written in Python 3
Stars: ✭ 123 (-12.14%)
SupersetApache Superset is a Data Visualization and Data Exploration Platform
Stars: ✭ 42,634 (+30352.86%)
Springboot Labs一个涵盖六个专栏:Spring Boot 2.X、Spring Cloud、Spring Cloud Alibaba、Dubbo、分布式消息队列、分布式事务的仓库。希望胖友小手一抖,右上角来个 Star,感恩 1024
Stars: ✭ 12,804 (+9045.71%)
SignalsGeneral purpose modern C++ Signal-Slot providing ease of use, flexibility and extremely high performance aiming to replace traditional interfaces in real-time applications
Stars: ✭ 137 (-2.14%)
DynamictranslatorInstant translation application for windows in .NET 🎪
Stars: ✭ 131 (-6.43%)
Apiproject[https://www.sofineday.com], golang项目开发脚手架,集成最佳实践(gin+gorm+go-redis+mongo+cors+jwt+json日志库zap(支持日志收集到kafka或mongo)+消息队列kafka+微信支付宝支付gopay+api加密+api反向代理+go modules依赖管理+headless爬虫chromedp+makefile+二进制压缩+livereload热加载)
Stars: ✭ 124 (-11.43%)
Awesome BigdataA curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 10,478 (+7384.29%)
Spark Infotheoretic Feature SelectionThis package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is based on the common theoretic framework presented by Gavin Brown. Implementations of mRMR, InfoGain, JMI and other commonly used FS filters are provided.
Stars: ✭ 123 (-12.14%)
Mongolastic🚥 A dataset migration tool from MongoDB to Elasticsearch and vice versa.
Stars: ✭ 131 (-6.43%)
Vs DeployVisual Studio Code extension that provides commands to deploy files of a workspace to a destination.
Stars: ✭ 123 (-12.14%)
Syslog Ngsyslog-ng is an enhanced log daemon, supporting a wide range of input and output methods: syslog, unstructured text, queueing, SQL & NoSQL.
Stars: ✭ 1,555 (+1010.71%)
SodAn Embedded Computer Vision & Machine Learning Library (CPU Optimized & IoT Capable)
Stars: ✭ 1,460 (+942.86%)
Tennis Crystal BallUltimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-23.57%)
Isolation ForestA Spark/Scala implementation of the isolation forest unsupervised outlier detection algorithm.
Stars: ✭ 139 (-0.71%)
LogigskA Linux based software package to control led's on Logitech G910, G810, G610 and G410.
Stars: ✭ 107 (-23.57%)
Vscode MavenVSCode extension "Maven for Java"
Stars: ✭ 107 (-23.57%)
CliflixWatch anything instantaneously, just write its name.
Stars: ✭ 1,439 (+927.86%)
Markup.mlError-recovering streaming HTML5 and XML parsers
Stars: ✭ 122 (-12.86%)
KaffeAn opinionated Elixir wrapper around brod, the Erlang Kafka client, that supports encrypted connections to Heroku Kafka out of the box.
Stars: ✭ 106 (-24.29%)
Metronome Metronome is a distributed and fault-tolerant event scheduler
Stars: ✭ 131 (-6.43%)
Vc StorefrontVirtoCommerce Storefront for ASP.NET Core 3.1 repository
Stars: ✭ 122 (-12.86%)
SupermanSuperman是什么:构建Java 高级开发技术的知识体系,从基础不断打怪升级成为超人之路(更新中.......)
Stars: ✭ 106 (-24.29%)
DtcraftA High-performance Cluster Computing Engine
Stars: ✭ 122 (-12.86%)
Streaming RoomStreaming room in Node.js, rtmp, hsl, html5 videojs player
Stars: ✭ 106 (-24.29%)
WaterdropWaterDrop is a standalone Karafka component library for generating Kafka messages
Stars: ✭ 136 (-2.86%)
NekoA self hosted virtual browser (rabb.it clone) that runs in docker.
Stars: ✭ 1,957 (+1297.86%)
Spark AlchemyCollection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Stars: ✭ 122 (-12.86%)
GriddbGridDB is a next-generation open source database that makes time series IoT and big data fast,and easy.
Stars: ✭ 1,587 (+1033.57%)