Yauaa
Yet Another UserAgent Analyzer
Stars: ✭ 472 (+239.57%)
dockerfiles
Multi docker container images for main Big Data Tools (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ...).
Stars: ✭ 29 (-79.14%)
ETL-Starter-Kit
📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.
Stars: ✭ 21 (-84.89%)
cloud
Cloud computing: environment setup and configuration files for Hadoop, Hive, Hue, Oozie, Sqoop, HBase, and ZooKeeper.
Stars: ✭ 48 (-65.47%)
Drill
Apache Drill is a distributed MPP query layer for self-describing data
Stars: ✭ 1,619 (+1064.75%)
Dataspherestudio
DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+759.71%)
Bigdataguide
Big data learning: learn big data from scratch, including study videos and interview materials for each stage of big data learning.
Stars: ✭ 817 (+487.77%)
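litemall-dw
A big data project based on the open-source Litemall e-commerce project, including front-end event tracking (openresty+lua) and back-end event tracking, a five-layer data warehouse, real-time computing, and user profiling. The big data platform uses CDH6.3.2 (scripted with vagrant+ansible) and also includes Azkaban workflows.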
Stars: ✭ 36 (-74.1%)
piglet
A compiler for Pig Latin to Spark and Flink.
Stars: ✭ 23 (-83.45%)
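Bdp Dataplatform
Data platform for big data ecosystem solutions: a big data solution built on big data, data platform, microservices, machine learning, e-commerce, automated operations, DevOps, a container deployment platform, and data platform collection, storage, computing, development, and application building.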
Stars: ✭ 456 (+228.06%)
Hops Examples
Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops
Stars: ✭ 84 (-39.57%)
Repository
Personal study knowledge base covering data warehouse modeling, real-time computing, big data, Java, algorithms, and more.
Stars: ✭ 92 (-33.81%)
hadoopoffice
HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
Stars: ✭ 56 (-59.71%)
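dpkb
A collection of big data content, including distributed storage engines, distributed compute engines, data warehouse construction, and more. Keywords: Hadoop, HBase, ES, Kudu, Hive, Presto, Spark, Flink, Kylin, ClickHouse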
Stars: ✭ 123 (-11.51%)
Hadoopcryptoledger
Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Stars: ✭ 126 (-9.35%)
God Of Bigdata
Focused on big data learning and interviews; start your road to big data mastery. Flink/Spark/Hadoop/Hbase/Hive...
Stars: ✭ 6,008 (+4222.3%)
Quicksql
A Flexible, Fast, Federated (3F) SQL Analysis Middleware for Multiple Data Sources
Stars: ✭ 1,821 (+1210.07%)
TiBigData
TiDB connectors for Flink/Hive/Presto
Stars: ✭ 192 (+38.13%)
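DataX-src
DataX is a widely used offline data synchronization tool/platform for heterogeneous data, providing efficient data synchronization between heterogeneous data sources including MySQL, Oracle, SqlServer, Postgre, HDFS, Hive, ADS, HBase, OTS, and ODPS.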
Stars: ✭ 21 (-84.89%)
djburger
Framework for safe and maintainable web-projects.
Stars: ✭ 75 (-46.04%)
flink-client
Java library for managing Apache Flink via the Monitoring REST API
Stars: ✭ 48 (-65.47%)
PyScholar
A 'supervised' parser for Google Scholar
Stars: ✭ 74 (-46.76%)
HiveRunner
An Open Source unit test framework for Hive queries based on JUnit 4 and 5
Stars: ✭ 244 (+75.54%)
fastproto
FastProto is a binary data processing tool written in Java.
Stars: ✭ 65 (-53.24%)
BBob
⚡️ Blazing-fast js-bbcode-parser, bbcode js, that transforms and parses to AST with plugin support in pure javascript, no dependencies
Stars: ✭ 133 (-4.32%)
precompress
Generate pre-compressed .gz and .br files for static web servers
Stars: ✭ 27 (-80.58%)
iridium
A register-based VM in Rust
Stars: ✭ 60 (-56.83%)
spreadsheet
TypeScript/JavaScript spreadsheet parser, with formulas.
Stars: ✭ 40 (-71.22%)
flink-connector-kudu
A flink-connector-kudu based on Apache-bahir-kudu-connector, supporting Flink1.11.x DynamicTableSource/Sink, Range partitioning, and more.
Stars: ✭ 40 (-71.22%)
how-much
💰 iOS price list app using Firebase, Realm & more
Stars: ✭ 22 (-84.17%)
fdp-modelserver
An umbrella project for multiple implementations of model serving
Stars: ✭ 47 (-66.19%)
lilt
LILT: noun, A characteristic rising and falling of the voice when speaking; a pleasant gentle accent.
Stars: ✭ 18 (-87.05%)
section-matter
Like front-matter, but allows multiple sections in a single document.
Stars: ✭ 18 (-87.05%)
Live-Stream-Chat-Retriever
Retrieve live stream chat messages from different sources (Twitch, YouTube Gaming, Dailymotion etc...) to print them into a single HTML page.
Stars: ✭ 40 (-71.22%)
go-htmlinfo
Go HTML Info package for extracting meaningful information from an HTML page
Stars: ✭ 33 (-76.26%)
shape-json
Module used to convert a flat json array into a nested json object with a predefined scheme
Stars: ✭ 31 (-77.7%)
Lidea
Real-time monitoring platform for large-scale distributed systems.
Stars: ✭ 28 (-79.86%)
Mzinga
Open-source software to play the board game Hive.
Stars: ✭ 57 (-58.99%)
spacesuit
API Gateway with URL remapping
Stars: ✭ 19 (-86.33%)
pp-toml
Paul's Parser for Tom's Own Minimal Language
Stars: ✭ 17 (-87.77%)
TIFeedParser
RSS Parser written in Swift
Stars: ✭ 18 (-87.05%)
jet
Jet is a simple OOP, dynamically typed, functional language that runs on the Erlang virtual machine (BEAM). Jet's syntax is Ruby-like.
Stars: ✭ 22 (-84.17%)
pf-azure-sentinel
Parse pfSense/OPNSense logs using Logstash, GeoIP tag entities, add additional context to logs, then send to Azure Sentinel for analysis.
Stars: ✭ 24 (-82.73%)
radiator
Hive Ruby API Client
Stars: ✭ 49 (-64.75%)
icecast-parser
Node.js module for getting and parsing metadata from SHOUTcast/Icecast radio streams
Stars: ✭ 66 (-52.52%)
go-oembed
Golang package for parsing oEmbed data from known providers by URL
Stars: ✭ 22 (-84.17%)
xarray-beam
Distributed Xarray with Apache Beam
Stars: ✭ 83 (-40.29%)
beemos
BEE MOnitoring System: create an infrastructure for monitoring beehives
Stars: ✭ 16 (-88.49%)
hive compared bq
hive_compared_bq compares/validates two SQL-like tables and graphically shows the rows/columns that differ.
Stars: ✭ 27 (-80.58%)