litemall-dw基于开源Litemall电商项目的大数据项目,包含前端埋点(openresty+lua)、后端埋点;数据仓库(五层)、实时计算和用户画像。大数据平台采用CDH6.3.2(已使用vagrant+ansible脚本化),同时也包含了Azkaban的workflow。
Stars: ✭ 36 (-29.41%)
cloud云计算之hadoop、hive、hue、oozie、sqoop、hbase、zookeeper环境搭建及配置文件
Stars: ✭ 48 (-5.88%)
LinkisLinkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+4454.9%)
Lidea大型分布式系统实时监控平台
Stars: ✭ 28 (-45.1%)
darwinAvro Schema Evolution made easy
Stars: ✭ 26 (-49.02%)
incubator-linkisLinkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,459 (+4721.57%)
dockerfilesMulti docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )
Stars: ✭ 29 (-43.14%)
Node HbaseAsynchronous HBase client for NodeJs using REST
Stars: ✭ 226 (+343.14%)
Sparkstreaming💥 🚀 封装sparkstreaming动态调节batch time(有数据就执行计算);🚀 支持运行过程中增删topic;🚀 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。
Stars: ✭ 179 (+250.98%)
replicatorMySQL Replicator. Replicates MySQL tables to Kafka and HBase, keeping the data changes history in HBase.
Stars: ✭ 41 (-19.61%)
hbase-pythonhbase-python is a pure python package used to access HBase.
Stars: ✭ 38 (-25.49%)
phoenixApache Phoenix / Hbase Spring Boot Microservices
Stars: ✭ 23 (-54.9%)
IbisA pandas-like deferred expression system, with first-class SQL support
Stars: ✭ 1,630 (+3096.08%)
disk基于hadoop+hbase+springboot实现分布式网盘系统
Stars: ✭ 53 (+3.92%)
xingtianxingtian is a componentized library for the development and verification of reinforcement learning algorithms
Stars: ✭ 229 (+349.02%)
talosNo description or website provided.
Stars: ✭ 37 (-27.45%)
GimelBig Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (+323.53%)
waspWASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you.
Stars: ✭ 19 (-62.75%)
bloomeryWeb UI for Impala
Stars: ✭ 15 (-70.59%)
Camelliacamellia framework by netease-im. provider: 1) redis-client; 2) redis-proxy(redis-sentinel/redis-cluster); 3) hbase-client; 4) others
Stars: ✭ 146 (+186.27%)
orionManagement and automation platform for Stateful Distributed Systems
Stars: ✭ 77 (+50.98%)
mangoCore utility library & data connectors designed for simpler usage in Scala
Stars: ✭ 41 (-19.61%)
aaocp一个对用户行为日志进行分析的大数据项目
Stars: ✭ 53 (+3.92%)
xxhadoopData Analysis Using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. This is My Daily Notes/Code/Demo. Don't fork, Just star !
Stars: ✭ 37 (-27.45%)
mizoSuper-fast Spark RDD for Titan Graph Database on HBase
Stars: ✭ 24 (-52.94%)
cmuxA set of commands for managing CDH clusters using Cloudera Manager REST API.
Stars: ✭ 34 (-33.33%)
Sqliorm sql interface, Criteria, CriteriaBuilder, ResultMapBuilder
Stars: ✭ 1,644 (+3123.53%)
Awesome HbaseA curated list of awesome HBase projects and resources.
Stars: ✭ 140 (+174.51%)
AddaxAddax is an open source universal ETL tool that supports most of those RDBMS and NoSQLs on the planet, helping you transfer data from any one place to another.
Stars: ✭ 615 (+1105.88%)
Spark DB ConnectorUse Scala API to read/write data from different databases,HBase,MySQL,etc.
Stars: ✭ 24 (-52.94%)
HgraphdbHBase as a TinkerPop Graph Database
Stars: ✭ 226 (+343.14%)
ImposterScriptable, multipurpose mock server.
Stars: ✭ 187 (+266.67%)
implyrSQL backend to dplyr for Impala
Stars: ✭ 74 (+45.1%)
Bigdata PlaygroundA complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (+247.06%)
hive to es同步Hive数据仓库数据到Elasticsearch的小工具
Stars: ✭ 21 (-58.82%)
thrift2-hbasethrift2-hbase component for Hyperf.
Stars: ✭ 14 (-72.55%)
TeraAn Internet-Scale Database.
Stars: ✭ 1,846 (+3519.61%)
dpkb大数据相关内容汇总,包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词:Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse
Stars: ✭ 123 (+141.18%)
Technology Talk汇总java生态圈常用技术框架、开源中间件,系统架构、数据库、大公司架构案例、常用三方类库、项目管理、线上问题排查、个人成长、思考等知识
Stars: ✭ 12,136 (+23696.08%)
cbassadding "simple" to HBase
Stars: ✭ 25 (-50.98%)
swordfishOpen-source distribute workflow schedule tools, also support streaming task.
Stars: ✭ 35 (-31.37%)
hdocdbHBase as a JSON Document Database
Stars: ✭ 24 (-52.94%)
liquibase-impalaLiquibase extension to add Impala Database support
Stars: ✭ 23 (-54.9%)
DataX-srcDataX 是异构数据广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、SqlServer、Postgre、HDFS、Hive、ADS、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。
Stars: ✭ 21 (-58.82%)