LinkisLinkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+1097.42%)
Movie recommend基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
Stars: ✭ 2,092 (+978.35%)
kdpKubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store
Stars: ✭ 15 (-92.27%)
HiveFast. Scalable. Powerful. The Blockchain for Web 3.0
Stars: ✭ 142 (-26.8%)
hive-jdbc-driverAn alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC
Stars: ✭ 31 (-84.02%)
Eel SdkBig Data Toolkit for the JVM
Stars: ✭ 140 (-27.84%)
reglnWindows Rregistry Linking Utility
Stars: ✭ 38 (-80.41%)
spark-acidACID Data Source for Apache Spark based on Hive ACID
Stars: ✭ 91 (-53.09%)
hadoopofficeHadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
Stars: ✭ 56 (-71.13%)
Cube.js📊 Cube — Open-Source Analytics API for Building Data Apps
Stars: ✭ 11,983 (+6076.8%)
MzingaOpen-source software to play the board game Hive.
Stars: ✭ 57 (-70.62%)
Avro Hadoop StarterExample MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
Stars: ✭ 110 (-43.3%)
Instagram2FediPython script for crossposting from Instagram to Mastodon or Pixelfed
Stars: ✭ 45 (-76.8%)
Php Thrift SqlA PHP library for connecting to Hive or Impala over Thrift
Stars: ✭ 107 (-44.85%)
IdraIdra - Open Data Federation Platform
Stars: ✭ 15 (-92.27%)
PyhivePython interface to Hive and Presto. 🐝
Stars: ✭ 1,378 (+610.31%)
data-profilinga set of scripts to pull meta data and data profiling metrics from relational database systems
Stars: ✭ 57 (-70.62%)
Esteem SurferEcency desktop formerly known as Esteem Surfer - reimagined desktop social wallet, contribute and get rewarded (for Windows, Mac, Linux)
Stars: ✭ 100 (-48.45%)
pylodonFlask-based ActivityPub server
Stars: ✭ 86 (-55.67%)
hive-cubeData self exporting and monitoring platform based on Hive data warehouse. https://hc.smartloli.org
Stars: ✭ 34 (-82.47%)
Repository个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。
Stars: ✭ 92 (-52.58%)
TiBigDataTiDB connectors for Flink/Hive/Presto
Stars: ✭ 192 (-1.03%)
Hops ExamplesExamples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops
Stars: ✭ 84 (-56.7%)
hiveql-parserHiveQL Parser. Parse HiveQL code and print AST in JSON format if success, else print well formed syntax error message.
Stars: ✭ 25 (-87.11%)
DataspherestudioDataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+515.98%)
HiveRunnerAn Open Source unit test framework for Hive queries based on JUnit 4 and 5
Stars: ✭ 244 (+25.77%)
DaFlowApache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (-87.63%)
the-apache-ignite-bookAll code samples, scripts and more in-depth examples for The Apache Ignite Book. Include Apache Ignite 2.6 or above
Stars: ✭ 65 (-66.49%)
Hive Jdbc Uber JarHive JDBC "uber" or "standalone" jar based on the latest Apache Hive version
Stars: ✭ 188 (-3.09%)
Eyerissf An Eyeriss Chip (researched by MIT, a CNN accelerator) simulator and New DNN framework "Hive"
Stars: ✭ 68 (-64.95%)
bookwyrmSocial reading and reviewing, decentralized with ActivityPub
Stars: ✭ 1,499 (+672.68%)
Pyetlpython ETL framework
Stars: ✭ 33 (-82.99%)
common-datax基于DataX的通用数据同步微服务,一个Restful接口搞定所有通用数据同步
Stars: ✭ 51 (-73.71%)
DataX-srcDataX 是异构数据广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、SqlServer、Postgre、HDFS、Hive、ADS、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。
Stars: ✭ 21 (-89.18%)
beep-beepFictional p2p protocol
Stars: ✭ 34 (-82.47%)
Bigdata💎🔥大数据学习笔记
Stars: ✭ 488 (+151.55%)
diaspora federationA library that provides functionalities needed for the diaspora* federation protocol.
Stars: ✭ 97 (-50%)
Bdp Dataplatform大数据生态解决方案数据平台:基于大数据、数据平台、微服务、机器学习、商城、自动化运维、DevOps、容器部署平台、数据平台采集、数据平台存储、数据平台计算、数据平台开发、数据平台应用搭建的大数据解决方案。
Stars: ✭ 456 (+135.05%)
awesome-hiveA curated list of awesome Hive resources.
Stars: ✭ 20 (-89.69%)
YanagishimaWeb UI for Trino, Presto, Hive, Elasticsearch, SparkSQL
Stars: ✭ 424 (+118.56%)
beemosBEE MOnitoring System: create an infrastructure for monitoring beehives
Stars: ✭ 16 (-91.75%)
Spiderman基于 scrapy-redis 的通用分布式爬虫框架
Stars: ✭ 392 (+102.06%)
aaocp一个对用户行为日志进行分析的大数据项目
Stars: ✭ 53 (-72.68%)
WedatasphereWeDataSphere is a financial level one-stop open-source suitcase for big data platforms. Currently the source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!
Stars: ✭ 372 (+91.75%)
cherrypick🌎 A interplanetary communication platform 🚀
Stars: ✭ 40 (-79.38%)
DatafakerDatafaker is a large-scale test data and flow test data generation tool. Datafaker fakes data and inserts to varied data sources. 测试数据生成工具
Stars: ✭ 327 (+68.56%)
fenseFense is a database proxy written in Java, which can connect DB of different engines at the same time. The key features are: authority management, query cache, audit security, current limiting fuse, onesql and so on
Stars: ✭ 22 (-88.66%)
DrillApache Drill is a distributed MPP query layer for self describing data
Stars: ✭ 1,619 (+734.54%)
HiverunnerAn Open Source unit test framework for Hive queries based on JUnit 4 and 5
Stars: ✭ 225 (+15.98%)
incubator-linkisLinkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,459 (+1167.53%)
liquibase-impalaLiquibase extension to add Impala Database support
Stars: ✭ 23 (-88.14%)
darrrrAn SDK for the delegated recovery specfication
Stars: ✭ 43 (-77.84%)
brambleThe Movio GraphQL Gateway
Stars: ✭ 423 (+118.04%)
hadoop-etl-udfsThe Hadoop ETL UDFs are the main way to load data from Hadoop into EXASOL
Stars: ✭ 17 (-91.24%)
apollo-studio-community🎡 GraphQL developer portal featuring an IDE (Apollo Explorer), auto-documentation, metrics reporting, and more. This repo is for issues, feature requests, and preview docs. 📬
Stars: ✭ 212 (+9.28%)
HiveLightweight and blazing fast key-value database written in pure Dart.
Stars: ✭ 2,681 (+1281.96%)