Hive Jdbc Uber JarHive JDBC "uber" or "standalone" jar based on the latest Apache Hive version
Stars: ✭ 188 (+154.05%)
hive-jdbc-driverAn alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC
Stars: ✭ 31 (-58.11%)
db.rstudio.comWebsite dedicated to all things R and Databases
Stars: ✭ 13 (-82.43%)
TidyqueryQuery R data frames with SQL
Stars: ✭ 138 (+86.49%)
TidyTidy up your data with JavaScript, inspired by dplyr and the tidyverse
Stars: ✭ 307 (+314.86%)
incubator-linkisLinkis helps easily connect to various back-end computation/storage engines (Spark, Python, TiDB...), exposes various interfaces (REST, JDBC, Java...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,459 (+3222.97%)
KyuubiKyuubi is a unified multi-tenant JDBC interface for large-scale data processing and analytics, built on top of Apache Spark
Stars: ✭ 363 (+390.54%)
datarA Grammar of Data Manipulation in Python
Stars: ✭ 142 (+91.89%)
waspWASP is a framework to build complex real-time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amounts of heterogeneous data and analyze it through complex pipelines, this is the framework for you.
Stars: ✭ 19 (-74.32%)
TrinoOfficial repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+6090.54%)
SqliORM SQL interface, Criteria, CriteriaBuilder, ResultMapBuilder
Stars: ✭ 1,644 (+2121.62%)
liquibase-impalaLiquibase extension to add Impala Database support
Stars: ✭ 23 (-68.92%)
TidyheatmapDraw heatmap simply using a tidy data frame
Stars: ✭ 151 (+104.05%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+102.7%)
LinkisLinkis helps easily connect to various back-end computation/storage engines (Spark, Python, TiDB...), exposes various interfaces (REST, JDBC, Java...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+3039.19%)
DrillApache Drill is a distributed MPP query layer for self-describing data
Stars: ✭ 1,619 (+2087.84%)
TezApache Tez
Stars: ✭ 313 (+322.97%)
Moderndive bookStatistical Inference via Data Science: A ModernDive into R and the Tidyverse
Stars: ✭ 527 (+612.16%)
TidyquantBringing financial analysis to the tidyverse
Stars: ✭ 635 (+758.11%)
AddaxAddax is an open source universal ETL tool that supports most RDBMS and NoSQL databases, helping you transfer data from any one place to another.
Stars: ✭ 615 (+731.08%)
NutchApache Nutch is an extensible and scalable web crawler
Stars: ✭ 2,277 (+2977.03%)
eeguanaA package for manipulating EEG data in R.
Stars: ✭ 16 (-78.38%)
CSSS508CSSS508: Introduction to R for Social Scientists
Stars: ✭ 28 (-62.16%)
IbisA pandas-like deferred expression system, with first-class SQL support
Stars: ✭ 1,630 (+2102.7%)
parcours-rTeaching kit for R training
Stars: ✭ 25 (-66.22%)
TimetkA toolkit for working with time series in R
Stars: ✭ 371 (+401.35%)
TidylogTidylog provides feedback about dplyr and tidyr operations. It provides wrapper functions for the most common functions, such as filter, mutate, select, and group_by, and provides detailed output for joins.
Stars: ✭ 428 (+478.38%)
casewhenCreate reusable dplyr::case_when() functions
Stars: ✭ 64 (-13.51%)
HiveApache Hive
Stars: ✭ 4,031 (+5347.3%)
hive to esA small tool for syncing Hive data warehouse data to Elasticsearch
Stars: ✭ 21 (-71.62%)
R-data-wranglingMaterials for my R data workshop. https://cengel.github.io/R-data-wrangling/
Stars: ✭ 17 (-77.03%)
skeinA tool and library for easily deploying applications on Apache YARN
Stars: ✭ 128 (+72.97%)
java📚 Resources for learning Java
Stars: ✭ 31 (-58.11%)
expresso-phpFast and simple Docker setup for all your PHP development. Quick but not dirty.
Stars: ✭ 31 (-58.11%)
xxhadoopData analysis using Hadoop, Spark, Storm, Elasticsearch, machine learning, etc. These are my daily notes, code, and demos. Don't fork, just star!
Stars: ✭ 37 (-50%)
DBISProjectLibrary Management System using Java and MySQL
Stars: ✭ 27 (-63.51%)
pypyodbcpypyodbc is a pure Python cross platform ODBC interface module (pyodbc compatible as of 2017)
Stars: ✭ 39 (-47.3%)
datasqueezeHadoop utility to compact small files
Stars: ✭ 18 (-75.68%)
openwhisk-runtime-goApache OpenWhisk Runtime Go supports Apache OpenWhisk functions written in Go
Stars: ✭ 31 (-58.11%)
hadoop-etl-udfsThe Hadoop ETL UDFs are the main way to load data from Hadoop into EXASOL
Stars: ✭ 17 (-77.03%)
spwrapSimple Stored Procedure call wrapper with no framework dependencies.
Stars: ✭ 24 (-67.57%)
NeoORM framework: a minimalist Java ORM framework built on the ActiveRecord approach
Stars: ✭ 35 (-52.7%)
memex-gateGeneral Architecture for Text Engineering
Stars: ✭ 47 (-36.49%)
KBC--Kaun-Banega-CrorepatiA Core Java game based on the Indian television game show, with the best animation achievable in Core Java (5000+ lines)
Stars: ✭ 38 (-48.65%)
docker-oxid6Docker Container with PHP7, MySQL 5.7 and OXID eShop 6
Stars: ✭ 30 (-59.46%)
uima-uimajApache UIMA Java SDK
Stars: ✭ 50 (-32.43%)
sparkucxA high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing UCX communication layer
Stars: ✭ 32 (-56.76%)
disqA library for manipulating bioinformatics sequencing formats in Apache Spark
Stars: ✭ 29 (-60.81%)
corcAn ORC File Scheme for the Cascading data processing platform.
Stars: ✭ 14 (-81.08%)
odbc2parquetA command line tool to query an ODBC data source and write the result into a parquet file.
Stars: ✭ 95 (+28.38%)