All Projects → Parquet Index → Similar Projects or Alternatives

1587 Open source projects that are alternatives of or similar to Parquet Index

Parquet file generator

Stars: ✭ 16 (-85.32%)

Mutual labels: sql, spark, parquet

Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.

Stars: ✭ 696 (+538.53%)

Mutual labels: sql, spark

Oap

Optimized Analytics Package for Spark* Platform

Stars: ✭ 343 (+214.68%)

Mutual labels: spark, parquet

Rumble

⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more

Stars: ✭ 58 (-46.79%)

Mutual labels: spark, parquet

Xsql

Unified SQL Analytics Engine Based on SparkSQL

Stars: ✭ 176 (+61.47%)

Mutual labels: sql, spark

Metorikku

A simplified, lightweight ETL Framework based on Apache Spark

Stars: ✭ 361 (+231.19%)

Mutual labels: sql, spark

Whylogs Java

Profile and monitor your ML data pipeline end-to-end

Stars: ✭ 164 (+50.46%)

Mutual labels: spark, statistics

Sqlindexmanager

Free GUI Tool for Index Maintenance on SQL Server and Azure

Stars: ✭ 403 (+269.72%)

Mutual labels: sql, index

Iceberg

Iceberg is a table format for large, slow-moving tabular data

Stars: ✭ 393 (+260.55%)

Mutual labels: spark, parquet

experiments

Code examples for my blog posts

Stars: ✭ 21 (-80.73%)

Mutual labels: spark, parquet

Mlinterview

A curated awesome list of AI Startups in India & Machine Learning Interview Guide. Feel free to contribute!

Stars: ✭ 410 (+276.15%)

Mutual labels: sql, statistics

Datafusion

DataFusion has now been donated to the Apache Arrow project

Stars: ✭ 611 (+460.55%)

Mutual labels: sql, spark

Spark Website

Apache Spark Website

Stars: ✭ 75 (-31.19%)

Mutual labels: sql, spark

Pucket

Bucketing and partitioning system for Parquet

Stars: ✭ 29 (-73.39%)

Mutual labels: spark, parquet

Data Science Best Resources

Carefully curated resource links for data science in one place

Stars: ✭ 1,104 (+912.84%)

Mutual labels: sql, statistics

Data Science Question Answer

A repo for data science related questions and answers

Stars: ✭ 2,000 (+1734.86%)

Mutual labels: sql, statistics

Quicksql

A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources

Stars: ✭ 1,821 (+1570.64%)

Mutual labels: sql, spark

Gaffer

A large-scale entity and relation database supporting aggregation of properties

Stars: ✭ 1,642 (+1406.42%)

Mutual labels: spark, parquet

Spark

Apache Spark - A unified analytics engine for large-scale data processing

Stars: ✭ 31,618 (+28907.34%)

Mutual labels: sql, spark

Spark With Python

Fundamentals of Spark with Python (using PySpark), code examples

Stars: ✭ 150 (+37.61%)

Mutual labels: sql, spark

Linkis

Linkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.

Stars: ✭ 2,323 (+2031.19%)

Mutual labels: sql, spark

Kyuubi

Kyuubi is a unified multi-tenant JDBC interface for large-scale data processing and analytics, built on top of Apache Spark

Stars: ✭ 363 (+233.03%)

Mutual labels: sql, spark

Roapi

Create full-fledged APIs for static datasets without writing a single line of code.

Stars: ✭ 253 (+132.11%)

Mutual labels: sql, parquet

Devops Python Tools

80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.

Stars: ✭ 406 (+272.48%)

Mutual labels: spark, parquet

Kamu Cli

Next generation tool for decentralized exchange and transformation of semi-structured data

Stars: ✭ 69 (-36.7%)

Mutual labels: sql, spark

Schemer

Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.

Stars: ✭ 97 (-11.01%)

Mutual labels: spark, parquet

Php Thrift Sql

A PHP library for connecting to Hive or Impala over Thrift

Stars: ✭ 107 (-1.83%)

Mutual labels: sql

Xorm

xorm是一个简单而强大的Go语言ORM库，通过它可以使数据库操作非常简便。本库是基于原版xorm的定制增强版本，为xorm提供类似ibatis的配置文件及动态SQL支持，支持AcitveRecord操作

Stars: ✭ 1,394 (+1178.9%)

Mutual labels: sql

Ransom0

Ransom0 is a open source ransomware made with Python, designed to find and encrypt user data.

Stars: ✭ 105 (-3.67%)

Mutual labels: sql

Legacy Search

Demo project showing how to add elasticsearch to a legacy application.

Stars: ✭ 103 (-5.5%)

Mutual labels: sql

Pyspark Cheatsheet

🐍 Quick reference guide to common patterns & functions in PySpark.

Stars: ✭ 108 (-0.92%)

Mutual labels: spark

Tennis Crystal Ball

Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction

Stars: ✭ 107 (-1.83%)

Mutual labels: statistics

Cubes

Light-weight Python OLAP framework for multi-dimensional data analysis

Stars: ✭ 1,393 (+1177.98%)

Mutual labels: sql

Minisqlquery

Minimalist SQL Query tool for any .NET DB Provider - SQL, SQLite, SQL CE, Oracle, Access...

Stars: ✭ 103 (-5.5%)

Mutual labels: sql

Monetdblite

MonetDB reconfigured as a library

Stars: ✭ 107 (-1.83%)

Mutual labels: sql

Your spotify

Self hosted Spotify tracking dashboard

Stars: ✭ 102 (-6.42%)

Mutual labels: statistics

Laravel Stats

📈 Get insights about your Laravel or Lumen Project

Stars: ✭ 1,386 (+1171.56%)

Mutual labels: statistics

Isl Python

Porting the R code in ISL to python. Labs and exercises

Stars: ✭ 108 (-0.92%)

Mutual labels: statistics

Hnswlib

Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs

Stars: ✭ 108 (-0.92%)

Mutual labels: spark

Logigsk

A Linux based software package to control led's on Logitech G910, G810, G610 and G410.

Stars: ✭ 107 (-1.83%)

Mutual labels: spark

Gitlogg

💾 🧮 🤯 Parse the 'git log' of multiple repos to 'JSON'

Stars: ✭ 102 (-6.42%)

Mutual labels: statistics

Idea Sql Generator Tool

intellij idea sql generator tool

Stars: ✭ 102 (-6.42%)

Mutual labels: sql

Ml Videos

A collection of video resources for machine learning

Stars: ✭ 1,446 (+1226.61%)

Mutual labels: statistics

The Federation.info

Statistics hub for the Fediverse

Stars: ✭ 101 (-7.34%)

Mutual labels: statistics

Hackermath

Introduction to Statistics and Basics of Mathematics for Data Science - The Hacker's Way

Stars: ✭ 1,380 (+1166.06%)

Mutual labels: statistics

Cslearning

开源项目之「计算机编程自学之路」：计算机自学指南+面试大全+资源分享+技术文章

Stars: ✭ 107 (-1.83%)

Mutual labels: sql

Datawarehouse

数据仓库和用户画像

Stars: ✭ 105 (-3.67%)

Mutual labels: sql

Root

The official repository for ROOT: analyzing, storing and visualizing big data, scientifically

Stars: ✭ 1,377 (+1163.3%)

Mutual labels: statistics

Spark Terasort

Stars: ✭ 101 (-7.34%)

Mutual labels: spark

Sqlobject

SQLObject, an object-relational mapper for Python

Stars: ✭ 106 (-2.75%)

Mutual labels: sql

F3 Cortex

A multi-engine ORM / ODM for the PHP Fat-Free Framework

Stars: ✭ 101 (-7.34%)

Mutual labels: sql

Spark Ffm

FFM (Field-Awared Factorization Machine) on Spark

Stars: ✭ 101 (-7.34%)

Mutual labels: spark

Sqlfaker

轻量级、易拓展的数据库智能填充Java开源库

Stars: ✭ 109 (+0%)

Mutual labels: sql

Ptstat

Probabilistic Programming and Statistical Inference in PyTorch

Stars: ✭ 108 (-0.92%)

Mutual labels: statistics

Scikit Learn

scikit-learn: machine learning in Python

Stars: ✭ 48,322 (+44232.11%)

Mutual labels: statistics

Npm Stats

📈 npm package statistics dashboard build with vue

Stars: ✭ 106 (-2.75%)

Mutual labels: statistics

Flyway Sbt

Flyway SBT plugin

Stars: ✭ 101 (-7.34%)

Mutual labels: sql

Maha

A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.

Stars: ✭ 101 (-7.34%)

Mutual labels: sql

Griddb

GridDB is a next-generation open source database that makes time series IoT and big data fast,and easy.

Stars: ✭ 1,587 (+1355.96%)

Mutual labels: sql

Fiflow

flink-sql 在 flink 上运行 sql 和构建数据流的平台基于 apache flink 1.10.0

Stars: ✭ 100 (-8.26%)

Mutual labels: sql

1-60 of 1587 similar projects

›

next*5