All Projects → Parquet Index → Similar Projects or Alternatives

1587 Open source projects that are alternatives of or similar to Parquet Index

Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+1127.52%)
Mutual labels:  spark
Devops Exercises
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions
Stars: ✭ 20,905 (+19078.9%)
Mutual labels:  sql
Docker Trino Cluster
Multiple node presto cluster on docker container
Stars: ✭ 81 (-25.69%)
Mutual labels:  sql
Sqlformat
.NET SQL Parser and Formatter Tool and SSMS Plugin
Stars: ✭ 49 (-55.05%)
Mutual labels:  sql
Laravel Stats
📈 Get insights about your Laravel or Lumen Project
Stars: ✭ 1,386 (+1171.56%)
Mutual labels:  statistics
Scala Db Codegen
Scala code/boilerplate generator from a db schema
Stars: ✭ 49 (-55.05%)
Mutual labels:  sql
Lehar
Visualize data using relative ordering
Stars: ✭ 81 (-25.69%)
Mutual labels:  spark
Base
https://www.researchgate.net/profile/Rajah_Iyer
Stars: ✭ 48 (-55.96%)
Mutual labels:  sql
Sofastack
SOFAStack™ (Scalable Open Financial Architecture Stack) is a collection of cloud native middleware components, which are designed to build distributed systems with high performance and reliability, and have been fully validated by mission-critical financial business scenarios.
Stars: ✭ 96 (-11.93%)
Mutual labels:  index
Awesome Recommendation Engine
The purpose of this tiny project is to put things together with the know how that i learned from the course big data expert from formacionhadoop.com The idea is to show how to play with apache spark streaming, kafka,mongo, spark machine learning algorithms.
Stars: ✭ 47 (-56.88%)
Mutual labels:  spark
Deveeldb
DeveelDB is a complete SQL database system, primarly developed for .NET/Mono frameworks
Stars: ✭ 80 (-26.61%)
Mutual labels:  sql
Node Parquet
NodeJS module to access apache parquet format files
Stars: ✭ 46 (-57.8%)
Mutual labels:  parquet
Isl Python
Porting the R code in ISL to python. Labs and exercises
Stars: ✭ 108 (-0.92%)
Mutual labels:  statistics
Pibench
Benchmarking framework for index structures on persistent memory
Stars: ✭ 46 (-57.8%)
Mutual labels:  index
Djongo
Django and MongoDB database connector
Stars: ✭ 1,222 (+1021.1%)
Mutual labels:  sql
Examples
Demo applications and code examples for Confluent Platform and Apache Kafka
Stars: ✭ 571 (+423.85%)
Mutual labels:  sql
Probflow
A Python package for building Bayesian models with TensorFlow or PyTorch
Stars: ✭ 95 (-12.84%)
Mutual labels:  statistics
Spark Tda
SparkTDA is a package for Apache Spark providing Topological Data Analysis Functionalities.
Stars: ✭ 45 (-58.72%)
Mutual labels:  spark
Askxml
Run SQL statements on XML documents
Stars: ✭ 79 (-27.52%)
Mutual labels:  sql
Jl Sql
SQL for JSON and CSV streams
Stars: ✭ 44 (-59.63%)
Mutual labels:  sql
Idea Sql Generator Tool
intellij idea sql generator tool
Stars: ✭ 102 (-6.42%)
Mutual labels:  sql
Be Course 17 18
🎓 Backend · 2017-2018 · Curriculum and Syllabus 💾
Stars: ✭ 44 (-59.63%)
Mutual labels:  sql
Sayn
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Stars: ✭ 79 (-27.52%)
Mutual labels:  sql
Dagbot
The official Repository for dagbot, the self proclaimmed n1 meme bot.
Stars: ✭ 40 (-63.3%)
Mutual labels:  sql
Brein Time Utilities
Library which contains several time-dependent data and index structures (e.g., IntervalTree, BucketTimeSeries), as well as algorithms.
Stars: ✭ 94 (-13.76%)
Mutual labels:  index
Delta Architecture
Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline
Stars: ✭ 43 (-60.55%)
Mutual labels:  spark
Superseriousstats
superseriousstats is a fast and efficient program to create statistics out of various types of chat logs
Stars: ✭ 78 (-28.44%)
Mutual labels:  statistics
Spark Examples
Spark examples
Stars: ✭ 41 (-62.39%)
Mutual labels:  spark
Ml Videos
A collection of video resources for machine learning
Stars: ✭ 1,446 (+1226.61%)
Mutual labels:  statistics
Tigertoolbox
Toolbox repository for Tiger team
Stars: ✭ 1,003 (+820.18%)
Mutual labels:  sql
Emacs Sql Indent
Syntax based indentation for SQL files inside GNU Emacs
Stars: ✭ 78 (-28.44%)
Mutual labels:  sql
Gatk
Official code repository for GATK versions 4 and up
Stars: ✭ 1,002 (+819.27%)
Mutual labels:  spark
Repository
个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。
Stars: ✭ 92 (-15.6%)
Mutual labels:  spark
Pixiedust
Python Helper library for Jupyter Notebooks
Stars: ✭ 998 (+815.6%)
Mutual labels:  spark
Dapper.bulk
Dapper.Bulk SqlServer
Stars: ✭ 78 (-28.44%)
Mutual labels:  sql
Discord Global Mutual
Get the list of people that you have shared servers with
Stars: ✭ 39 (-64.22%)
Mutual labels:  statistics
Hackermath
Introduction to Statistics and Basics of Mathematics for Data Science - The Hacker's Way
Stars: ✭ 1,380 (+1166.06%)
Mutual labels:  statistics
Sqlstyle.guide
A consistent code style guide for SQL to ensure legible and maintainable projects
Stars: ✭ 994 (+811.93%)
Mutual labels:  sql
Bvcms
The open source church management system
Stars: ✭ 77 (-29.36%)
Mutual labels:  sql
Statzone
DNS zone file analyzer targeted at TLD zones
Stars: ✭ 38 (-65.14%)
Mutual labels:  statistics
Tyche
Statistics utilities for the JVM - in Scala!
Stars: ✭ 93 (-14.68%)
Mutual labels:  statistics
Rhodddoobie
My little sandbox for playing around with the FP + OOP + DDD combination, in particular using Rho, doobie, Docker, testing, etc in a project.
Stars: ✭ 38 (-65.14%)
Mutual labels:  sql
Github Traffic
Get the Github traffic for the specified repository
Stars: ✭ 77 (-29.36%)
Mutual labels:  statistics
Goqu
SQL builder and query library for golang
Stars: ✭ 984 (+802.75%)
Mutual labels:  sql
Cslearning
开源项目之「计算机编程自学之路」:计算机自学指南+面试大全+资源分享+技术文章
Stars: ✭ 107 (-1.83%)
Mutual labels:  sql
Real Time Stream Processing Engine
This is an example of real time stream processing using Spark Streaming, Kafka & Elasticsearch.
Stars: ✭ 37 (-66.06%)
Mutual labels:  spark
Hyperlearn
50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Stars: ✭ 1,204 (+1004.59%)
Mutual labels:  statistics
Learning Spark
零基础学习spark,大数据学习
Stars: ✭ 37 (-66.06%)
Mutual labels:  spark
Spark Summit 2017 Sanfrancisco
spark summit 2017 SanFrancisco
Stars: ✭ 93 (-14.68%)
Mutual labels:  spark
Mlj.jl
A Julia machine learning framework
Stars: ✭ 982 (+800.92%)
Mutual labels:  statistics
Home
ApacheCN 开源组织:公告、介绍、成员、活动、交流方式
Stars: ✭ 1,199 (+1000%)
Mutual labels:  spark
Helioml
A book about machine learning, statistics, and data mining for heliophysics
Stars: ✭ 36 (-66.97%)
Mutual labels:  statistics
Spark Terasort
Spark Terasort
Stars: ✭ 101 (-7.34%)
Mutual labels:  spark
Reporting Services Examples
📕 Various example reports I use for SQL Server Reporting Services (SSRS) as well as documents for unit testing, requirements and a style guide template.
Stars: ✭ 63 (-42.2%)
Mutual labels:  sql
Bigdata File Viewer
A cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
Stars: ✭ 86 (-21.1%)
Mutual labels:  parquet
Spark Doc Zh
Apache Spark 官方文档中文版
Stars: ✭ 1,126 (+933.03%)
Mutual labels:  spark
Sqlite orm
❤️ SQLite ORM light header only library for modern C++
Stars: ✭ 1,121 (+928.44%)
Mutual labels:  sql
Dynamodb Oop
Speak fluent DynamoDB, write code with fashion, I Promise() 😃
Stars: ✭ 104 (-4.59%)
Mutual labels:  sql
Porcupine
Threading, Resiliency and Monitoring for Java EE 7/8
Stars: ✭ 99 (-9.17%)
Mutual labels:  statistics
Training Material
A collection of code examples as well as presentations for training purposes
Stars: ✭ 85 (-22.02%)
Mutual labels:  sql
301-360 of 1587 similar projects