ibis-project / Ibis

Licence: apache-2.0

A pandas-like deferred expression system, with first-class SQL support

Programming Languages

python

139335 projects - #7 most used programming language

C++

36643 projects - #6 most used programming language

Projects that are alternatives of or similar to Ibis

Learning Spark

零基础学习spark，大数据学习

Stars: ✭ 37 (-97.73%)

Mutual labels: hadoop, hdfs, spark

leaflet heatmap

简单的可视化湖州通话数据假设数据量很大，没法用浏览器直接绘制热力图，把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后，再使用Apache Spark绘制热力图，然后用leafletjs加载OpenStreetMap图层和热力图图层，以达到良好的交互效果。现在使用Apache Spark实现绘制，可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法，并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .

Stars: ✭ 13 (-99.2%)

Mutual labels: spark, hadoop, hdfs

Bigdata docker

Big Data Ecosystem Docker

Stars: ✭ 161 (-90.12%)

Mutual labels: hadoop, hdfs, spark

Spark With Python

Fundamentals of Spark with Python (using PySpark), code examples

Stars: ✭ 150 (-90.8%)

Mutual labels: hadoop, hdfs, spark

Repository

个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。

Stars: ✭ 92 (-94.36%)

Mutual labels: hadoop, hdfs, spark

hive to es

同步Hive数据仓库数据到Elasticsearch的小工具

Stars: ✭ 21 (-98.71%)

Mutual labels: hadoop, impala, hdfs

fastdata-cluster

Fast Data Cluster (Apache Cassandra, Kafka, Spark, Flink, YARN and HDFS with Vagrant and VirtualBox)

Stars: ✭ 20 (-98.77%)

Mutual labels: spark, hadoop, hdfs

bigdata-fun

A complete (distributed) BigData stack, running in containers

Stars: ✭ 14 (-99.14%)

Mutual labels: spark, hadoop, hdfs

God Of Bigdata

专注大数据学习面试，大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...

Stars: ✭ 6,008 (+268.59%)

Mutual labels: hadoop, hdfs, spark

Devops Python Tools

80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.

Stars: ✭ 406 (-75.09%)

Mutual labels: hadoop, hdfs, spark

Bigdata Interview

🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结

Stars: ✭ 857 (-47.42%)

Mutual labels: hadoop, hdfs, spark

Data Science Ipython Notebooks

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

Stars: ✭ 22,048 (+1252.64%)

Mutual labels: pandas, hadoop, spark

Bigdata Notes

大数据入门指南 ⭐

Stars: ✭ 10,991 (+574.29%)

Mutual labels: hadoop, hdfs, spark

Xlearning Xdml

extremely distributed machine learning

Stars: ✭ 113 (-93.07%)

Mutual labels: hadoop, spark

Weblogsanalysissystem

A big data platform for analyzing web access logs

Stars: ✭ 37 (-97.73%)

Mutual labels: hadoop, spark

Docker Hadoop

A Docker container with a full Hadoop cluster setup with Spark and Zeppelin

Stars: ✭ 54 (-96.69%)

Mutual labels: hadoop, spark

Rumble

⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more

Stars: ✭ 58 (-96.44%)

Mutual labels: hdfs, spark

Docker Spark Cluster

A Spark cluster setup running on Docker containers

Stars: ✭ 57 (-96.5%)

Mutual labels: hadoop, spark

Waimak

Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.

Stars: ✭ 60 (-96.32%)

Mutual labels: hadoop, spark

Apache Spark Hands On

Educational notes,Hands on problems w/ solutions for hadoop ecosystem

Stars: ✭ 74 (-95.46%)

Mutual labels: hadoop, spark

View All Similar Projects ➔

Ibis: Python data analysis framework for Hadoop and SQL engines

Service	Status
Documentation
Conda packages
PyPI
Ibis CI
Backend CI
Coverage

Ibis is a toolbox to bridge the gap between local Python environments, remote storage, execution systems like Hadoop components (HDFS, Impala, Hive, Spark) and SQL databases. Its goal is to simplify analytical workflows and make you more productive.

Install Ibis from PyPI with:

pip install ibis-framework

or from conda-forge with

conda install ibis-framework -c conda-forge

Ibis currently provides tools for interacting with the following systems:

Learn more about using the library at http://ibis-project.org.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

ibis-project / Ibis

Programming Languages

Labels

Projects that are alternatives of or similar to Ibis

Ibis: Python data analysis framework for Hadoop and SQL engines