visualize-data-with-python: A Jupyter notebook using some standard techniques for data science and data engineering to analyze data for the 2017 flooding in Houston, TX.
Stars: ✭ 60 (-76.38%)
bigkube: Minikube for big data with Scala and Spark
Stars: ✭ 16 (-93.7%)
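v6.dooring.public: A large-screen data visualization solution that provides a visual editing engine, helping individuals and enterprises easily build their own large-screen visualization applications.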
Stars: ✭ 323 (+27.17%)
Spark-PMoF: Spark Shuffle Optimization with RDMA+AEP
Stars: ✭ 28 (-88.98%)
spark-http-stream: Spark Structured Streaming via HTTP communication
Stars: ✭ 17 (-93.31%)
ETL-Starter-Kit: 📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL-related work.
Stars: ✭ 21 (-91.73%)
bqv: The simplest tool to manage views of BigQuery.
Stars: ✭ 22 (-91.34%)
Covid19Tracker: A Robinhood-style COVID-19 🦠 Android tracking app for the US. Open source and built with Kotlin.
Stars: ✭ 65 (-74.41%)
vulkn: Love your Data. Love the Environment. Love VULKИ.
Stars: ✭ 43 (-83.07%)
docker-spark: Apache Spark Docker container image (Standalone mode)
Stars: ✭ 34 (-86.61%)
jigsaw-seed: The seed project for the Jigsaw component library (https://github.com/rdkmaster/jigsaw). All new apps are advised to start building from this project as their seed.
Stars: ✭ 17 (-93.31%)
UnROOT.jl: Native Julia I/O package to work with CERN ROOT files
Stars: ✭ 52 (-79.53%)
cds: Data syncing in golang for ClickHouse.
Stars: ✭ 839 (+230.31%)
SparkV: 🤖⚡ | The most POWERFUL multipurpose chat/meme bot that will boost the activity in your server.
Stars: ✭ 24 (-90.55%)
meetups-archivos: Slides, code, and videos from the meetups, data science days, video calls, and workshops. Data Science Research is a non-profit organization that seeks to promote, decentralize, and spread knowledge of Data Science and Artificial Intelligence in Peru, giving opportunities to new talent through MeetUps, Workshops, and Semilleros …
Stars: ✭ 60 (-76.38%)
spark-acid: ACID Data Source for Apache Spark based on Hive ACID
Stars: ✭ 91 (-64.17%)
hadoopoffice: HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
Stars: ✭ 56 (-77.95%)
daf-kylo: Kylo integration with PDND (previously DAF).
Stars: ✭ 20 (-92.13%)
learning-spark: Tidy up Spark and Hadoop tutorials.
Stars: ✭ 28 (-88.98%)
spark-word2vec: A parallel implementation of word2vec based on Spark
Stars: ✭ 24 (-90.55%)
columnify: Convert record-oriented data to columnar format.
Stars: ✭ 28 (-88.98%)
trembita: Model complex data transformation pipelines easily
Stars: ✭ 44 (-82.68%)
shamash: Autoscaling for Google Cloud Dataproc
Stars: ✭ 31 (-87.8%)
Notes: Study notes | Java fundamentals, JVM, source code, big data, interview experience
Stars: ✭ 69 (-72.83%)
gan deeplearning4j: Automatic feature engineering with Generative Adversarial Networks using Deeplearning4j and Apache Spark.
Stars: ✭ 19 (-92.52%)
dllib: dllib is a distributed deep learning library running on Apache Spark
Stars: ✭ 32 (-87.4%)
spark-extension: A library that provides useful extensions to Apache Spark and PySpark.
Stars: ✭ 25 (-90.16%)
anovos: Anovos - An Open Source Library for Scalable Feature Engineering Using Apache Spark
Stars: ✭ 77 (-69.69%)
dockerfiles: Multiple Docker container images for the main Big Data tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )
Stars: ✭ 29 (-88.58%)
Search Ads Web Service: Online search advertisement platform & Realtime Campaign Monitoring [Maybe Deprecated]
Stars: ✭ 30 (-88.19%)
StreamBench: Measuring the performance of popular streaming engines with Yahoo's Streaming Benchmark
Stars: ✭ 52 (-79.53%)
bigdata-fun: A complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-94.49%)
the-apache-ignite-book: All code samples, scripts and more in-depth examples for The Apache Ignite Book. Includes Apache Ignite 2.6 or above
Stars: ✭ 65 (-74.41%)
zdh web: Big data collection and extraction platform
Stars: ✭ 292 (+14.96%)
jhdf: A pure Java HDF5 library
Stars: ✭ 83 (-67.32%)
amas: Amas is a recursive acronym for “Amas, monitor alert system”.
Stars: ✭ 77 (-69.69%)
dt-sql-parser: SQL Parsers for BigData, built with antlr4.
Stars: ✭ 135 (-46.85%)
Casper: A compiler for automatically re-targeting sequential Java code to Apache Spark.
Stars: ✭ 45 (-82.28%)
spark-util: Low-level helpers for Apache Spark libraries and tests
Stars: ✭ 16 (-93.7%)
greycat: GreyCat - Data Analytics, Temporal data, What-if, Live machine learning
Stars: ✭ 104 (-59.06%)
openverse-catalog: Identifies and collects data on CC-licensed content across web crawl data and public APIs.
Stars: ✭ 27 (-89.37%)
smolder: HL7 Apache Spark Datasource
Stars: ✭ 33 (-87.01%)
lectures-hse-spark: Scalable machine learning and big data analysis with Apache Spark
Stars: ✭ 20 (-92.13%)
TiBigData: TiDB connectors for Flink/Hive/Presto
Stars: ✭ 192 (-24.41%)
awesome-AI-kubernetes: ❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++ etc
Stars: ✭ 95 (-62.6%)
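awesome-coder-resources: A refueling station on the programming road! [Continuously updated... stars welcome, come back and visit often...] [Contents: programming/learning/reading resources, open source projects, interview questions, websites, books, blogs, tutorials, and more]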
Stars: ✭ 54 (-78.74%)
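Book: This project collects some good books I have read or heard about over the years. I came across them while tidying my files; deleting them seemed a pity, yet keeping them wasted space. In the spirit of sharing, I am making them available, both to provide these books to readers who need them and to build up something like a knowledge base.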
Stars: ✭ 47 (-81.5%)