AeroBulk is a modern-FORTRAN-based package/library that gathers state-of-the-art aerodynamic bulk formulae algorithms used to compute turbulent air-sea fluxes of momentum, heat and freshwater.

Stars: ✭ 24 (-17.24%)

Mutual labels: climate-data

Kotlin Spark Api

This projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x

Stars: ✭ 183 (+531.03%)

Mutual labels: bigdata

PersonNotes

个人笔记集中营，快糙猛的形式记录技术性Notes .. 📚☕️⌨️🎧

Stars: ✭ 61 (+110.34%)

Mutual labels: bigdata

Bigdata practice

大数据分析可视化实践

Stars: ✭ 166 (+472.41%)

Mutual labels: bigdata

optimus

🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

Stars: ✭ 1,351 (+4558.62%)

Mutual labels: bigdata

Nmflibrary

MATLAB library for non-negative matrix factorization (NMF): Version 1.8.1

Stars: ✭ 153 (+427.59%)

Mutual labels: bigdata

TiBigData

TiDB connectors for Flink/Hive/Presto

Stars: ✭ 192 (+562.07%)

Mutual labels: bigdata

Hudi

Upserts, Deletes And Incremental Processing on Big Data.

Stars: ✭ 2,586 (+8817.24%)

Mutual labels: bigdata

workflUX

An open-source, cloud-ready web application for simplified deployment of big data workflows.

Stars: ✭ 26 (-10.34%)

Mutual labels: bigdata

Avro

Apache Avro is a data serialization system.

Stars: ✭ 2,005 (+6813.79%)

Mutual labels: bigdata

intersect

一道面试题的思考 - 6000万数据包和300万数据包在50M内存使用环境中求交集

Stars: ✭ 54 (+86.21%)

Mutual labels: bigdata

Big Data Study

🐳 big data study

Stars: ✭ 141 (+386.21%)

Mutual labels: bigdata

Every Single Day I Tldr

A daily digest of the articles or videos I've found interesting, that I want to share with you.

Stars: ✭ 249 (+758.62%)

Mutual labels: bigdata

Ecommercerecommendsystem

商品大数据实时推荐系统。前端：Vue + TypeScript + ElementUI，后端 Spring + Spark

Stars: ✭ 139 (+379.31%)

Mutual labels: bigdata

hockeystick

Download and Visualize Essential Global Heating Data in R

Stars: ✭ 42 (+44.83%)

Mutual labels: climate-data

Tipdm

TipDM建模平台，开源的数据挖掘工具。

Stars: ✭ 130 (+348.28%)

Mutual labels: bigdata

Dpark

Python clone of Spark, a MapReduce alike framework in Python

Stars: ✭ 2,668 (+9100%)

Mutual labels: bigdata

Fpart

Sort files and pack them into partitions

Stars: ✭ 127 (+337.93%)

Mutual labels: bigdata

twitter-archive-reader

Full featured TypeScript Twitter archive reader and browser

Stars: ✭ 43 (+48.28%)

Mutual labels: bigdata

Hadoopcryptoledger

Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive

Stars: ✭ 126 (+334.48%)

Mutual labels: bigdata

Hadoop Attack Library

A collection of pentest tools and resources targeting Hadoop environments

Stars: ✭ 228 (+686.21%)

Mutual labels: bigdata

Genie

Distributed Big Data Orchestration Service

Stars: ✭ 1,544 (+5224.14%)

Mutual labels: bigdata

chatnoir-resiliparse

A robust web archive analytics toolkit

Stars: ✭ 26 (-10.34%)

Mutual labels: bigdata

Books

技术书籍等

Stars: ✭ 110 (+279.31%)

Mutual labels: bigdata

Node Hbase

Asynchronous HBase client for NodeJs using REST

Stars: ✭ 226 (+679.31%)

Mutual labels: bigdata

Spark R Notebooks

R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks

Stars: ✭ 109 (+275.86%)

Mutual labels: bigdata

hayabusa

Hayabusa: Simple and Fast Full-Text Search Engine for Massive System Log Data

Stars: ✭ 43 (+48.28%)

Mutual labels: bigdata

lectures-hse-spark

Масштабируемое машинное обучение и анализ больших данных с Apache Spark

Stars: ✭ 20 (-31.03%)

Mutual labels: bigdata

bigdata-doc

大数据学习笔记，学习路线，技术案例整理。

Stars: ✭ 37 (+27.59%)

Mutual labels: bigdata

Clustering4Ever

C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.

Stars: ✭ 126 (+334.48%)

Mutual labels: bigdata

Flink Boot

懒松鼠Flink-Boot 脚手架让Flink全面拥抱Spring生态体系，使得开发者可以以Java WEB开发模式开发出分布式运行的流处理程序，懒松鼠让跨界变得更加简单。懒松鼠旨在让开发者以更底上手成本（不需要理解分布式计算的理论知识和Flink框架的细节）便可以快速编写业务代码实现。为了进一步提升开发者使用懒松鼠脚手架开发大型项目的敏捷的度，该脚手架默认集成Spring框架进行Bean管理，同时将微服务以及WEB开发领域中经常用到的框架集成进来，进一步提升开发速度。比如集成Mybatis ORM框架，Hibernate Validator校验框架,Spring Retry重试框架等，具体见下面的脚手架特性。

Stars: ✭ 209 (+620.69%)

Mutual labels: bigdata

Awesome Bigdata

A curated list of awesome big data frameworks, ressources and other awesomeness.

Stars: ✭ 10,478 (+36031.03%)

Mutual labels: bigdata

Shifu

An end-to-end machine learning and data mining framework on Hadoop

Stars: ✭ 207 (+613.79%)

Mutual labels: bigdata

1-60 of 194 similar projects

›