224 open source projects that are alternatives to or similar to jhdf

ROOT I/O in pure Python and NumPy.

Stars: ✭ 312 (+275.9%)

Mutual labels: bigdata, file-format

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualize and explore big tabular data at a billion rows per second 🚀

Stars: ✭ 6,793 (+8084.34%)

Mutual labels: bigdata, hdf5

ROOT I/O in pure Python and NumPy.

Stars: ✭ 80 (-3.61%)

Mutual labels: bigdata, file-format

RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark

Stars: ✭ 215 (+159.04%)

Mutual labels: bigdata

Simple It English

Simple-IT-English: smart wordbook from community for community

Stars: ✭ 233 (+180.72%)

Mutual labels: bigdata

Clustering4Ever

C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.

Stars: ✭ 126 (+51.81%)

Mutual labels: bigdata

2019 egu workshop jupyter notebooks

Short course on interactive analysis of Big Earth Data with Jupyter Notebooks

Stars: ✭ 29 (-65.06%)

Mutual labels: bigdata

Awesome Learning

Hands-on source code repository: https://github.com/jast90/bigdata. Search for "Jast" on WeChat and follow the official account for the latest technical posts 😯.

Stars: ✭ 197 (+137.35%)

Mutual labels: bigdata

Sample code for typical use cases encountered in Java learning and projects

Stars: ✭ 21 (-74.7%)

Mutual labels: bigdata

☕️ Java basics 👫 Object-oriented thinking ✏️ Algorithms 📝 Operating systems ☁️ Networking 💾 Databases 🙊 Spring 💡 System architecture 🐘 Big data

Stars: ✭ 160 (+92.77%)

Mutual labels: bigdata

AthenaCLI is a CLI tool for AWS Athena service that can do auto-completion and syntax highlighting.

Stars: ✭ 151 (+81.93%)

Mutual labels: bigdata

Aws Etl Orchestrator

A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.

Stars: ✭ 245 (+195.18%)

Mutual labels: bigdata

Analyze zipfile, either local, or from url

Stars: ✭ 25 (-69.88%)

Mutual labels: file-format

An open-source big data platform designed and optimized for the Internet of Things (IoT).

Stars: ✭ 17,434 (+20904.82%)

Mutual labels: bigdata

GreyCat - Data Analytics, Temporal data, What-if, Live machine learning

Stars: ✭ 104 (+25.3%)

Mutual labels: bigdata

An end-to-end machine learning and data mining framework on Hadoop

Stars: ✭ 207 (+149.4%)

Mutual labels: bigdata

bigquery-data-lineage

Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.

Stars: ✭ 112 (+34.94%)

Mutual labels: bigdata

Based on Apache Flink; supports data synchronization/integration and streaming SQL computation.

Stars: ✭ 2,651 (+3093.98%)

Mutual labels: bigdata

Public EMsoft repository

Stars: ✭ 44 (-46.99%)

Mutual labels: hdf5

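The most complete collection of Java technical knowledge points, plus Java source code analysis. Doing my part to contribute to open source.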

Stars: ✭ 154 (+85.54%)

Mutual labels: bigdata

Library of PH5 clients, apis, and utilities

Stars: ✭ 14 (-83.13%)

Mutual labels: hdf5

lectures-hse-spark

Scalable machine learning and big data analysis with Apache Spark

Stars: ✭ 20 (-75.9%)

Mutual labels: bigdata

An easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain faster business insights.

Stars: ✭ 1,850 (+2128.92%)

Mutual labels: bigdata

Spark-MLlib-Tutorial

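A comprehensive walkthrough of the basic algorithms in Spark MLlib, the machine learning library of the Spark big data framework, with a complete set of test files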

Stars: ✭ 32 (-61.45%)

Mutual labels: bigdata

Azure Event Hubs Spark

Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs

Stars: ✭ 140 (+68.67%)

Mutual labels: bigdata

Monitor twitter stream

Stars: ✭ 133 (+60.24%)

Mutual labels: bigdata

Every Single Day I Tldr

A daily digest of the articles or videos I've found interesting, that I want to share with you.

Stars: ✭ 249 (+200%)

Mutual labels: bigdata

awesome-coder-resources

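A refueling station on the programming road! [Continuously updated; stars welcome, come back often] [Contents: programming/learning/reading resources, open source projects, interview questions, websites, books, blogs, tutorials, and more]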

Stars: ✭ 54 (-34.94%)

Mutual labels: bigdata

Python clone of Spark, a MapReduce-like framework in Python

Stars: ✭ 2,668 (+3114.46%)

Mutual labels: bigdata

jupyterlab-h5web

A JupyterLab extension to explore and visualize HDF5 file contents. Based on https://github.com/silx-kit/h5web.

Stars: ✭ 41 (-50.6%)

Mutual labels: hdf5

Hadoop Attack Library

A collection of pentest tools and resources targeting Hadoop environments

Stars: ✭ 228 (+174.7%)

Mutual labels: bigdata

chatnoir-resiliparse

A robust web archive analytics toolkit

Stars: ✭ 26 (-68.67%)

Mutual labels: bigdata

Asynchronous HBase client for NodeJs using REST

Stars: ✭ 226 (+172.29%)

Mutual labels: bigdata

Albis: High-Performance File Format for Big Data Systems

Stars: ✭ 20 (-75.9%)

Mutual labels: file-format

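The Lazy Squirrel (懒松鼠) Flink-Boot scaffold lets Flink fully embrace the Spring ecosystem, so developers can build distributed stream-processing programs using the Java web development model; Lazy Squirrel makes crossing domains simpler. It aims to let developers write business code quickly at a lower onboarding cost (without needing to understand distributed computing theory or the details of the Flink framework). To further improve agility when developing large projects with the scaffold, it integrates the Spring framework for bean management by default, and also integrates frameworks commonly used in microservice and web development to speed up development further, such as the MyBatis ORM framework, the Hibernate Validator validation framework, and the Spring Retry framework; see the scaffold features below for details.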

Stars: ✭ 209 (+151.81%)

Mutual labels: bigdata

A collection of personal notes: technical notes recorded in a quick, rough, and intense style .. 📚☕️⌨️🎧

Stars: ✭ 61 (-26.51%)

Mutual labels: bigdata

Javaorbigdata Interview

A compilation of interview knowledge points for Java developers or big data developers

Stars: ✭ 203 (+144.58%)

Mutual labels: bigdata

ReClassicfication

Maybe one day a WINE-style implementation of the classic Mac Toolbox.

Stars: ✭ 29 (-65.06%)

Mutual labels: file-format

Kotlin Spark Api

This project gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x

Stars: ✭ 183 (+120.48%)

Mutual labels: bigdata

Thoughts on an interview question: finding the intersection of 60 million and 3 million data packets in an environment with 50 MB of memory

Stars: ✭ 54 (-34.94%)

Mutual labels: bigdata

Bigdata practice

Big data analysis and visualization in practice

Stars: ✭ 166 (+100%)

Mutual labels: bigdata

Fast writing of numpy 3d-arrays into HDF5 Fiji/BigDataViewer files.

Stars: ✭ 25 (-69.88%)

Mutual labels: hdf5

MATLAB library for non-negative matrix factorization (NMF): Version 1.8.1

Stars: ✭ 153 (+84.34%)

Mutual labels: bigdata

twitter-archive-reader

Full featured TypeScript Twitter archive reader and browser

Stars: ✭ 43 (-48.19%)

Mutual labels: bigdata

Upserts, Deletes And Incremental Processing on Big Data.

Stars: ✭ 2,586 (+3015.66%)

Mutual labels: bigdata

Neuroscience information exchange format

Stars: ✭ 64 (-22.89%)

Mutual labels: file-format

Apache Avro is a data serialization system.

Stars: ✭ 2,005 (+2315.66%)

Mutual labels: bigdata

Hayabusa: Simple and Fast Full-Text Search Engine for Massive System Log Data

Stars: ✭ 43 (-48.19%)

Mutual labels: bigdata

🐳 big data study

Stars: ✭ 141 (+69.88%)

Mutual labels: bigdata

SQL Parsers for BigData, built with antlr4.

Stars: ✭ 135 (+62.65%)

Mutual labels: bigdata

Ecommercerecommendsystem

Real-time big data product recommendation system. Frontend: Vue + TypeScript + ElementUI; backend: Spring + Spark

Stars: ✭ 139 (+67.47%)

Mutual labels: bigdata

🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

Stars: ✭ 1,351 (+1527.71%)

Mutual labels: bigdata

Big data study notes, learning roadmap, and a collection of technical case studies.

Stars: ✭ 37 (-55.42%)

Mutual labels: bigdata

TipDM modeling platform, an open source data mining tool.

Stars: ✭ 130 (+56.63%)

Mutual labels: bigdata

Examples for gauravbytes.com

Stars: ✭ 57 (-31.33%)

Mutual labels: bigdata

.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.

Stars: ✭ 1,721 (+1973.49%)

Mutual labels: bigdata

Sort files and pack them into partitions

Stars: ✭ 127 (+53.01%)

Mutual labels: bigdata

An open-source, cloud-ready web application for simplified deployment of big data workflows.

Stars: ✭ 26 (-68.67%)

Mutual labels: bigdata

Amas is a recursive acronym for “Amas, monitor alert system”.

Stars: ✭ 77 (-7.23%)

Mutual labels: bigdata

A C++ wrapper of the matio library, with memory ownership handling, to read and write .mat files.

Stars: ✭ 24 (-71.08%)

Mutual labels: hdf5

1-60 of 224 similar projects