All Projects → chatnoir-resiliparse → Similar Projects or Alternatives

225 Open source projects that are alternatives of or similar to chatnoir-resiliparse

mixnode-warcreader-php
Read Web ARChive (WARC) files in PHP.
Stars: ✭ 20 (-23.08%)
Mutual labels:  warc, webarchive
node-warc
Parse And Create Web ARChive (WARC) files with node.js
Stars: ✭ 69 (+165.38%)
Mutual labels:  warc, webarchive
Poli
An easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain faster business insights.
Stars: ✭ 1,850 (+7015.38%)
Mutual labels:  bigdata
Tdengine
An open-source big data platform designed and optimized for the Internet of Things (IoT).
Stars: ✭ 17,434 (+66953.85%)
Mutual labels:  bigdata
Volcano
A Cloud Native Batch System (Project under CNCF)
Stars: ✭ 2,114 (+8030.77%)
Mutual labels:  bigdata
Javainterview
最全的Java技术知识点,以及Java源码分析。为开源贡献自己的一份力。
Stars: ✭ 154 (+492.31%)
Mutual labels:  bigdata
Aws Etl Orchestrator
A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
Stars: ✭ 245 (+842.31%)
Mutual labels:  bigdata
Twitwork
Monitor twitter stream
Stars: ✭ 133 (+411.54%)
Mutual labels:  bigdata
vandal
Navigator for Web Archive
Stars: ✭ 146 (+461.54%)
Mutual labels:  webarchive
Lambda Arch
Applying Lambda Architecture with Spark, Kafka, and Cassandra.
Stars: ✭ 111 (+326.92%)
Mutual labels:  bigdata
Shifu
An end-to-end machine learning and data mining framework on Hadoop
Stars: ✭ 207 (+696.15%)
Mutual labels:  bigdata
Daudit
🌲 Configuration flaws detector for Hadoop, MongoDB, MySQL, and more!
Stars: ✭ 108 (+315.38%)
Mutual labels:  bigdata
Java Notes
☕️ Java 基础 👫 面向对象思想✏️ 算法 📝 操作系统 ☁️ 网络 💾 数据库 🙊 Spring 💡 系统架构🐘大数据
Stars: ✭ 160 (+515.38%)
Mutual labels:  bigdata
bigdatatutorial
bigdatatutorial
Stars: ✭ 34 (+30.77%)
Mutual labels:  bigdata
Athenacli
AthenaCLI is a CLI tool for AWS Athena service that can do auto-completion and syntax highlighting.
Stars: ✭ 151 (+480.77%)
Mutual labels:  bigdata
hayabusa
Hayabusa: Simple and Fast Full-Text Search Engine for Massive System Log Data
Stars: ✭ 43 (+65.38%)
Mutual labels:  bigdata
Azure Event Hubs Spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (+438.46%)
Mutual labels:  bigdata
Simple It English
Simple-IT-English: smart wordbook from community for community
Stars: ✭ 233 (+796.15%)
Mutual labels:  bigdata
Spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+6519.23%)
Mutual labels:  bigdata
young-examples
java学习和项目中一些典型的应用场景样例代码
Stars: ✭ 21 (-19.23%)
Mutual labels:  bigdata
Liteflow
liteflow是一个基于任务版本来实现的分布式任务流调度系统
Stars: ✭ 112 (+330.77%)
Mutual labels:  bigdata
Sparkrdma
RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (+726.92%)
Mutual labels:  bigdata
Flinkstreamsql
基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法
Stars: ✭ 1,682 (+6369.23%)
Mutual labels:  bigdata
Spark-MLlib-Tutorial
大数据框架 Spark MLlib 机器学习库基础算法全面讲解,附带齐全的测试文件
Stars: ✭ 32 (+23.08%)
Mutual labels:  bigdata
Tennis Crystal Ball
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (+311.54%)
Mutual labels:  bigdata
Awesome Learning
实践源码库:https://github.com/jast90/bigdata 。 微信搜索Jast关注公众号,获取最新技术分享😯。
Stars: ✭ 197 (+657.69%)
Mutual labels:  bigdata
Flink Notes
flink学习笔记
Stars: ✭ 106 (+307.69%)
Mutual labels:  bigdata
Bigdata practice
大数据分析可视化实践
Stars: ✭ 166 (+538.46%)
Mutual labels:  bigdata
workflUX
An open-source, cloud-ready web application for simplified deployment of big data workflows.
Stars: ✭ 26 (+0%)
Mutual labels:  bigdata
Nmflibrary
MATLAB library for non-negative matrix factorization (NMF): Version 1.8.1
Stars: ✭ 153 (+488.46%)
Mutual labels:  bigdata
php-article-extractor
A PHP library to extract article text from web pages
Stars: ✭ 28 (+7.69%)
Mutual labels:  extraction
Hudi
Upserts, Deletes And Incremental Processing on Big Data.
Stars: ✭ 2,586 (+9846.15%)
Mutual labels:  bigdata
Every Single Day I Tldr
A daily digest of the articles or videos I've found interesting, that I want to share with you.
Stars: ✭ 249 (+857.69%)
Mutual labels:  bigdata
Avro
Apache Avro is a data serialization system.
Stars: ✭ 2,005 (+7611.54%)
Mutual labels:  bigdata
intersect
一道面试题的思考 - 6000万数据包和300万数据包在50M内存使用环境中求交集
Stars: ✭ 54 (+107.69%)
Mutual labels:  bigdata
Big Data Study
🐳 big data study
Stars: ✭ 141 (+442.31%)
Mutual labels:  bigdata
Dpark
Python clone of Spark, a MapReduce alike framework in Python
Stars: ✭ 2,668 (+10161.54%)
Mutual labels:  bigdata
Ecommercerecommendsystem
商品大数据实时推荐系统。前端:Vue + TypeScript + ElementUI,后端 Spring + Spark
Stars: ✭ 139 (+434.62%)
Mutual labels:  bigdata
pnextract
Pore network extraction from micro-CT images of porous media
Stars: ✭ 43 (+65.38%)
Mutual labels:  extraction
Tipdm
TipDM建模平台,开源的数据挖掘工具。
Stars: ✭ 130 (+400%)
Mutual labels:  bigdata
Hadoop Attack Library
A collection of pentest tools and resources targeting Hadoop environments
Stars: ✭ 228 (+776.92%)
Mutual labels:  bigdata
Fpart
Sort files and pack them into partitions
Stars: ✭ 127 (+388.46%)
Mutual labels:  bigdata
PersonNotes
个人笔记集中营,快糙猛的形式记录技术性Notes .. 📚☕️⌨️🎧
Stars: ✭ 61 (+134.62%)
Mutual labels:  bigdata
Hadoopcryptoledger
Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Stars: ✭ 126 (+384.62%)
Mutual labels:  bigdata
Node Hbase
Asynchronous HBase client for NodeJs using REST
Stars: ✭ 226 (+769.23%)
Mutual labels:  bigdata
Genie
Distributed Big Data Orchestration Service
Stars: ✭ 1,544 (+5838.46%)
Mutual labels:  bigdata
xkcd-2048
No description or website provided.
Stars: ✭ 12 (-53.85%)
Mutual labels:  extraction
Books
技术书籍等
Stars: ✭ 110 (+323.08%)
Mutual labels:  bigdata
Flink Boot
懒松鼠Flink-Boot 脚手架让Flink全面拥抱Spring生态体系,使得开发者可以以Java WEB开发模式开发出分布式运行的流处理程序,懒松鼠让跨界变得更加简单。懒松鼠旨在让开发者以更底上手成本(不需要理解分布式计算的理论知识和Flink框架的细节)便可以快速编写业务代码实现。为了进一步提升开发者使用懒松鼠脚手架开发大型项目的敏捷的度,该脚手架默认集成Spring框架进行Bean管理,同时将微服务以及WEB开发领域中经常用到的框架集成进来,进一步提升开发速度。比如集成Mybatis ORM框架,Hibernate Validator校验框架,Spring Retry重试框架等,具体见下面的脚手架特性。
Stars: ✭ 209 (+703.85%)
Mutual labels:  bigdata
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (+319.23%)
Mutual labels:  bigdata
twitter-archive-reader
Full featured TypeScript Twitter archive reader and browser
Stars: ✭ 43 (+65.38%)
Mutual labels:  bigdata
Awesome Bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 10,478 (+40200%)
Mutual labels:  bigdata
Javaorbigdata Interview
Java开发者或者大数据开发者面试知识点整理
Stars: ✭ 203 (+680.77%)
Mutual labels:  bigdata
Griddb
GridDB is a next-generation open source database that makes time series IoT and big data fast,and easy.
Stars: ✭ 1,587 (+6003.85%)
Mutual labels:  bigdata
optimus
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+5096.15%)
Mutual labels:  bigdata
Sparktutorial
Source code for James Lee's Aparch Spark with Java course
Stars: ✭ 105 (+303.85%)
Mutual labels:  bigdata
Kotlin Spark Api
This projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
Stars: ✭ 183 (+603.85%)
Mutual labels:  bigdata
Clustering4Ever
C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Stars: ✭ 126 (+384.62%)
Mutual labels:  bigdata
bigquery-data-lineage
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
Stars: ✭ 112 (+330.77%)
Mutual labels:  bigdata
Outlaw
JSON mapper for macOS, iOS, tvOS, and watchOS
Stars: ✭ 24 (-7.69%)
Mutual labels:  extraction
1-60 of 225 similar projects