All Projects → big-data-exploration → Similar Projects or Alternatives

432 Open source projects that are alternatives of or similar to big-data-exploration

gomrjob
gomrjob - a Go Framework for Hadoop Map Reduce Jobs
Stars: ✭ 39 (-9.3%)
Mutual labels:  hadoop
hadoop-ansible
Install hadoop cluster with ansible
Stars: ✭ 35 (-18.6%)
Mutual labels:  hadoop
dagpi
Dagpi is a powerful and fast api that does image manipulation as well as serves datasets. It is fast and written in rust and python. Perfect for discord bots, social media apps, camera apps and more.
Stars: ✭ 25 (-41.86%)
Mutual labels:  datasets
TonY
TonY is a framework to natively run deep learning frameworks on Apache Hadoop.
Stars: ✭ 687 (+1497.67%)
Mutual labels:  hadoop
RecommendationEngine
Source code and dataset for paper "CBMR: An optimized MapReduce for item‐based collaborative filtering recommendation algorithm with empirical analysis"
Stars: ✭ 43 (+0%)
Mutual labels:  hadoop
HDFS-Netdisc
基于Hadoop的分布式云存储系统 🌴
Stars: ✭ 56 (+30.23%)
Mutual labels:  hadoop
firestore-to-bigquery-export
NPM package for copying and converting Cloud Firestore data to BigQuery.
Stars: ✭ 26 (-39.53%)
Mutual labels:  datasets
Data-pipeline-project
Data pipeline project
Stars: ✭ 18 (-58.14%)
Mutual labels:  hadoop
transfermarkt-datasets
⚽️ Extract, prepare and publish Transfermarkt datasets.
Stars: ✭ 60 (+39.53%)
Mutual labels:  datasets
bigdata-doc
大数据学习笔记,学习路线,技术案例整理。
Stars: ✭ 37 (-13.95%)
Mutual labels:  hadoop
geodaData
Data package for accessing GeoDa datasets using R
Stars: ✭ 15 (-65.12%)
Mutual labels:  datasets
phoenix
Apache Phoenix / Hbase Spring Boot Microservices
Stars: ✭ 23 (-46.51%)
Mutual labels:  hadoop
biomechanics dataset
Information of public available data sets for biomechanics.
Stars: ✭ 31 (-27.91%)
Mutual labels:  datasets
delitos-caba
🚓 Crime dataset for the City of Buenos Aires, Argentina
Stars: ✭ 44 (+2.33%)
Mutual labels:  datasets
Thirukkural-Tamil-Dataset
திருக்குறள் by திருவள்ளுவர்.
Stars: ✭ 44 (+2.33%)
Mutual labels:  datasets
teraslice
Scalable data processing pipelines in JavaScript
Stars: ✭ 48 (+11.63%)
Mutual labels:  hadoop
CHR
SIXray : A Large-scale Security Inspection X-ray Benchmark in CVPR 2019
Stars: ✭ 78 (+81.4%)
Mutual labels:  datasets
git-rdm
A research data management plugin for the Git version control system.
Stars: ✭ 34 (-20.93%)
Mutual labels:  datasets
torchgeo
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Stars: ✭ 1,125 (+2516.28%)
Mutual labels:  datasets
morghulis
No description or website provided.
Stars: ✭ 18 (-58.14%)
Mutual labels:  datasets
learning-hadoop-and-spark
Companion to Learning Hadoop and Learning Spark courses on Linked In Learning
Stars: ✭ 146 (+239.53%)
Mutual labels:  hadoop
isarn-sketches-spark
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Stars: ✭ 28 (-34.88%)
Mutual labels:  datasets
the-apache-ignite-book
All code samples, scripts and more in-depth examples for The Apache Ignite Book. Include Apache Ignite 2.6 or above
Stars: ✭ 65 (+51.16%)
Mutual labels:  hadoop
Scene-Text-Recognition-Recommendations
Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining
Stars: ✭ 215 (+400%)
Mutual labels:  datasets
humanflow2
Official repository of Learning Multi-Human Optical Flow (IJCV 2019)
Stars: ✭ 37 (-13.95%)
Mutual labels:  datasets
CompBioDatasetsForMachineLearning
A Curated List of Computational Biology Datasets Suitable for Machine Learning
Stars: ✭ 90 (+109.3%)
Mutual labels:  datasets
data.world-r
R library for data.world
Stars: ✭ 59 (+37.21%)
Mutual labels:  datasets
iis
Information Inference Service of the OpenAIRE system
Stars: ✭ 16 (-62.79%)
Mutual labels:  hadoop
data.world-py
Python package for data.world
Stars: ✭ 98 (+127.91%)
Mutual labels:  datasets
qs-hadoop
大数据生态圈学习
Stars: ✭ 18 (-58.14%)
Mutual labels:  hadoop
yarn-prometheus-exporter
Export Hadoop YARN (resource-manager) metrics in prometheus format
Stars: ✭ 44 (+2.33%)
Mutual labels:  hadoop
hive to es
同步Hive数据仓库数据到Elasticsearch的小工具
Stars: ✭ 21 (-51.16%)
Mutual labels:  hadoop
clothing-detection-ecommerce-dataset
Clothing detection dataset
Stars: ✭ 43 (+0%)
Mutual labels:  datasets
LogAnalyzeHelper
论坛日志分析系统清洗程序(包含IP规则库,UDF开发,MapReduce程序,日志数据)
Stars: ✭ 33 (-23.26%)
Mutual labels:  hadoop
industrial-ml-datasets
A curated list of datasets, publically available for machine learning research in the area of manufacturing
Stars: ✭ 45 (+4.65%)
Mutual labels:  datasets
mlx
Machine Learning eXchange (MLX). Data and AI Assets Catalog and Execution Engine
Stars: ✭ 132 (+206.98%)
Mutual labels:  datasets
JavaFramework
Simple Java Framework,designed for easily develop Spring based java program.Support Bigdata And metadata management.A common elasticsearch comm query tool and so on.
Stars: ✭ 16 (-62.79%)
Mutual labels:  hadoop
dockerfiles
Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )
Stars: ✭ 29 (-32.56%)
Mutual labels:  hadoop
beanszoo
Distributed Java micro-services using ZooKeeper
Stars: ✭ 12 (-72.09%)
Mutual labels:  hadoop
smart-data-lake
Smart Automation Tool for building modern Data Lakes and Data Pipelines
Stars: ✭ 79 (+83.72%)
Mutual labels:  hadoop
orion
Management and automation platform for Stateful Distributed Systems
Stars: ✭ 77 (+79.07%)
Mutual labels:  hadoop
rs datasets
Tool for autodownloading recommendation systems datasets
Stars: ✭ 22 (-48.84%)
Mutual labels:  datasets
scrapeOP
A python package for scraping oddsportal.com
Stars: ✭ 99 (+130.23%)
Mutual labels:  datasets
dh-core
Functional data science
Stars: ✭ 123 (+186.05%)
Mutual labels:  datasets
thermostat
Collection of NLP model explanations and accompanying analysis tools
Stars: ✭ 126 (+193.02%)
Mutual labels:  datasets
Google-Playstore-Dataset
Google PlayStore App dataset. (2.3 million App Data) and 24 attributes
Stars: ✭ 27 (-37.21%)
Mutual labels:  datasets
awesome-dynamic-graphs
A collection of resources on dynamic/streaming/temporal/evolving graph processing systems, databases, data structures, datasets, and related academic and industrial work
Stars: ✭ 89 (+106.98%)
Mutual labels:  datasets
metadat
Meta-analytic datasets for R
Stars: ✭ 21 (-51.16%)
Mutual labels:  datasets
awesome-mobile-robotics
Useful links of different content related to AI, Computer Vision, and Robotics.
Stars: ✭ 243 (+465.12%)
Mutual labels:  datasets
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-9.3%)
Mutual labels:  hadoop
ambari-hdp-docker
Dockerfiles and Docker Compose for HDP 2.6 with Blueprints
Stars: ✭ 23 (-46.51%)
Mutual labels:  hadoop
openPDC
Open Source Phasor Data Concentrator
Stars: ✭ 109 (+153.49%)
Mutual labels:  hadoop
kafka-connect-fs
Kafka Connect FileSystem Connector
Stars: ✭ 107 (+148.84%)
Mutual labels:  hadoop
hive-bigquery-storage-handler
Hive Storage Handler for interoperability between BigQuery and Apache Hive
Stars: ✭ 16 (-62.79%)
Mutual labels:  hadoop
webhdfs
Node.js WebHDFS REST API client
Stars: ✭ 88 (+104.65%)
Mutual labels:  hadoop
open2ch-dialogue-corpus
おーぷん2ちゃんねるをクロールして作成した対話コーパス
Stars: ✭ 65 (+51.16%)
Mutual labels:  datasets
bugrepo
A collection of publicly available bug reports
Stars: ✭ 93 (+116.28%)
Mutual labels:  datasets
DiscEval
Discourse Based Evaluation of Language Understanding
Stars: ✭ 18 (-58.14%)
Mutual labels:  datasets
scRNAseq cell cluster labeling
Scripts to run and benchmark scRNA-seq cell cluster labeling methods
Stars: ✭ 41 (-4.65%)
Mutual labels:  datasets
dpkb
大数据相关内容汇总,包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词:Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse
Stars: ✭ 123 (+186.05%)
Mutual labels:  hadoop
1-60 of 432 similar projects