gomrjobgomrjob - a Go Framework for Hadoop Map Reduce Jobs
Stars: ✭ 39 (-9.3%)
hadoop-ansibleInstall hadoop cluster with ansible
Stars: ✭ 35 (-18.6%)
dagpiDagpi is a powerful and fast api that does image manipulation as well as serves datasets. It is fast and written in rust and python. Perfect for discord bots, social media apps, camera apps and more.
Stars: ✭ 25 (-41.86%)
TonYTonY is a framework to natively run deep learning frameworks on Apache Hadoop.
Stars: ✭ 687 (+1497.67%)
RecommendationEngineSource code and dataset for paper "CBMR: An optimized MapReduce for item‐based collaborative filtering recommendation algorithm with empirical analysis"
Stars: ✭ 43 (+0%)
geodaDataData package for accessing GeoDa datasets using R
Stars: ✭ 15 (-65.12%)
phoenixApache Phoenix / Hbase Spring Boot Microservices
Stars: ✭ 23 (-46.51%)
biomechanics datasetInformation of public available data sets for biomechanics.
Stars: ✭ 31 (-27.91%)
delitos-caba🚓 Crime dataset for the City of Buenos Aires, Argentina
Stars: ✭ 44 (+2.33%)
terasliceScalable data processing pipelines in JavaScript
Stars: ✭ 48 (+11.63%)
CHRSIXray : A Large-scale Security Inspection X-ray Benchmark in CVPR 2019
Stars: ✭ 78 (+81.4%)
git-rdmA research data management plugin for the Git version control system.
Stars: ✭ 34 (-20.93%)
torchgeoTorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Stars: ✭ 1,125 (+2516.28%)
morghulisNo description or website provided.
Stars: ✭ 18 (-58.14%)
learning-hadoop-and-sparkCompanion to Learning Hadoop and Learning Spark courses on Linked In Learning
Stars: ✭ 146 (+239.53%)
isarn-sketches-sparkRoutines and data structures for using isarn-sketches idiomatically in Apache Spark
Stars: ✭ 28 (-34.88%)
the-apache-ignite-bookAll code samples, scripts and more in-depth examples for The Apache Ignite Book. Include Apache Ignite 2.6 or above
Stars: ✭ 65 (+51.16%)
humanflow2Official repository of Learning Multi-Human Optical Flow (IJCV 2019)
Stars: ✭ 37 (-13.95%)
iisInformation Inference Service of the OpenAIRE system
Stars: ✭ 16 (-62.79%)
data.world-pyPython package for data.world
Stars: ✭ 98 (+127.91%)
hive to es同步Hive数据仓库数据到Elasticsearch的小工具
Stars: ✭ 21 (-51.16%)
LogAnalyzeHelper论坛日志分析系统清洗程序(包含IP规则库,UDF开发,MapReduce程序,日志数据)
Stars: ✭ 33 (-23.26%)
industrial-ml-datasetsA curated list of datasets, publically available for machine learning research in the area of manufacturing
Stars: ✭ 45 (+4.65%)
mlxMachine Learning eXchange (MLX). Data and AI Assets Catalog and Execution Engine
Stars: ✭ 132 (+206.98%)
JavaFrameworkSimple Java Framework,designed for easily develop Spring based java program.Support Bigdata And metadata management.A common elasticsearch comm query tool and so on.
Stars: ✭ 16 (-62.79%)
dockerfilesMulti docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )
Stars: ✭ 29 (-32.56%)
beanszooDistributed Java micro-services using ZooKeeper
Stars: ✭ 12 (-72.09%)
smart-data-lakeSmart Automation Tool for building modern Data Lakes and Data Pipelines
Stars: ✭ 79 (+83.72%)
orionManagement and automation platform for Stateful Distributed Systems
Stars: ✭ 77 (+79.07%)
rs datasetsTool for autodownloading recommendation systems datasets
Stars: ✭ 22 (-48.84%)
scrapeOPA python package for scraping oddsportal.com
Stars: ✭ 99 (+130.23%)
dh-coreFunctional data science
Stars: ✭ 123 (+186.05%)
thermostatCollection of NLP model explanations and accompanying analysis tools
Stars: ✭ 126 (+193.02%)
awesome-dynamic-graphsA collection of resources on dynamic/streaming/temporal/evolving graph processing systems, databases, data structures, datasets, and related academic and industrial work
Stars: ✭ 89 (+106.98%)
metadatMeta-analytic datasets for R
Stars: ✭ 21 (-51.16%)
awesome-mobile-roboticsUseful links of different content related to AI, Computer Vision, and Robotics.
Stars: ✭ 243 (+465.12%)
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-9.3%)
ambari-hdp-dockerDockerfiles and Docker Compose for HDP 2.6 with Blueprints
Stars: ✭ 23 (-46.51%)
openPDCOpen Source Phasor Data Concentrator
Stars: ✭ 109 (+153.49%)
webhdfsNode.js WebHDFS REST API client
Stars: ✭ 88 (+104.65%)
bugrepoA collection of publicly available bug reports
Stars: ✭ 93 (+116.28%)
DiscEvalDiscourse Based Evaluation of Language Understanding
Stars: ✭ 18 (-58.14%)
dpkb大数据相关内容汇总,包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词:Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse
Stars: ✭ 123 (+186.05%)