cloud云计算之hadoop、hive、hue、oozie、sqoop、hbase、zookeeper环境搭建及配置文件
Stars: ✭ 48 (+0%)
DataspherestudioDataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+2389.58%)
monopackerA tool for managing builds of monorepo frontend projects with eg. npm- or yarn workspaces, lerna or similar tools into a standalone application - no other tools needed.
Stars: ✭ 17 (-64.58%)
IbisA pandas-like deferred expression system, with first-class SQL support
Stars: ✭ 1,630 (+3295.83%)
DagsterAn orchestration platform for the development, production, and observation of data assets.
Stars: ✭ 4,099 (+8439.58%)
N8nFree and open fair-code licensed node based Workflow Automation Tool. Easily automate tasks across different services.
Stars: ✭ 19,252 (+40008.33%)
hadoopofficeHadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
Stars: ✭ 56 (+16.67%)
yuzhouwanCode Library for My Blog
Stars: ✭ 39 (-18.75%)
leaflet heatmap简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
Stars: ✭ 13 (-72.92%)
user guideThe CWL v1.0 user guide
Stars: ✭ 20 (-58.33%)
alfred-mailtoSend emails to recipients and groups from Alfred
Stars: ✭ 59 (+22.92%)
bistroA library to build and execute typed scientific workflows
Stars: ✭ 43 (-10.42%)
jekyll-deploy-action🪂 A Github Action to deploy the Jekyll site conveniently for GitHub Pages.
Stars: ✭ 162 (+237.5%)
polystoresA library for performing hyperparameter optimization
Stars: ✭ 48 (+0%)
elegant-gitElegant Git is an assistant who carefully automates routine work with Git.
Stars: ✭ 38 (-20.83%)
alfred-relative-datesAlfred workflow to generate relative dates in different locales
Stars: ✭ 30 (-37.5%)
release-notify-actionGitHub Action that triggers e-mails with release notes when these are created
Stars: ✭ 64 (+33.33%)
RWorkflow📑 My approach to an analysis or product produced with R
Stars: ✭ 25 (-47.92%)
zenaton-ruby💎 Ruby gem to run and orchestrate background jobs with Zenaton Workflow Engine
Stars: ✭ 32 (-33.33%)
ACEseqWorkflowAllele-specific copy number estimation with whole genome sequencing
Stars: ✭ 19 (-60.42%)
version-checkAn action that allows you to check whether your npm package version has been updated
Stars: ✭ 65 (+35.42%)
denoflowConfiguration as Code, use YAML to write automated workflows that run on Deno, with any Deno modules, Typescript/Javascript codes
Stars: ✭ 143 (+197.92%)
bentenA language server for Common Workflow Language
Stars: ✭ 50 (+4.17%)
AlfredWorkflowsMy workflow creations for Alfred on macOS.
Stars: ✭ 55 (+14.58%)
spark-utillow-level helpers for Apache Spark libraries and tests
Stars: ✭ 16 (-66.67%)
jenkins-scriptletsUseful groovy scripts that can be used while using Jenkins-CI for workflow automation
Stars: ✭ 16 (-66.67%)
iSkyLIMSis an open-source LIMS (laboratory Information Management System) for Next Generation Sequencing sample management, statistics and reports, and bioinformatics analysis service management.
Stars: ✭ 33 (-31.25%)
craft-text-detectorPackaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector
Stars: ✭ 151 (+214.58%)
sonata-workflowIntegrate Symfony workflow component in Sonata Admin
Stars: ✭ 23 (-52.08%)
simpleflowPython library for dataflow programming.
Stars: ✭ 67 (+39.58%)
action-sync-node-metaGitHub Action that syncs package.json with the repository metadata.
Stars: ✭ 25 (-47.92%)
open-development-templateWorkflow and documentation templates that help teams formalize their goals, workflow and governance model to encourage participation and field contributions.
Stars: ✭ 18 (-62.5%)
AddaxAddax is an open source universal ETL tool that supports most of those RDBMS and NoSQLs on the planet, helping you transfer data from any one place to another.
Stars: ✭ 615 (+1181.25%)
simple-workflowLaravel simple implementation of a complete workflow system, allowing you to focus on your business logic and letting the package do the necessary work to make your workflow system work and easy to customize
Stars: ✭ 58 (+20.83%)
swordfishOpen-source distribute workflow schedule tools, also support streaming task.
Stars: ✭ 35 (-27.08%)
cocoon-demoCocoon – a flow-based workflow automation, data mining and visual analytics tool.
Stars: ✭ 19 (-60.42%)
meepo异构存储数据迁移
Stars: ✭ 29 (-39.58%)
veridical-flowMaking it easier to build stable, trustworthy data-science pipelines.
Stars: ✭ 28 (-41.67%)
tumbleweedLightweight workflow engine microservice implement BPMN 2.0
Stars: ✭ 23 (-52.08%)
omgfUse Git Flow with ease – maintain branches, semantic versioning, releases, and changelog with a single command.
Stars: ✭ 39 (-18.75%)
autThe Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (+131.25%)
drupal9ciOne-line installers for implementing Continuous Integration in Drupal 9
Stars: ✭ 137 (+185.42%)
goobi-workflowGoobi workflow - Workflow management software for digitisation projects used in more than 70 cultural heritage institutions in at least 17 countries.
Stars: ✭ 43 (-10.42%)
scoopi-scraperScoopi Web Scraper is a heavy duty tool to extract data from HTML pages.
Stars: ✭ 18 (-62.5%)
fastdata-clusterFast Data Cluster (Apache Cassandra, Kafka, Spark, Flink, YARN and HDFS with Vagrant and VirtualBox)
Stars: ✭ 20 (-58.33%)
NotselwynNotSelwyn's over-engineered automatic profile readme
Stars: ✭ 15 (-68.75%)
XLearning-GPUqihoo360 xlearning with GPU support; AI on Hadoop
Stars: ✭ 22 (-54.17%)