Hands On DevopsA hands-on DevOps course covering the culture, methods and repeated practices of modern software development involving Packer, Vagrant, VirtualBox, Ansible, Kubernetes, K3s, MetalLB, Traefik, Docker-Compose, Docker, Taiga, GitLab, Drone CI, SonarQube, Selenium, InSpec, Alpine 3.10, Ubuntu-bionic, CentOS 7...
Stars: ✭ 196 (+1052.94%)
cpp-can-isotpC++ implementation of CAN ISO 15765-2 also known as CAN ISO transport protocol. CPP CAN isotp.
Stars: ✭ 14 (-17.65%)
rec-coreData pipelining service
Stars: ✭ 19 (+11.76%)
gamechanger-dataGAMECHANGER aspires to be the Department’s trusted solution for evidence-based, data-driven decision-making across the universe of DoD requirements
Stars: ✭ 17 (+0%)
ZumiszUMIs: A fast and flexible pipeline to process RNA sequencing data with UMIs
Stars: ✭ 178 (+947.06%)
cardano-pyPython3 lib and cli for operating a Cardano Passive Node and using the API's. (PRE-ALPHA)
Stars: ✭ 17 (+0%)
blockchain-etl-streamingStreaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (+235.29%)
Pypyrpypyr task-runner cli & api for automation pipelines. Automate anything by combining commands, different scripts in different languages & applications into one pipeline process.
Stars: ✭ 173 (+917.65%)
TEAMThe Taxonomy for ETL Automation Metadata (TEAM) is a metadata management tool for data warehouse automation. It is part of the ecosystem for data warehouse automation, alongside the Virtual Data Warehouse pattern manager and the generic schema for Data Warehouse Automation.
Stars: ✭ 27 (+58.82%)
stargateAn Apache Pulsar client written in Elixir
Stars: ✭ 33 (+94.12%)
kozaData transformation framework for LinkML data models
Stars: ✭ 21 (+23.53%)
Rnaseq WorkflowA repository for setting up a RNAseq workflow
Stars: ✭ 170 (+900%)
oesophagusEnterprise Grade Single-Step Streaming Data Infrastructure Setup. (Under Development)
Stars: ✭ 12 (-29.41%)
pyspark-ML-in-ColabPyspark in Google Colab: A simple machine learning (Linear Regression) model
Stars: ✭ 32 (+88.24%)
DataXServer为DataX(https://github.com/alibaba/DataX) 提供远程多语言调用(ThriftServer,HttpServer) 分布式运行(DataX on YARN) 功能
Stars: ✭ 130 (+664.71%)
PlexOpen Source Pipeline for Maya, Houdini, 3ds Max and Nuke .
Stars: ✭ 170 (+900%)
MIPS-pipeline-processorA pipelined implementation of the MIPS processor featuring hazard detection as well as forwarding
Stars: ✭ 92 (+441.18%)
Spark PracticeApache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (+1076.47%)
Unity resourcesA list of resources and tutorials for those doing programming in Unity.
Stars: ✭ 170 (+900%)
SparkoraPowerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟
Stars: ✭ 51 (+200%)
LinkisLinkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+13564.71%)
Cloud Dev云研发,是一种生于云上的闭环 + 代码化的软件开发方式。它可以让业务人员、开发人员、运营人员等在同一个云端共同协作、透明化地完成整个软件的生命周期(需求、设计、编码、构建、部署、运营),而非相互隔离,又或者是借助于多个软件才能完成工作。
Stars: ✭ 164 (+864.71%)
bacannotGeneric but comprehensive pipeline for prokaryotic genome annotation and interrogation with interactive reports and shiny app.
Stars: ✭ 51 (+200%)
Cc PysparkProcess Common Crawl data with Python and Spark
Stars: ✭ 147 (+764.71%)
OperatorKubernetes operator to manage installation, updation and uninstallation of tektoncd projects (pipeline, …)
Stars: ✭ 161 (+847.06%)
Repo 2019BERT, AWS RDS, AWS Forecast, EMR Spark Cluster, Hive, Serverless, Google Assistant + Raspberry Pi, Infrared, Google Cloud Platform Natural Language, Anomaly detection, Tensorflow, Mathematics
Stars: ✭ 133 (+682.35%)
ngs pipelineExome/Capture/RNASeq Pipeline Implementation using snakemake
Stars: ✭ 40 (+135.29%)
Pyspark Cheatsheet🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (+535.29%)
Spacy Wordnetspacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface
Stars: ✭ 156 (+817.65%)
Pyspark StubsApache (Py)Spark type annotations (stub files).
Stars: ✭ 98 (+476.47%)
Apos.ContentContent builder library for MonoGame.
Stars: ✭ 14 (-17.65%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+7770.59%)
EctsElastic Crontab System 简单易用的分布式定时任务管理系统
Stars: ✭ 156 (+817.65%)
Bitcoin Value Predictor[NOT MAINTAINED] Predicting Bit coin price using Time series analysis and sentiment analysis of tweets on bitcoin
Stars: ✭ 91 (+435.29%)
W2vWord2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (+276.47%)
PetastormPetastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
Stars: ✭ 1,108 (+6417.65%)
MTBseq sourceMTBseq is an automated pipeline for mapping, variant calling and detection of resistance mediating and phylogenetic variants from illumina whole genome sequence data of Mycobacterium tuberculosis complex isolates.
Stars: ✭ 26 (+52.94%)
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+5700%)
proposal-hack-pipesOld specification for Hack pipes in JavaScript. Please go to the new specification.
Stars: ✭ 87 (+411.76%)
Live log analyzer sparkSpark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.
Stars: ✭ 14 (-17.65%)
needlestackMulti-sample somatic variant caller
Stars: ✭ 45 (+164.71%)
katana-skipperSimple and flexible ML workflow engine
Stars: ✭ 234 (+1276.47%)
MotorwayCloud ready pure-python streaming data pipeline library
Stars: ✭ 150 (+782.35%)
flamingoFreeCAD - flamingo workbench
Stars: ✭ 30 (+76.47%)
kafka-connect-datagenA Kafka Connect source connector that generates data for tests
Stars: ✭ 27 (+58.82%)
mech🦾 Main repository for the Mech programming language. Start here!
Stars: ✭ 135 (+694.12%)
DaFlowApache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (+41.18%)
prospectrR package: Misc. Functions for Processing and Sample Selection of Spectroscopic Data
Stars: ✭ 26 (+52.94%)