az-ml-batch-scoreDeploying a Batch Scoring Pipeline for Python Models
Stars: ✭ 17 (-54.05%)
DataXServer为DataX(https://github.com/alibaba/DataX) 提供远程多语言调用(ThriftServer,HttpServer) 分布式运行(DataX on YARN) 功能
Stars: ✭ 130 (+251.35%)
HeroesMatchTrackerHeroes of the Storm match tracker for personal statistics
Stars: ✭ 59 (+59.46%)
es2postgresElasticSearch to PostgreSQL loader
Stars: ✭ 18 (-51.35%)
forestErrorA Unified Framework for Random Forest Prediction Error Estimation
Stars: ✭ 23 (-37.84%)
kf2-magicked-admin🕷️ Mutator-free management, statistics, and in-game bot for ranked Killing Floor 2 servers
Stars: ✭ 27 (-27.03%)
Expectations.jlExpectation operators for Distributions.jl objects
Stars: ✭ 50 (+35.14%)
hdfeNo description or website provided.
Stars: ✭ 22 (-40.54%)
gallia-coreA schema-aware Scala library for data transformation
Stars: ✭ 44 (+18.92%)
rsienaAn R package for Simulation Investigation for Empirical Network Analysis
Stars: ✭ 56 (+51.35%)
oesophagusEnterprise Grade Single-Step Streaming Data Infrastructure Setup. (Under Development)
Stars: ✭ 12 (-67.57%)
roc comparisonThe fast version of DeLong's method for computing the covariance of unadjusted AUC.
Stars: ✭ 83 (+124.32%)
carryPython ETL(Extract-Transform-Load) tool / Data migration tool
Stars: ✭ 115 (+210.81%)
infantryRun MapReduce in user's browser.
Stars: ✭ 14 (-62.16%)
data-science-notesOpen-source project hosted at https://makeuseofdata.com to crowdsource a robust collection of notes related to data science (math, visualization, modeling, etc)
Stars: ✭ 52 (+40.54%)
kafka-connect-datagenA Kafka Connect source connector that generates data for tests
Stars: ✭ 27 (-27.03%)
future.callr🚀 R package future.callr: A Future API for Parallel Processing using 'callr'
Stars: ✭ 52 (+40.54%)
foremast-brainForemast-brain is a component of Foremast project.
Stars: ✭ 17 (-54.05%)
dswarman open-source data management platform for knowledge workers (https://github.com/dswarm/dswarm-documentation/wiki)
Stars: ✭ 57 (+54.05%)
retrosheetProject to parse retrosheet baseball data in python
Stars: ✭ 19 (-48.65%)
vtuber-livechat-dataset📊 VTuber 1B: Billion-scale Live Chat and Moderation Event Dataset for NLP
Stars: ✭ 30 (-18.92%)
math-statsA small library that does the statistics for your numbers.
Stars: ✭ 18 (-51.35%)
gitstatssimple statistical analysis tool for git repositories
Stars: ✭ 16 (-56.76%)
veridical-flowMaking it easier to build stable, trustworthy data-science pipelines.
Stars: ✭ 28 (-24.32%)
btsaBerlin Time Series Analysis Repository
Stars: ✭ 60 (+62.16%)
ciencia datosEl curso en español, de acceso abierto y gratuito más grande del mundo sobre Ciencia de Datos en salud.
Stars: ✭ 66 (+78.38%)
dmlR package for Distance Metric Learning
Stars: ✭ 58 (+56.76%)
astroAstro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (+113.51%)
lineageGenerate beautiful documentation for your data pipelines in markdown format
Stars: ✭ 16 (-56.76%)
Algorithmic-TradingI have been deeply interested in algorithmic trading and systematic trading algorithms. This Repository contains the code of what I have learnt on the way. It starts form some basic simple statistics and will lead up to complex machine learning algorithms.
Stars: ✭ 47 (+27.03%)
ballpark-trackerA simple application used for tracking which MLB and AAA stadiums a "Ballpark Chaser" has been to.
Stars: ✭ 15 (-59.46%)
sparklanesA lightweight data processing framework for Apache Spark
Stars: ✭ 17 (-54.05%)
scanstatisticsAn R package for space-time anomaly detection using scan statistics.
Stars: ✭ 41 (+10.81%)
baseballstatsBaseball win expectancy and expected runs per inning calculators
Stars: ✭ 23 (-37.84%)
yt-channels-DS-AI-ML-CSA comprehensive list of 180+ YouTube Channels for Data Science, Data Engineering, Machine Learning, Deep learning, Computer Science, programming, software engineering, etc.
Stars: ✭ 1,038 (+2705.41%)
maxwell-sinkconsume maxwell generated message from kafka,export it to another mysql.
Stars: ✭ 16 (-56.76%)
tics🎢 Simple self-hosted analytics ideal for Express / React Native stacks
Stars: ✭ 22 (-40.54%)
k9Self-Taught Data Science
Stars: ✭ 25 (-32.43%)
persistityA persistence framework for game developers
Stars: ✭ 34 (-8.11%)
batter-pitcher-2vecA model for learning distributed representations of MLB players.
Stars: ✭ 75 (+102.7%)
snapSnap Programming Language
Stars: ✭ 20 (-45.95%)
TEAMThe Taxonomy for ETL Automation Metadata (TEAM) is a metadata management tool for data warehouse automation. It is part of the ecosystem for data warehouse automation, alongside the Virtual Data Warehouse pattern manager and the generic schema for Data Warehouse Automation.
Stars: ✭ 27 (-27.03%)
kozaData transformation framework for LinkML data models
Stars: ✭ 21 (-43.24%)
AlgorithmsFree hands-on course with the implementation (in Python) and description of several computational, mathematical and statistical algorithms.
Stars: ✭ 117 (+216.22%)
mathlionMathlion is an advanced math plugin for Kibana's Timelion
Stars: ✭ 77 (+108.11%)
openrefine-clientThe OpenRefine Python Client from Paul Makepeace provides a library for communicating with an OpenRefine server. This fork extends the command line interface (CLI) and is distributed as a convenient one-file-executable (Windows, Linux, Mac). It is also available via Docker Hub, PyPI and Binder.
Stars: ✭ 67 (+81.08%)
procstatEasy way to expose process internal state to filesystem using fuse.
Stars: ✭ 14 (-62.16%)
wrapperrWebsite and API that collects Plex statistics using Tautulli and displays it. Similar to the Spotify Wrapped concept.
Stars: ✭ 93 (+151.35%)