Open-source project hosted at https://makeuseofdata.com to crowdsource a robust collection of notes related to data science (math, visualization, modeling, etc)

Stars: ✭ 52 (+40.54%)

Mutual labels: statistics

kafka-connect-datagen

A Kafka Connect source connector that generates data for tests

Stars: ✭ 27 (-27.03%)

Mutual labels: etl

Data-Science-and-Machine-Learning-Resources

List of Data Science and Machine Learning resources that I frequently use

Stars: ✭ 19 (-48.65%)

Mutual labels: statistics

future.callr

🚀 R package future.callr: A Future API for Parallel Processing using 'callr'

Stars: ✭ 52 (+40.54%)

Mutual labels: parallel-processing

foremast-brain

Foremast-brain is a component of the Foremast project.

Stars: ✭ 17 (-54.05%)

Mutual labels: statistics

dswarm

an open-source data management platform for knowledge workers (https://github.com/dswarm/dswarm-documentation/wiki)

Stars: ✭ 57 (+54.05%)

Mutual labels: etl

retrosheet

Project to parse retrosheet baseball data in python

Stars: ✭ 19 (-48.65%)

Mutual labels: baseball

awesome-datascience-python

Awesome list for Data Science and Python. 🐍

Stars: ✭ 62 (+67.57%)

Mutual labels: statistics

web-click-flow

Offline log analysis of website clickstreams

Stars: ✭ 14 (-62.16%)

Mutual labels: etl

vtuber-livechat-dataset

📊 VTuber 1B: Billion-scale Live Chat and Moderation Event Dataset for NLP

Stars: ✭ 30 (-18.92%)

Mutual labels: statistics

FantasyPremierLeague.py

⚽ Statistics for your mini leagues.

Stars: ✭ 123 (+232.43%)

Mutual labels: statistics

math-stats

A small library that does the statistics for your numbers.

Stars: ✭ 18 (-51.35%)

Mutual labels: statistics

gitstats

simple statistical analysis tool for git repositories

Stars: ✭ 16 (-56.76%)

Mutual labels: statistics

veridical-flow

Making it easier to build stable, trustworthy data-science pipelines.

Stars: ✭ 28 (-24.32%)

Mutual labels: statistics

btsa

Berlin Time Series Analysis Repository

Stars: ✭ 60 (+62.16%)

Mutual labels: statistics

ciencia datos

The world's largest free, open-access Spanish-language course on Data Science in health.

Stars: ✭ 66 (+78.38%)

Mutual labels: statistics

stats for soil survey

S4SS: Statistics for Soil Survey

Stars: ✭ 21 (-43.24%)

Mutual labels: statistics

dml

R package for Distance Metric Learning

Stars: ✭ 58 (+56.76%)

Mutual labels: statistics

astro

Astro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.

Stars: ✭ 79 (+113.51%)

Mutual labels: etl

GeomMLBStadiums

Geoms to draw MLB stadiums in ggplot2

Stars: ✭ 44 (+18.92%)

Mutual labels: baseball

lineage

Generate beautiful documentation for your data pipelines in markdown format

Stars: ✭ 16 (-56.76%)

Mutual labels: etl

Algorithmic-Trading

I have been deeply interested in algorithmic trading and systematic trading algorithms. This repository contains the code for what I have learnt along the way. It starts from some basic statistics and will lead up to complex machine learning algorithms.

Stars: ✭ 47 (+27.03%)

Mutual labels: statistics

ballpark-tracker

A simple application used for tracking which MLB and AAA stadiums a "Ballpark Chaser" has been to.

Stars: ✭ 15 (-59.46%)

Mutual labels: baseball

spdr-etf-holdings

ETL for the SPDR ETF holdings XLS documents

Stars: ✭ 14 (-62.16%)

Mutual labels: etl

sparklanes

A lightweight data processing framework for Apache Spark

Stars: ✭ 17 (-54.05%)

Mutual labels: etl

scanstatistics

An R package for space-time anomaly detection using scan statistics.

Stars: ✭ 41 (+10.81%)

Mutual labels: statistics

baseballstats

Baseball win expectancy and expected runs per inning calculators

Stars: ✭ 23 (-37.84%)

Mutual labels: baseball

yt-channels-DS-AI-ML-CS

A comprehensive list of 180+ YouTube Channels for Data Science, Data Engineering, Machine Learning, Deep learning, Computer Science, programming, software engineering, etc.

Stars: ✭ 1,038 (+2705.41%)

Mutual labels: statistics

maxwell-sink

Consumes Maxwell-generated messages from Kafka and exports them to another MySQL instance.

Stars: ✭ 16 (-56.76%)

Mutual labels: etl

tics

🎢 Simple self-hosted analytics ideal for Express / React Native stacks

Stars: ✭ 22 (-40.54%)

Mutual labels: statistics

redis-connect-dist

Real-Time Event Streaming & Change Data Capture

Stars: ✭ 21 (-43.24%)

Mutual labels: etl

Self-Taught Data Science

Stars: ✭ 25 (-32.43%)

Mutual labels: statistics

persistity

A persistence framework for game developers

Stars: ✭ 34 (-8.11%)

Mutual labels: etl

batter-pitcher-2vec

A model for learning distributed representations of MLB players.

Stars: ✭ 75 (+102.7%)

Mutual labels: baseball

snap

Snap Programming Language

Stars: ✭ 20 (-45.95%)

Mutual labels: parallel-processing

TEAM

The Taxonomy for ETL Automation Metadata (TEAM) is a metadata management tool for data warehouse automation. It is part of the ecosystem for data warehouse automation, alongside the Virtual Data Warehouse pattern manager and the generic schema for Data Warehouse Automation.

Stars: ✭ 27 (-27.03%)

Mutual labels: etl

koza

Data transformation framework for LinkML data models

Stars: ✭ 21 (-43.24%)

Mutual labels: etl

Algorithms

Free hands-on course with the implementation (in Python) and description of several computational, mathematical and statistical algorithms.

Stars: ✭ 117 (+216.22%)

Mutual labels: statistics

mathlion

Mathlion is an advanced math plugin for Kibana's Timelion

Stars: ✭ 77 (+108.11%)

Mutual labels: statistics

openrefine-client

The OpenRefine Python Client from Paul Makepeace provides a library for communicating with an OpenRefine server. This fork extends the command line interface (CLI) and is distributed as a convenient one-file-executable (Windows, Linux, Mac). It is also available via Docker Hub, PyPI and Binder.

Stars: ✭ 67 (+81.08%)

Mutual labels: etl

procstat

Easy way to expose process internal state to filesystem using fuse.

Stars: ✭ 14 (-62.16%)

Mutual labels: statistics

wrapperr

Website and API that collects Plex statistics using Tautulli and displays them. Similar to the Spotify Wrapped concept.

Stars: ✭ 93 (+151.35%)

Mutual labels: statistics

1-60 of 647 similar projects
