All Projects → joshday → Onlinestats.jl

joshday / Onlinestats.jl

Licence: mit
Single-pass algorithms for statistics

Programming Languages

julia
2034 projects

Projects that are alternatives of or similar to Onlinestats.jl

Tennis Crystal Ball
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-78.9%)
Mutual labels:  statistics, big-data
Data Accelerator
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (-51.28%)
Mutual labels:  big-data, streaming-data
Datumbox Framework
Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.
Stars: ✭ 1,063 (+109.66%)
Mutual labels:  statistics, big-data
Data Science Live Book
An open source book to learn data science, data analysis and machine learning, suitable for all ages!
Stars: ✭ 193 (-61.93%)
Mutual labels:  statistics, big-data
Miller
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Stars: ✭ 4,633 (+813.81%)
Mutual labels:  statistics, streaming-data
bigstatsr
R package for statistical tools with big matrices stored on disk.
Stars: ✭ 139 (-72.58%)
Mutual labels:  statistics, big-data
OnlineStatsBase.jl
Base types for OnlineStats.
Stars: ✭ 26 (-94.87%)
Mutual labels:  big-data, streaming-data
Datascience Ai Machinelearning Resources
Alex Castrounis' curated set of resources for artificial intelligence (AI), machine learning, data science, internet of things (IoT), and more.
Stars: ✭ 414 (-18.34%)
Mutual labels:  statistics, big-data
Hazelcast
Open-source distributed computation and storage platform
Stars: ✭ 4,662 (+819.53%)
Mutual labels:  big-data
Fit Sne
Fast Fourier Transform-accelerated Interpolation-based t-SNE (FIt-SNE)
Stars: ✭ 485 (-4.34%)
Mutual labels:  big-data
Conjure Up
Deploying complex solutions, magically.
Stars: ✭ 454 (-10.45%)
Mutual labels:  big-data
R
Exercises (incl. analyses) with R language (math+statistics)
Stars: ✭ 462 (-8.88%)
Mutual labels:  statistics
Scikit Multiflow
A machine learning package for streaming data in Python. The other ancestor of River.
Stars: ✭ 485 (-4.34%)
Mutual labels:  streaming-data
Courses
Quiz & Assignment of Coursera
Stars: ✭ 454 (-10.45%)
Mutual labels:  big-data
Pgm Index
🏅State-of-the-art learned data structure that enables fast lookup, predecessor, range searches and updates in arrays of billions of items using orders of magnitude less space than traditional indexes
Stars: ✭ 499 (-1.58%)
Mutual labels:  big-data
Gtsummary
Presentation-Ready Data Summary and Analytic Result Tables
Stars: ✭ 450 (-11.24%)
Mutual labels:  statistics
Data Science Ipython Notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+4248.72%)
Mutual labels:  big-data
Edward
A probabilistic programming language in TensorFlow. Deep generative models, variational inference.
Stars: ✭ 4,674 (+821.89%)
Mutual labels:  statistics
Stream Framework
Stream Framework is a Python library, which allows you to build news feed, activity streams and notification systems using Cassandra and/or Redis. The authors of Stream-Framework also provide a cloud service for feed technology:
Stars: ✭ 4,576 (+802.56%)
Mutual labels:  big-data
Python For Probability Statistics And Machine Learning
Jupyter Notebooks for Springer book "Python for Probability, Statistics, and Machine Learning"
Stars: ✭ 481 (-5.13%)
Mutual labels:  statistics

Online Algorithms for Statistics, Models, and Big Data Viz

Online algorithms are well suited for streaming data or when data is too large to hold in memory. OnlineStats processes observations one by one and all algorithms use O(1) memory.

Docs Build Test Citation
Build Status codecov DOI

Quickstart

import Pkg

Pkg.add("OnlineStats")

using OnlineStats

o = Series(Mean(), Variance(), P2Quantile(), Extrema())

fit!(o, 1.0)

fit!(o, randn(10^6))

Documentation

Contributing

  • Trivial PRs such as fixing typos are very welcome!
  • For nontrivial changes, you'll probably want to first discuss the changes via issue/email/slack with @joshday.

Authors

See also the list of contributors to OnlineStats.

License

MIT

Packages Using OnlineStats/OnlineStatsBase

See JuliaHub:

deps

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].