joshday / Onlinestats.jl
Licence: mit
Single-pass algorithms for statistics
Stars: ✭ 507
Programming Languages
julia
2034 projects
Projects that are alternatives of or similar to Onlinestats.jl
Tennis Crystal Ball
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-78.9%)
Mutual labels: statistics, big-data
Data Accelerator
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (-51.28%)
Mutual labels: big-data, streaming-data
Datumbox Framework
Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.
Stars: ✭ 1,063 (+109.66%)
Mutual labels: statistics, big-data
Data Science Live Book
An open source book to learn data science, data analysis and machine learning, suitable for all ages!
Stars: ✭ 193 (-61.93%)
Mutual labels: statistics, big-data
Miller
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Stars: ✭ 4,633 (+813.81%)
Mutual labels: statistics, streaming-data
bigstatsr
R package for statistical tools with big matrices stored on disk.
Stars: ✭ 139 (-72.58%)
Mutual labels: statistics, big-data
OnlineStatsBase.jl
Base types for OnlineStats.
Stars: ✭ 26 (-94.87%)
Mutual labels: big-data, streaming-data
Datascience Ai Machinelearning Resources
Alex Castrounis' curated set of resources for artificial intelligence (AI), machine learning, data science, internet of things (IoT), and more.
Stars: ✭ 414 (-18.34%)
Mutual labels: statistics, big-data
Hazelcast
Open-source distributed computation and storage platform
Stars: ✭ 4,662 (+819.53%)
Mutual labels: big-data
Fit Sne
Fast Fourier Transform-accelerated Interpolation-based t-SNE (FIt-SNE)
Stars: ✭ 485 (-4.34%)
Mutual labels: big-data
R
Exercises (incl. analyses) with R language (math+statistics)
Stars: ✭ 462 (-8.88%)
Mutual labels: statistics
Scikit Multiflow
A machine learning package for streaming data in Python. The other ancestor of River.
Stars: ✭ 485 (-4.34%)
Mutual labels: streaming-data
Pgm Index
🏅State-of-the-art learned data structure that enables fast lookup, predecessor, range searches and updates in arrays of billions of items using orders of magnitude less space than traditional indexes
Stars: ✭ 499 (-1.58%)
Mutual labels: big-data
Gtsummary
Presentation-Ready Data Summary and Analytic Result Tables
Stars: ✭ 450 (-11.24%)
Mutual labels: statistics
Data Science Ipython Notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+4248.72%)
Mutual labels: big-data
Edward
A probabilistic programming language in TensorFlow. Deep generative models, variational inference.
Stars: ✭ 4,674 (+821.89%)
Mutual labels: statistics
Stream Framework
Stream Framework is a Python library, which allows you to build news feed, activity streams and notification systems using Cassandra and/or Redis. The authors of Stream-Framework also provide a cloud service for feed technology:
Stars: ✭ 4,576 (+802.56%)
Mutual labels: big-data
Python For Probability Statistics And Machine Learning
Jupyter Notebooks for Springer book "Python for Probability, Statistics, and Machine Learning"
Stars: ✭ 481 (-5.13%)
Mutual labels: statistics
Online Algorithms for Statistics, Models, and Big Data Viz
Online algorithms are well suited for streaming data or when data is too large to hold in memory. OnlineStats processes observations one by one and all algorithms use O(1) memory.
Docs | Build | Test | Citation |
---|---|---|---|
Quickstart
import Pkg
Pkg.add("OnlineStats")
using OnlineStats
o = Series(Mean(), Variance(), P2Quantile(), Extrema())
fit!(o, 1.0)
fit!(o, randn(10^6))
Documentation
Contributing
- Trivial PRs such as fixing typos are very welcome!
- For nontrivial changes, you'll probably want to first discuss the changes via issue/email/slack with
@joshday
.
Authors
- Primary Author: Josh Day (@joshday)
- Significant early contributions from Tom Breloff (@tbreloff)
- Many algorithms developed under mentorship of Hua Zhou (@Hua-Zhou)
See also the list of contributors to OnlineStats.
License
OnlineStatsBase
Packages Using OnlineStats/See JuliaHub:
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].