Helical Insight is the world’s first open source business intelligence framework, which helps you make sense of your data and make well-informed decisions.

Stars: ✭ 214 (+69.84%)

Mutual labels: big-data

Onlinestats.jl

Single-pass algorithms for statistics

Stars: ✭ 507 (+302.38%)

Mutual labels: big-data

hyper-engine

Python library for Bayesian hyper-parameters optimization

Stars: ✭ 80 (-36.51%)

Mutual labels: big-data

Panoptes

A Global Scale Network Telemetry Ecosystem

Stars: ✭ 80 (-36.51%)

Mutual labels: big-data

Couchdb Documentation

Apache CouchDB Documentation

Stars: ✭ 128 (+1.59%)

Mutual labels: big-data

Magellan

Geo Spatial Data Analytics on Spark

Stars: ✭ 507 (+302.38%)

Mutual labels: big-data

falcon

Mirror of Apache Falcon

Stars: ✭ 95 (-24.6%)

Mutual labels: big-data

Iotdb

Apache IoTDB

Stars: ✭ 1,221 (+869.05%)

Mutual labels: big-data

Pgm Index

🏅 State-of-the-art learned data structure that enables fast lookup, predecessor, range searches and updates in arrays of billions of items using orders of magnitude less space than traditional indexes

Stars: ✭ 499 (+296.03%)

Mutual labels: big-data

Attic Predictionio Sdk Python

PredictionIO Python SDK

Stars: ✭ 196 (+55.56%)

Mutual labels: big-data

wrangler

Wrangler Transform: A DMD system for transforming Big Data

Stars: ✭ 63 (-50%)

Mutual labels: big-data

Attic Predictionio Template Recommender

PredictionIO Recommendation Engine Template (Scala-based parallelized engine)

Stars: ✭ 78 (-38.1%)

Mutual labels: big-data

scSeqR

This package has migrated to https://github.com/rezakj/iCellR; please use iCellR instead of scSeqR for more functionality and updates.

Stars: ✭ 16 (-87.3%)

Mutual labels: clustering

Machine-Learning-Algorithms

All Machine Learning Algorithms

Stars: ✭ 24 (-80.95%)

Mutual labels: clustering

Tajo

Mirror of Apache Tajo

Stars: ✭ 128 (+1.59%)

Mutual labels: big-data

Stream Framework

Stream Framework is a Python library that lets you build news feeds, activity streams, and notification systems using Cassandra and/or Redis. The authors of Stream-Framework also provide a cloud service for feed technology.

Stars: ✭ 4,576 (+3531.75%)

Mutual labels: big-data

check-engine

Data validation library for PySpark 3.0.0

Stars: ✭ 29 (-76.98%)

Mutual labels: big-data

Labs

Research on distributed systems

Stars: ✭ 73 (-42.06%)

Mutual labels: big-data

classifai

🔥 One of the most comprehensive open-source data annotation platforms.

Stars: ✭ 99 (-21.43%)

Mutual labels: big-data

Data Science Live Book

An open source book to learn data science, data analysis and machine learning, suitable for all ages!

Stars: ✭ 193 (+53.17%)

Mutual labels: big-data

storm-ml

An online learning algorithm library for Storm

Stars: ✭ 18 (-85.71%)

Mutual labels: big-data

My Journey In The Data Science World

📢 Ready to learn or review your knowledge!

Stars: ✭ 1,175 (+832.54%)

Mutual labels: big-data

TT Tech Space

TT Tech Research Notes

Stars: ✭ 21 (-83.33%)

Mutual labels: big-data

ByteSlice

"Byteslice: Pushing the envelop of main memory data processing with a new storage layout" (SIGMOD'15)

Stars: ✭ 24 (-80.95%)

Mutual labels: big-data

Appdocs

Application Performance Optimization Summary

Stars: ✭ 1,169 (+827.78%)

Mutual labels: big-data

Gun

An open source cybersecurity protocol for syncing decentralized graph data.

Stars: ✭ 15,172 (+11941.27%)

Mutual labels: big-data

Fit Sne

Fast Fourier Transform-accelerated Interpolation-based t-SNE (FIt-SNE)

Stars: ✭ 485 (+284.92%)

Mutual labels: big-data

Carbondata

Mirror of Apache CarbonData

Stars: ✭ 1,158 (+819.05%)

Mutual labels: big-data

accumulo-docker

Apache Accumulo Docker

Stars: ✭ 17 (-86.51%)

Mutual labels: big-data

Azuredatalake

Samples and Docs for Azure Data Lake Store and Analytics

Stars: ✭ 128 (+1.59%)

Mutual labels: big-data

Redislite

Redis in a Python module.

Stars: ✭ 464 (+268.25%)

Mutual labels: big-data

LoL-Match-Prediction

Win probability predictions for League of Legends matches using neural networks

Stars: ✭ 34 (-73.02%)

Mutual labels: big-data

Flink Shaded

Apache Flink shaded artifacts repository

Stars: ✭ 67 (-46.83%)

Mutual labels: big-data

insightedge

InsightEdge Core

Stars: ✭ 22 (-82.54%)

Mutual labels: big-data

Flume

Mirror of Apache Flume

Stars: ✭ 2,200 (+1646.03%)

Mutual labels: big-data

incubator-liminal

Apache Liminal's goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful experiment to an automated pipeline of model training, validation, deployment and inference in production. Liminal provides a Domain Specific Language to build ML workflows on top of Apache Airflow.

Stars: ✭ 117 (-7.14%)

Mutual labels: big-data

Cloud Volume

Read and write Neuroglancer datasets programmatically.

Stars: ✭ 63 (-50%)

Mutual labels: big-data

beekeeper

Service for automatically managing and cleaning up unreferenced data

Stars: ✭ 43 (-65.87%)

Mutual labels: big-data

optimus

🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

Stars: ✭ 1,351 (+972.22%)

Mutual labels: bigdata

MAL-Map

Cluster and visualize relationships between anime on MyAnimeList

Stars: ✭ 201 (+59.52%)

Mutual labels: clustering

dmmclust

dmmclust is a package for clustering short texts, based on Yin and Wang (2014)

Stars: ✭ 23 (-81.75%)

Mutual labels: clustering

Feast

Feature Store for Machine Learning

Stars: ✭ 2,576 (+1944.44%)

Mutual labels: big-data

Courses

Quizzes and assignments from Coursera

Stars: ✭ 454 (+260.32%)

Mutual labels: big-data

Conjure Up

Deploying complex solutions, magically.

Stars: ✭ 454 (+260.32%)

Mutual labels: big-data

Griffon Vm

Griffon Data Science Virtual Machine

Stars: ✭ 128 (+1.59%)

Mutual labels: big-data

Data Science Ipython Notebooks

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

Stars: ✭ 22,048 (+17398.41%)

Mutual labels: big-data

Clickhouse

ClickHouse® is a free analytics DBMS for big data

Stars: ✭ 21,089 (+16637.3%)

Mutual labels: big-data
