All Projects → predictionio → Similar Projects or Alternatives

369 Open source projects that are alternatives of or similar to predictionio

cdp-service
cdp数据平台,帮助企业充分了解客户,实现千人千面的精准营销。
Stars: ✭ 30 (-99.76%)
Mutual labels:  big-data
Grouparoo
🦘 The Grouparoo Monorepo - open source customer data sync framework
Stars: ✭ 334 (-97.33%)
Mutual labels:  big-data
Gaffer
A large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (-86.87%)
Mutual labels:  big-data
Tez
Apache Tez
Stars: ✭ 313 (-97.5%)
Mutual labels:  big-data
dxram
A distributed in-memory key-value storage for billions of small objects.
Stars: ✭ 25 (-99.8%)
Mutual labels:  big-data
Delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Stars: ✭ 3,903 (-68.8%)
Mutual labels:  big-data
Tajo
Mirror of Apache Tajo
Stars: ✭ 128 (-98.98%)
Mutual labels:  big-data
Fluid
Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud
Stars: ✭ 265 (-97.88%)
Mutual labels:  big-data
sgd
An R package for large scale estimation with stochastic gradient descent
Stars: ✭ 55 (-99.56%)
Mutual labels:  big-data
Morpheus
Morpheus brings the leading graph query language, Cypher, onto the leading distributed processing platform, Spark.
Stars: ✭ 303 (-97.58%)
Mutual labels:  big-data
Feast
Feature Store for Machine Learning
Stars: ✭ 2,576 (-79.41%)
Mutual labels:  big-data
Couchdb Fauxton
Apache CouchDB
Stars: ✭ 295 (-97.64%)
Mutual labels:  big-data
classifai
🔥 One of the most comprehensive open-source data annotation platform.
Stars: ✭ 99 (-99.21%)
Mutual labels:  big-data
Smooks
An extensible Java framework for building XML and non-XML streaming applications
Stars: ✭ 293 (-97.66%)
Mutual labels:  big-data
Richdem
High-performance Terrain and Hydrology Analysis
Stars: ✭ 127 (-98.98%)
Mutual labels:  big-data
Flink
Apache Flink is an open source project of The Apache Software Foundation (ASF). The Apache Flink project originated from the Stratosphere research project.
Stars: ✭ 17,781 (+42.13%)
Mutual labels:  big-data
ytpriv
YT metadata exporter
Stars: ✭ 28 (-99.78%)
Mutual labels:  big-data
Trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (-63.38%)
Mutual labels:  big-data
Hazelcast Nodejs Client
Hazelcast IMDG Node.js Client
Stars: ✭ 124 (-99.01%)
Mutual labels:  big-data
Parquet Dotnet
🏐 Apache Parquet for modern .NET
Stars: ✭ 276 (-97.79%)
Mutual labels:  big-data
img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Stars: ✭ 1,173 (-90.62%)
Mutual labels:  big-data
Datahub
The Metadata Platform for the Modern Data Stack
Stars: ✭ 4,232 (-66.17%)
Mutual labels:  big-data
Scala Spark Tutorial
Project for James' Apache Spark with Scala course
Stars: ✭ 121 (-99.03%)
Mutual labels:  big-data
Mmlspark
Simple and Distributed Machine Learning
Stars: ✭ 2,899 (-76.83%)
Mutual labels:  big-data
scikit-learn-intelex
Intel(R) Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
Stars: ✭ 887 (-92.91%)
Mutual labels:  big-data
bigstatsr
R package for statistical tools with big matrices stored on disk.
Stars: ✭ 139 (-98.89%)
Mutual labels:  big-data
Hdfs Shell
HDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS
Stars: ✭ 117 (-99.06%)
Mutual labels:  big-data
insightedge
InsightEdge Core
Stars: ✭ 22 (-99.82%)
Mutual labels:  big-data
bigdata-fun
A complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-99.89%)
Mutual labels:  big-data
Cmak
CMAK is a tool for managing Apache Kafka clusters
Stars: ✭ 10,544 (-15.72%)
Mutual labels:  big-data
aut
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (-99.11%)
Mutual labels:  big-data
accumulo-docker
Apache Accumulo Docker
Stars: ✭ 17 (-99.86%)
Mutual labels:  big-data
Asakusafw
Asakusa Framework
Stars: ✭ 114 (-99.09%)
Mutual labels:  big-data
ibmpairs
open source tools for interaction with IBM PAIRS:
Stars: ✭ 23 (-99.82%)
Mutual labels:  big-data
GDLibrary
Matlab library for gradient descent algorithms: Version 1.0.1
Stars: ✭ 50 (-99.6%)
Mutual labels:  big-data
spark-acid
ACID Data Source for Apache Spark based on Hive ACID
Stars: ✭ 91 (-99.27%)
Mutual labels:  big-data
Pythondata
repo for code published on pythondata.com
Stars: ✭ 113 (-99.1%)
Mutual labels:  big-data
Sqoop
Mirror of Apache Sqoop
Stars: ✭ 817 (-93.47%)
Mutual labels:  big-data
bullet-core
Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Storm, Spark or Flink.
Stars: ✭ 36 (-99.71%)
Mutual labels:  big-data
vxquery
Mirror of Apache VXQuery
Stars: ✭ 19 (-99.85%)
Mutual labels:  big-data
Genie
Distributed Big Data Orchestration Service
Stars: ✭ 1,544 (-87.66%)
Mutual labels:  big-data
ByteSlice
"Byteslice: Pushing the envelop of main memory data processing with a new storage layout" (SIGMOD'15)
Stars: ✭ 24 (-99.81%)
Mutual labels:  big-data
Keyvi
Keyvi - a key value index that powers Cliqz search engine. It is an in-memory FST-based data structure highly optimized for size and lookup performance.
Stars: ✭ 171 (-98.63%)
Mutual labels:  big-data
Parquet Format
Apache Parquet
Stars: ✭ 800 (-93.61%)
Mutual labels:  big-data
hotmap
WebGL Heatmap Viewer for Big Data and Bioinformatics
Stars: ✭ 13 (-99.9%)
Mutual labels:  big-data
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-99.13%)
Mutual labels:  big-data
egis
Egis - a handy Ruby interface for AWS Athena
Stars: ✭ 38 (-99.7%)
Mutual labels:  big-data
incubator-tez
Mirror of Apache Tez (Incubating)
Stars: ✭ 60 (-99.52%)
Mutual labels:  big-data
big-sorter
Java library that sorts very large files of records by splitting into smaller sorted files and merging
Stars: ✭ 49 (-99.61%)
Mutual labels:  big-data
Tennis Crystal Ball
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-99.14%)
Mutual labels:  big-data
big data
A collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (-99.73%)
Mutual labels:  big-data
lcbo-api
A crawler and API server for Liquor Control Board of Ontario retail data
Stars: ✭ 152 (-98.78%)
Mutual labels:  big-data
Maha
A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.
Stars: ✭ 101 (-99.19%)
Mutual labels:  big-data
clusterdock
clusterdock is a framework for creating Docker-based container clusters
Stars: ✭ 26 (-99.79%)
Mutual labels:  big-data
opendc
Collaborative Datacenter Simulation and Exploration for Everybody
Stars: ✭ 40 (-99.68%)
Mutual labels:  big-data
SynapseML
Simple and Distributed Machine Learning
Stars: ✭ 3,355 (-73.18%)
Mutual labels:  big-data
xcast
A High-Performance Data Science Toolkit for the Earth Sciences
Stars: ✭ 28 (-99.78%)
Mutual labels:  big-data
beam-site
Apache Beam Site
Stars: ✭ 28 (-99.78%)
Mutual labels:  big-data
pyspark-algorithms
PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Stars: ✭ 72 (-99.42%)
Mutual labels:  big-data
Titanoboa
Titanoboa makes complex workflows easy. It is a low-code workflow orchestration platform for JVM - distributed, highly scalable and fault tolerant.
Stars: ✭ 787 (-93.71%)
Mutual labels:  big-data
301-360 of 369 similar projects