Presto Go ClientA Presto client for the Go programming language.
Stars: ✭ 183 (+553.57%)
Stream FrameworkStream Framework is a Python library, which allows you to build news feed, activity streams and notification systems using Cassandra and/or Redis. The authors of Stream-Framework also provide a cloud service for feed technology:
Stars: ✭ 4,576 (+16242.86%)
saas-react-starter-kit-boilerplateSaaStr is a React SaaS boilerplate to kickstart your new SaaS adventure as fast as possible. Built on top of Adonis JS for the BackEnd and React Starter Kit for the Front-End
Stars: ✭ 100 (+257.14%)
RedisliteRedis in a python module.
Stars: ✭ 464 (+1557.14%)
Bigdata PlaygroundA complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (+532.14%)
CoursesQuiz & Assignment of Coursera
Stars: ✭ 454 (+1521.43%)
Data Science Ipython NotebooksData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+78642.86%)
KeyviKeyvi - a key value index that powers Cliqz search engine. It is an in-memory FST-based data structure highly optimized for size and lookup performance.
Stars: ✭ 171 (+510.71%)
CortxCORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (+1421.43%)
Vehicle TrackingOpenCV 3 & Keras implementation of vehicle tracking with video data.
Stars: ✭ 112 (+300%)
Datascience Ai Machinelearning ResourcesAlex Castrounis' curated set of resources for artificial intelligence (AI), machine learning, data science, internet of things (IoT), and more.
Stars: ✭ 414 (+1378.57%)
GeopysparkGeoTrellis for PySpark
Stars: ✭ 167 (+496.43%)
Cogcomp NlpCogComp's Natural Language Processing libraries and Demos:
Stars: ✭ 410 (+1364.29%)
couchdb-pkgApache CouchDB Packaging support files
Stars: ✭ 24 (-14.29%)
recurlyA Recurly API client written in golang. Actively maintained and unit tested. No external dependencies.
Stars: ✭ 40 (+42.86%)
Kibble 1Apache Kibble - a tool to collect, aggregate and visualize data about any software project
Stars: ✭ 54 (+92.86%)
FluoApache Fluo
Stars: ✭ 159 (+467.86%)
OrcApache ORC - the smallest, fastest columnar storage for Hadoop workloads
Stars: ✭ 389 (+1289.29%)
HiveApache Hive
Stars: ✭ 4,031 (+14296.43%)
GeniA Clojure dataframe library that runs on Spark
Stars: ✭ 152 (+442.86%)
MetorikkuA simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+1189.29%)
cdp-servicecdp数据平台,帮助企业充分了解客户,实现千人千面的精准营销。
Stars: ✭ 30 (+7.14%)
SylphStream computing platform for bigdata
Stars: ✭ 362 (+1192.86%)
DatasciencevmTools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
Stars: ✭ 153 (+446.43%)
VespaThe open big data serving engine. https://vespa.ai
Stars: ✭ 3,747 (+13282.14%)
big-data-upfRECSM-UPF Summer School: Social Media and Big Data Research
Stars: ✭ 21 (-25%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+435.71%)
sgdAn R package for large scale estimation with stochastic gradient descent
Stars: ✭ 55 (+96.43%)
Grouparoo🦘 The Grouparoo Monorepo - open source customer data sync framework
Stars: ✭ 334 (+1092.86%)
100daysofmlcodeMy journey to learn and grow in the domain of Machine Learning and Artificial Intelligence by performing the #100DaysofMLCode Challenge.
Stars: ✭ 146 (+421.43%)
TezApache Tez
Stars: ✭ 313 (+1017.86%)
HadoopDedup🍉基于Hadoop和HBase的大规模海量数据去重
Stars: ✭ 27 (-3.57%)
DeltaAn open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Stars: ✭ 3,903 (+13839.29%)
MetamodelMirror of Apache Metamodel
Stars: ✭ 143 (+410.71%)
FluidFluid, elastic data abstraction and acceleration for BigData/AI applications in cloud
Stars: ✭ 265 (+846.43%)
ytprivYT metadata exporter
Stars: ✭ 28 (+0%)
MorpheusMorpheus brings the leading graph query language, Cypher, onto the leading distributed processing platform, Spark.
Stars: ✭ 303 (+982.14%)
SmooksAn extensible Java framework for building XML and non-XML streaming applications
Stars: ✭ 293 (+946.43%)
Eel SdkBig Data Toolkit for the JVM
Stars: ✭ 140 (+400%)
FlinkApache Flink is an open source project of The Apache Software Foundation (ASF).
The Apache Flink project originated from the Stratosphere research project.
Stars: ✭ 17,781 (+63403.57%)
scikit-learn-intelexIntel(R) Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
Stars: ✭ 887 (+3067.86%)
Sparkling GraphSparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.
Stars: ✭ 139 (+396.43%)
Parquet Dotnet🏐 Apache Parquet for modern .NET
Stars: ✭ 276 (+885.71%)
pmOCRA wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR conversion on file activity
Stars: ✭ 53 (+89.29%)
Lifion KinesisA native Node.js producer and consumer library for Amazon Kinesis Data Streams
Stars: ✭ 54 (+92.86%)
FlameStreamDistributed stream processing model and its implementation
Stars: ✭ 14 (-50%)
ccn-liteCCN-lite, a lightweight implementation of the CCNx protocol and its variations
Stars: ✭ 71 (+153.57%)
datartDatart is a next generation Data Visualization Open Platform
Stars: ✭ 1,042 (+3621.43%)
ngmswissgeol.ch gives you insight in geoscientific data - above and below the surface.
Stars: ✭ 23 (-17.86%)
gps-utilGPS related functionalities for nodejs
Stars: ✭ 31 (+10.71%)
F1-demoReal-time vehicle telematics analytics demo using OmniSci
Stars: ✭ 27 (-3.57%)