Data Science Live BookAn open source book to learn data science, data analysis and machine learning, suitable for all ages!
Stars: ✭ 193 (-20.25%)
Just Dashboard📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (+524.38%)
ParquetviewerSimple windows desktop application for viewing & querying Apache Parquet files
Stars: ✭ 145 (-40.08%)
AmbariMirror of Apache Ambari
Stars: ✭ 1,576 (+551.24%)
GimelBig Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-10.74%)
BigdataclassTwo-day workshop that covers how to use R to interact databases and Spark
Stars: ✭ 110 (-54.55%)
HydrographA visual ETL development and debugging tool for big data
Stars: ✭ 144 (-40.5%)
GunAn open source cybersecurity protocol for syncing decentralized graph data.
Stars: ✭ 15,172 (+6169.42%)
VizukaExplore high-dimensional datasets and how your algo handles specific regions.
Stars: ✭ 100 (-58.68%)
Books整理一些书籍 ,包含 C&C++ 、git 、Java、Keras 、Linux 、NLP 、Python 、Scala 、TensorFlow 、大数据 、推荐系统、数据库、数据挖掘 、机器学习 、深度学习 、算法等。
Stars: ✭ 222 (-8.26%)
Belajarpython.comOpen Source Indonesian Python Programming Tutorial Site
Stars: ✭ 141 (-41.74%)
KuduMirror of Apache Kudu
Stars: ✭ 1,360 (+461.98%)
FlumeMirror of Apache Flume
Stars: ✭ 2,200 (+809.09%)
LogislandScalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-59.92%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+452.89%)
SparkrdmaRDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (-11.16%)
ReefMirror of Apache REEF
Stars: ✭ 92 (-61.98%)
PoseidonA search engine which can hold 100 trillion lines of log data.
Stars: ✭ 1,793 (+640.91%)
Bitcoin Value Predictor[NOT MAINTAINED] Predicting Bit coin price using Time series analysis and sentiment analysis of tweets on bitcoin
Stars: ✭ 91 (-62.4%)
DvidDistributed, Versioned, Image-oriented Dataservice
Stars: ✭ 174 (-28.1%)
AcceleratorThe Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (-43.39%)
PanoptesA Global Scale Network Telemetry Ecosystem
Stars: ✭ 80 (-66.94%)
Selinon An advanced distributed task flow management on top of Celery
Stars: ✭ 237 (-2.07%)
IotdbApache IoTDB
Stars: ✭ 1,221 (+404.55%)
Attic PredictionioPredictionIO, a machine learning server for developers and ML engineers.
Stars: ✭ 12,522 (+5074.38%)
CookbookThe Data Engineering Cookbook
Stars: ✭ 9,829 (+3961.57%)
HamaMirror of Apache Hama
Stars: ✭ 129 (-46.69%)
BookkeeperApache Bookkeeper
Stars: ✭ 1,178 (+386.78%)
CalciteApache Calcite
Stars: ✭ 2,816 (+1063.64%)
Countly Sdk CordovaCountly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-71.49%)
KeyviKeyvi - the key value index. It is an in-memory FST-based data structure highly optimized for size and lookup performance.
Stars: ✭ 161 (-33.47%)
AzuredatalakeSamples and Docs for Azure Data Lake Store and Analytics
Stars: ✭ 128 (-47.11%)
RsparklingRSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-73.14%)
NakedtensorBare bone examples of machine learning in TensorFlow
Stars: ✭ 2,443 (+909.5%)
Griffon VmGriffon Data Science Virtual Machine
Stars: ✭ 128 (-47.11%)
NabhashAn extremely fast Non-crypto-safe AES Based Hash algorithm for Big Data
Stars: ✭ 62 (-74.38%)
PrestoThe official home of the Presto distributed SQL query engine for big data
Stars: ✭ 12,957 (+5254.13%)
Attic LensMirror of Apache Lens
Stars: ✭ 58 (-76.03%)
Mobydq🐳 Tool to automate data quality checks on data pipelines
Stars: ✭ 123 (-49.17%)
Report自动化配置报表平台。演示地址http://58.87.112.247/report 账号 visitor密码123456
Stars: ✭ 123 (-49.17%)
Kafka UiOpen-Source Web GUI for Apache Kafka Management
Stars: ✭ 230 (-4.96%)
ElandPython Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (-2.89%)
UsqlU-SQL Examples and Issue Tracking
Stars: ✭ 221 (-8.68%)
Couchdb DockerSemi-official Apache CouchDB Docker images
Stars: ✭ 194 (-19.83%)
DatasciencevmTools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
Stars: ✭ 153 (-36.78%)
Hdfs ShellHDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS
Stars: ✭ 117 (-51.65%)