Pythondatarepo for code published on pythondata.com
Stars: ✭ 113 (-11.72%)
Macro mlCourse Website on Macroeconomic Analysis with Machine Learning and Big Data
Stars: ✭ 53 (-58.59%)
TreevizTree diagrams with JavaScript 🌲 📈
Stars: ✭ 95 (-25.78%)
Datumbox FrameworkDatumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.
Stars: ✭ 1,063 (+730.47%)
TraildbTrailDB is an efficient tool for storing and querying series of events
Stars: ✭ 1,029 (+703.91%)
GenieDistributed Big Data Orchestration Service
Stars: ✭ 1,544 (+1106.25%)
EgadsA Java package to automatically detect anomalies in large scale time-series data
Stars: ✭ 997 (+678.91%)
Esper TvEsper instance for TV news analysis
Stars: ✭ 37 (-71.09%)
RichdemHigh-performance Terrain and Hydrology Analysis
Stars: ✭ 127 (-0.78%)
QcportalA client interface to the QCArchive Project (read-only image of QCFractal)
Stars: ✭ 29 (-77.34%)
Spark R Notebooks R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-14.84%)
SparkApache Spark - A unified analytics engine for large-scale data processing
Stars: ✭ 31,618 (+24601.56%)
Uproot4ROOT I/O in pure Python and NumPy.
Stars: ✭ 80 (-37.5%)
PhoenixMirror of Apache Phoenix
Stars: ✭ 867 (+577.34%)
Hdfs ShellHDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS
Stars: ✭ 117 (-8.59%)
SparkjniA heterogeneous Apache Spark framework.
Stars: ✭ 11 (-91.41%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-38.28%)
Hazelcast JetDistributed Stream and Batch Processing
Stars: ✭ 855 (+567.97%)
Tennis Crystal BallUltimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-16.41%)
AutodlAutomated Deep Learning without ANY human intervention. 1'st Solution for AutoDL [email protected]
Stars: ✭ 854 (+567.19%)
Pyspark Setup DemoDemo of PySpark and Jupyter Notebook with the Jupyter Docker Stacks
Stars: ✭ 24 (-81.25%)
FeastFeature Store for Machine Learning
Stars: ✭ 2,576 (+1912.5%)
Hadoop For GeoeventArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-96.09%)
LabsResearch on distributed system
Stars: ✭ 73 (-42.97%)
MahaA framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.
Stars: ✭ 101 (-21.09%)
Rakam Api📈 Collect customer event data from your apps. (Note that this project only includes the API collector, not the visualization platform)
Stars: ✭ 772 (+503.13%)
Spark Movie LensAn on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (+482.03%)
CmakCMAK is a tool for managing Apache Kafka clusters
Stars: ✭ 10,544 (+8137.5%)
AppdocsApplication Performance Optimization Summary
Stars: ✭ 1,169 (+813.28%)
Data Science CareerCareer Resources for Data Science, Machine Learning, Big Data and Business Analytics Career Repository
Stars: ✭ 630 (+392.19%)
Graph samplingGraph Sampling is a python package containing various approaches which samples the original graph according to different sample sizes.
Stars: ✭ 99 (-22.66%)
Kafka Streamsequivalent to kafka-streams 🐙 for nodejs ✨🐢🚀✨
Stars: ✭ 613 (+378.91%)
CarbondataMirror of Apache CarbonData
Stars: ✭ 1,158 (+804.69%)
OozieMirror of Apache Oozie
Stars: ✭ 602 (+370.31%)
GiraphMirror of Apache Giraph
Stars: ✭ 569 (+344.53%)
Flink ShadedApache Flink shaded artifacts repository
Stars: ✭ 67 (-47.66%)
PachydermReproducible Data Science at Scale!
Stars: ✭ 5,305 (+4044.53%)
CouchdbSeamless multi-master syncing database with an intuitive HTTP/JSON API, designed for reliability
Stars: ✭ 5,166 (+3935.94%)
Cloud VolumeRead and write Neuroglancer datasets programmatically.
Stars: ✭ 63 (-50.78%)
ArkimeArkime (formerly Moloch) is an open source, large scale, full packet capturing, indexing, and database system.
Stars: ✭ 4,994 (+3801.56%)
AsakusafwAsakusa Framework
Stars: ✭ 114 (-10.94%)
WarpConvert and analyze large data sets at light speed, on Mac and iOS.
Stars: ✭ 62 (-51.56%)
AzuredatalakeSamples and Docs for Azure Data Lake Store and Analytics
Stars: ✭ 128 (+0%)
Griffon VmGriffon Data Science Virtual Machine
Stars: ✭ 128 (+0%)
Report自动化配置报表平台。演示地址http://58.87.112.247/report 账号 visitor密码123456
Stars: ✭ 123 (-3.91%)
Just Dashboard📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (+1080.47%)
LogislandScalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-24.22%)
Attic LensMirror of Apache Lens
Stars: ✭ 58 (-54.69%)