Circosjsd3 library to build circular graphs
Stars: ✭ 436 (+209.22%)
Spark Movie LensAn on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (+428.37%)
GenieDistributed Big Data Orchestration Service
Stars: ✭ 1,544 (+995.04%)
awesome-coder-resources编程路上加油站!------【持续更新中...欢迎star,欢迎常回来看看......】【内容:编程/学习/阅读资源,开源项目,面试题,网站,书,博客,教程等等】
Stars: ✭ 54 (-61.7%)
SparkrdmaRDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (+52.48%)
leaflet heatmap简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
Stars: ✭ 13 (-90.78%)
gan deeplearning4jAutomatic feature engineering using Generative Adversarial Networks using Deeplearning4j and Apache Spark.
Stars: ✭ 19 (-86.52%)
big dataA collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (-75.89%)
Countly Sdk CordovaCountly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-51.06%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+848.94%)
v6.dooring.public可视化大屏解决方案, 提供一套可视化编辑引擎, 助力个人或企业轻松定制自己的可视化大屏应用.
Stars: ✭ 323 (+129.08%)
Hadoop For GeoeventArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-96.45%)
Aws Etl OrchestratorA serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
Stars: ✭ 245 (+73.76%)
CortxCORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (+202.13%)
Tennis Crystal BallUltimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-24.11%)
Clustering4EverC4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Stars: ✭ 126 (-10.64%)
Spark R Notebooks R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-22.7%)
meetups-archivosPpts, códigos y videos de las meetups, data science days, videollamadas y workshops. Data Science Research es una organización sin fines de lucro que busca difundir, descentralizar y difundir los conocimientos en Ciencia de Datos e Inteligencia Artificial en el Perú, dando oportunidades a nuevos talentos mediante MeetUps, Workshops y Semilleros …
Stars: ✭ 60 (-57.45%)
Uproot3ROOT I/O in pure Python and NumPy.
Stars: ✭ 312 (+121.28%)
Uproot4ROOT I/O in pure Python and NumPy.
Stars: ✭ 80 (-43.26%)
HamaMirror of Apache Hama
Stars: ✭ 129 (-8.51%)
MindforgerThinking notebook and Markdown editor.
Stars: ✭ 1,695 (+1102.13%)
SigmfThe Signal Metadata Format Specification
Stars: ✭ 120 (-14.89%)
Spark.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+1120.57%)
Hdfs ShellHDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS
Stars: ✭ 117 (-17.02%)
DrillApache Drill is a distributed MPP query layer for self describing data
Stars: ✭ 1,619 (+1048.23%)
GafferA large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (+1064.54%)
CmakCMAK is a tool for managing Apache Kafka clusters
Stars: ✭ 10,544 (+7378.01%)
AcceleratorThe Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (-2.84%)
Amazon S3 Find And ForgetAmazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
Stars: ✭ 115 (-18.44%)
AsakusafwAsakusa Framework
Stars: ✭ 114 (-19.15%)
TajoMirror of Apache Tajo
Stars: ✭ 128 (-9.22%)
Just Dashboard📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (+971.63%)
Pythondatarepo for code published on pythondata.com
Stars: ✭ 113 (-19.86%)
AzuredatalakeSamples and Docs for Azure Data Lake Store and Analytics
Stars: ✭ 128 (-9.22%)
AmbariMirror of Apache Ambari
Stars: ✭ 1,576 (+1017.73%)
Liteflowliteflow是一个基于任务版本来实现的分布式任务流调度系统
Stars: ✭ 112 (-20.57%)
FeastFeature Store for Machine Learning
Stars: ✭ 2,576 (+1726.95%)
Ni PytMateriály k předmětu NI-PYT na FIT ČVUT
Stars: ✭ 112 (-20.57%)
Azure Event Hubs SparkEnabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-0.71%)
TwitworkMonitor twitter stream
Stars: ✭ 133 (-5.67%)
Griffon VmGriffon Data Science Virtual Machine
Stars: ✭ 128 (-9.22%)
Learn Golang慕课网 Google 资深工程师深度讲解 Go 语言
Stars: ✭ 113 (-19.86%)
Lambda ArchApplying Lambda Architecture with Spark, Kafka, and Cassandra.
Stars: ✭ 111 (-21.28%)
RichdemHigh-performance Terrain and Hydrology Analysis
Stars: ✭ 127 (-9.93%)
BigdataclassTwo-day workshop that covers how to use R to interact databases and Spark
Stars: ✭ 110 (-21.99%)
Books技术书籍等
Stars: ✭ 110 (-21.99%)