GafferA large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (+1081.29%)
NetworkitNetworKit is a growing open-source toolkit for large-scale network analysis.
Stars: ✭ 383 (+175.54%)
FxgraphalgorithmsimulatorVisualizes specific Graph Algorithms like BFS, DFS, MST etc. on interactive user input graphs.
Stars: ✭ 22 (-84.17%)
MetorikkuA simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+159.71%)
MagellanGeo Spatial Data Analytics on Spark
Stars: ✭ 507 (+264.75%)
SwiftgraphA Graph Data Structure in Pure Swift
Stars: ✭ 588 (+323.02%)
Deepwalk CDeepWalk implementation in C++
Stars: ✭ 88 (-36.69%)
RglRGL is a framework for graph data structures and algorithms in Ruby.
Stars: ✭ 279 (+100.72%)
Ngraph.graphGraph data structure in JavaScript
Stars: ✭ 295 (+112.23%)
SparklerSpark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Stars: ✭ 362 (+160.43%)
DeltaAn open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Stars: ✭ 3,903 (+2707.91%)
Data Science Ipython NotebooksData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+15761.87%)
LittleballoffurLittle Ball of Fur - A graph sampling extension library for NetworKit and NetworkX (CIKM 2020)
Stars: ✭ 505 (+263.31%)
H2o 3H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+3969.06%)
Competitive codingThis repository contains some useful codes, techniques, algorithms and problem solutions helpful in Competitive Coding.
Stars: ✭ 393 (+182.73%)
GraphroleAutomatic feature extraction and node role assignment for transfer learning on graphs (ReFeX & RolX)
Stars: ✭ 38 (-72.66%)
Pydata NetworkxA short tutorial on network analysis using Game of Thrones, US Airports and Python!
Stars: ✭ 50 (-64.03%)
LabsResearch on distributed system
Stars: ✭ 73 (-47.48%)
RsparklingRSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-53.24%)
WorkbaseGrakn Workbase (Knowledge IDE)
Stars: ✭ 106 (-23.74%)
Graph samplingGraph Sampling is a python package containing various approaches which samples the original graph according to different sample sizes.
Stars: ✭ 99 (-28.78%)
OgreClojure library for querying Apache TinkerPop graphs
Stars: ✭ 118 (-15.11%)
Javascript Datastructures Algorithms📚 collection of JavaScript and TypeScript data structures and algorithms for education purposes. Source code bundle of JavaScript algorithms and data structures book
Stars: ✭ 3,221 (+2217.27%)
PymeasureScientific measurement library for instruments, experiments, and live-plotting
Stars: ✭ 255 (+83.45%)
MorpheusMorpheus brings the leading graph query language, Cypher, onto the leading distributed processing platform, Spark.
Stars: ✭ 303 (+117.99%)
SuccinctEnabling queries on compressed data.
Stars: ✭ 257 (+84.89%)
CommunitiesLibrary of community detection algorithms and visualization tools
Stars: ✭ 348 (+150.36%)
VivagraphjsGraph drawing library for JavaScript
Stars: ✭ 3,442 (+2376.26%)
BigdlBuilding Large-Scale AI Applications for Distributed Big Data
Stars: ✭ 3,813 (+2643.17%)
bigdata-funA complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-89.93%)
C Sharp Algorithms📚 📈 Plug-and-play class-library project of standard Data Structures and Algorithms in C#
Stars: ✭ 4,684 (+3269.78%)
Data StructuresCommon data structures and algorithms implemented in JavaScript
Stars: ✭ 139 (+0%)
TidygraphA tidy API for graph manipulation
Stars: ✭ 398 (+186.33%)
Lightgraphs.jlAn optimized graphs package for the Julia programming language
Stars: ✭ 611 (+339.57%)
ZeppelinWeb-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Stars: ✭ 5,513 (+3866.19%)
Spark Movie LensAn on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (+435.97%)
autThe Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (-20.14%)
FeastFeature Store for Machine Learning
Stars: ✭ 2,576 (+1753.24%)
SparkApache Spark - A unified analytics engine for large-scale data processing
Stars: ✭ 31,618 (+22646.76%)
PotironPotiron - Normalize, Index and Visualize Network Capture
Stars: ✭ 66 (-52.52%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-43.17%)
LeaderboardxA tool for building graphs quickly
Stars: ✭ 13 (-90.65%)
VerseReference implementation of the paper VERSE: Versatile Graph Embeddings from Similarity Measures
Stars: ✭ 98 (-29.5%)
BigdataclassTwo-day workshop that covers how to use R to interact databases and Spark
Stars: ✭ 110 (-20.86%)
LogislandScalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-30.22%)
spark-acidACID Data Source for Apache Spark based on Hive ACID
Stars: ✭ 91 (-34.53%)
leaflet heatmap简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
Stars: ✭ 13 (-90.65%)
SparkjniA heterogeneous Apache Spark framework.
Stars: ✭ 11 (-92.09%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+862.59%)
DepthmapxdepthmapX is a multi-platform Spatial Network Analysis Software
Stars: ✭ 120 (-13.67%)
UrbanaccessA tool for GTFS transit and OSM pedestrian network accessibility analysis
Stars: ✭ 137 (-1.44%)