tika-similarityTika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.
Stars: ✭ 92 (+384.21%)
monolishmonolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Stars: ✭ 166 (+773.68%)
realtimemap-dotnetA showcase for Proto.Actor - an ultra-fast distributed actors solution for Go, C#, and Java/Kotlin.
Stars: ✭ 47 (+147.37%)
PencilFFTs.jlFast Fourier transforms of MPI-distributed Julia arrays
Stars: ✭ 48 (+152.63%)
dbcsrDBCSR: Distributed Block Compressed Sparse Row matrix library
Stars: ✭ 65 (+242.11%)
RATTLEReference-free reconstruction and error correction of transcriptomes from Nanopore long-read sequencing
Stars: ✭ 35 (+84.21%)
neworderA dynamic microsimulation framework for python
Stars: ✭ 15 (-21.05%)
ImplicitGlobalGrid.jlAlmost trivial distributed parallelization of stencil-based GPU and CPU applications on a regular staggered grid
Stars: ✭ 88 (+363.16%)
cereCERE: Codelet Extractor and REplayer
Stars: ✭ 27 (+42.11%)
mcxxMercurium is a C/C++/Fortran source-to-source compilation infrastructure aimed at fast prototyping developed by the Programming Models group at the Barcelona Supercomputing Center
Stars: ✭ 59 (+210.53%)
PartitionedArrays.jlVectors and sparse matrices partitioned into pieces for parallel distributed-memory computations.
Stars: ✭ 45 (+136.84%)
influxdb-haHigh-availability and horizontal scalability for InfluxDB
Stars: ✭ 45 (+136.84%)
ClusteringImplements "Clustering a Million Faces by Identity"
Stars: ✭ 128 (+573.68%)
Sampled-MinHashingA method to mine beyond-pairwise relationships using Min-Hashing for large-scale pattern discovery
Stars: ✭ 24 (+26.32%)
protoactor-goProto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin
Stars: ✭ 4,138 (+21678.95%)
LinearCorexFast, linear version of CorEx for covariance estimation, dimensionality reduction, and subspace clustering with very under-sampled, high-dimensional data
Stars: ✭ 39 (+105.26%)
FoxNNSimple neural network
Stars: ✭ 20 (+5.26%)
newerhoodsA Data Clinic project that aggregates NYC Open Data at the tract-level and uses Machine Learning techniques to re-imagine neighborhood boundaries.
Stars: ✭ 36 (+89.47%)
morphoclusterSource code for the MorphoCluster application described in Schroeder et al. 2020
Stars: ✭ 13 (-31.58%)
QuestionClusteringClasificador de preguntas escrito en python 3 que fue implementado en el siguiente vídeo: https://youtu.be/qnlW1m6lPoY
Stars: ✭ 15 (-21.05%)
dropClustVersion 2.1.0 released
Stars: ✭ 19 (+0%)
nlp-ltNatural Language Processing for Lithuanian language
Stars: ✭ 17 (-10.53%)
watchmanWatchman: An open-source social-media event-detection system
Stars: ✭ 18 (-5.26%)
mpifxModern Fortran wrappers around MPI routines
Stars: ✭ 25 (+31.58%)
buddhabrotSingle Core and Multi Core (CPU and GPU) versions of Buddhabrot
Stars: ✭ 15 (-21.05%)
Clustering-DatasetsThis repository contains the collection of UCI (real-life) datasets and Synthetic (artificial) datasets (with cluster labels and MATLAB files) ready to use with clustering algorithms.
Stars: ✭ 189 (+894.74%)
faytheAn experimental cluster brings Prometheus and OpenStack together
Stars: ✭ 18 (-5.26%)
euler2d kokkosSimple 2d finite volume solver for Euler equations using c++ kokkos library
Stars: ✭ 27 (+42.11%)
ZeitlineA polylinear timeline with clustering, centred on interactions. — Doc and demo https://octree-gva.github.io/Zeitline/
Stars: ✭ 15 (-21.05%)
minicoreFast and memory-efficient clustering + coreset construction, including fast distance kernels for Bregman and f-divergences.
Stars: ✭ 28 (+47.37%)
SpectreA computational toolkit in R for the integration, exploration, and analysis of high-dimensional single-cell cytometry and imaging data.
Stars: ✭ 31 (+63.16%)
centrifuge-toolkitTool for visualizing and empirically analyzing information encoded in binary files
Stars: ✭ 49 (+157.89%)
cramTool to run many small MPI jobs inside of one large MPI job.
Stars: ✭ 23 (+21.05%)
magento-clusterHighly Available and Auto-scalable Magento Cluster
Stars: ✭ 21 (+10.53%)
fuzzballOngoing development of the Fuzzball MUCK server software and associated functionality.
Stars: ✭ 38 (+100%)
Fraud-Detection-in-Online-TransactionsDetecting Frauds in Online Transactions using Anamoly Detection Techniques Such as Over Sampling and Under-Sampling as the ratio of Frauds is less than 0.00005 thus, simply applying Classification Algorithm may result in Overfitting
Stars: ✭ 41 (+115.79%)
py-lbgPython Implementation for Linde-Buzo-Gray / Generalized Lloyd Algorithm for vector quantization.
Stars: ✭ 22 (+15.79%)
DBSCANSDJava implementation for DBSCANSD, a trajectory clustering algorithm.
Stars: ✭ 35 (+84.21%)
Apartment-Interest-PredictionPredict people interest in renting specific NYC apartments. The challenge combines structured data, geolocalization, time data, free text and images.
Stars: ✭ 17 (-10.53%)
NPB-CPPNAS Parallel Benchmark Kernels in C/C++. The parallel versions are in FastFlow, TBB, and OpenMP.
Stars: ✭ 18 (-5.26%)
kmeansK-Means clustering
Stars: ✭ 51 (+168.42%)
hclustHierarchical clustering in JavaScript
Stars: ✭ 39 (+105.26%)
product-quantization🙃Implementation of vector quantization algorithms, codes for Norm-Explicit Quantization: Improving Vector Quantization for Maximum Inner Product Search.
Stars: ✭ 40 (+110.53%)
lightdashAn open source alternative to Looker built using dbt. Made for analysts ❤️
Stars: ✭ 1,082 (+5594.74%)
es pytorchHigh performance implementation of Deep neuroevolution in pytorch using mpi4py. Intended for use on HPC clusters
Stars: ✭ 20 (+5.26%)
hero-sdk⛔ DEPRECATED ⛔ HERO Software Development Kit
Stars: ✭ 21 (+10.53%)
ex unitedEasily spawn Elixir nodes (supervising, Mix configured, easy asserted / refuted) within ExUnit tests
Stars: ✭ 40 (+110.53%)
pyparEfficient and scalable parallelism using the message passing interface (MPI) to handle big data and highly computational problems.
Stars: ✭ 66 (+247.37%)
ByteSlice"Byteslice: Pushing the envelop of main memory data processing with a new storage layout" (SIGMOD'15)
Stars: ✭ 24 (+26.32%)
GrouProxFedGroup, A Clustered Federated Learning framework based on Tensorflow
Stars: ✭ 20 (+5.26%)
hotspot3d3D hotspot mutation proximity analysis tool
Stars: ✭ 43 (+126.32%)