748 open source projects that are alternatives to or similar to SparseLSH

teanaps
An open-source Python library for natural language processing and text analysis.
Stars: ✭ 91 (-28.35%)
Mutual labels:  text-mining, data-mining, clustering
genieclust
Genie++ Fast and Robust Hierarchical Clustering with Noise Point Detection - for Python and R
Stars: ✭ 34 (-73.23%)
Mutual labels:  data-mining, clustering
Orange3
🍊 📊 💡 Orange: Interactive data analysis
Stars: ✭ 3,152 (+2381.89%)
Mutual labels:  data-mining, clustering
R
All Algorithms implemented in R
Stars: ✭ 294 (+131.5%)
Mutual labels:  data-mining, clustering
kmeans
A simple implementation of K-means (and Bisecting K-means) clustering algorithm in Python
Stars: ✭ 18 (-85.83%)
Mutual labels:  data-mining, clustering
Alink
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
Stars: ✭ 2,936 (+2211.81%)
Mutual labels:  data-mining, clustering
Gwu data mining
Materials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (+70.87%)
Mutual labels:  text-mining, data-mining
Qminer
Analytic platform for real-time large-scale streams containing structured and unstructured data.
Stars: ✭ 206 (+62.2%)
Mutual labels:  text-mining, data-mining
Tadw
An implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Stars: ✭ 43 (-66.14%)
Mutual labels:  text-mining, data-mining
Artificial Adversary
🗣️ Tool to generate adversarial text examples and test machine learning models against them
Stars: ✭ 348 (+174.02%)
Mutual labels:  text-mining, data-mining
Cogcomp Nlpy
CogComp's light-weight Python NLP annotators
Stars: ✭ 115 (-9.45%)
Mutual labels:  text-mining, data-mining
Pyclustering
pyclustering is a Python, C++ data mining library.
Stars: ✭ 806 (+534.65%)
Mutual labels:  data-mining, clustering
Bagofconcepts
Python implementation of bag-of-concepts
Stars: ✭ 18 (-85.83%)
Mutual labels:  text-mining, clustering
corpusexplorer2.0
Corpus linguistics has never been so easy...
Stars: ✭ 16 (-87.4%)
Mutual labels:  text-mining, data-mining
Pyss3
A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI
Stars: ✭ 191 (+50.39%)
Mutual labels:  text-mining, data-mining
Rmdl
RMDL: Random Multimodel Deep Learning for Classification
Stars: ✭ 375 (+195.28%)
Mutual labels:  text-mining, data-mining
Lda Topic Modeling
A PureScript, browser-based implementation of LDA topic modeling.
Stars: ✭ 91 (-28.35%)
Mutual labels:  text-mining, clustering
genie
Genie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)
Stars: ✭ 21 (-83.46%)
Mutual labels:  data-mining, clustering
Data mining
The Ruby DataMining Gem is a small collection of several data-mining algorithms.
Stars: ✭ 10 (-92.13%)
Mutual labels:  data-mining, clustering
advanced-text-mining
Covers natural language processing and text analysis methodology using the TEANAPS library.
Stars: ✭ 15 (-88.19%)
Mutual labels:  text-mining, data-mining
2018 Machinelearning Lectures Esa
Machine Learning Lectures at the European Space Agency (ESA) in 2018
Stars: ✭ 280 (+120.47%)
Mutual labels:  text-mining, clustering
Metasra Pipeline
MetaSRA: normalized sample-specific metadata for the Sequence Read Archive
Stars: ✭ 33 (-74.02%)
Mutual labels:  text-mining, data-mining
Text mining resources
Resources for learning about Text Mining and Natural Language Processing
Stars: ✭ 358 (+181.89%)
Mutual labels:  text-mining, data-mining
Heart disease prediction
Heart Disease prediction using 5 algorithms
Stars: ✭ 43 (-66.14%)
Mutual labels:  data-mining, clustering
perke
A keyphrase extractor for Persian
Stars: ✭ 60 (-52.76%)
Mutual labels:  text-mining, data-mining
hierarchical-clustering
A Python implementation of divisive and hierarchical clustering algorithms. The algorithms were tested on the Human Gene DNA Sequence dataset and dendrograms were plotted.
Stars: ✭ 62 (-51.18%)
Mutual labels:  data-mining, clustering
Matrixprofile
A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, accessible to everyone.
Stars: ✭ 141 (+11.02%)
Mutual labels:  data-mining, clustering
Textract
extract text from any document. no muss. no fuss.
Stars: ✭ 3,165 (+2392.13%)
Mutual labels:  text-mining, data-mining
Elki
ELKI Data Mining Toolkit
Stars: ✭ 613 (+382.68%)
Mutual labels:  data-mining, clustering
Xioc
Extract indicators of compromise from text, including "escaped" ones.
Stars: ✭ 148 (+16.54%)
Mutual labels:  text-mining, data-mining
iis
Information Inference Service of the OpenAIRE system
Stars: ✭ 16 (-87.4%)
Mutual labels:  text-mining, data-mining
tf-idf-python
Term frequency–inverse document frequency for Chinese novel/documents implemented in python.
Stars: ✭ 98 (-22.83%)
Mutual labels:  text-mining, data-mining
color cloth
color_cloth gets the main colors and their proportions from a cloth image, ignoring the background. It uses the EM algorithm from the OpenCV library, which needs an image with the item in the center of the picture.
Stars: ✭ 20 (-84.25%)
Mutual labels:  clustering
restaurant-finder-featureReviews
Build a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).
Stars: ✭ 21 (-83.46%)
Mutual labels:  text-mining
clusters
Cluster analysis library for Golang
Stars: ✭ 68 (-46.46%)
Mutual labels:  clustering
watchman
Watchman: An open-source social-media event-detection system
Stars: ✭ 18 (-85.83%)
Mutual labels:  clustering
TextDatasetCleaner
🔬 Cleaning junk out of datasets (normalization, preprocessing)
Stars: ✭ 27 (-78.74%)
Mutual labels:  text-mining
4chanMarkovText
Text Generation using Markov Chains fed by 4chan APIs
Stars: ✭ 28 (-77.95%)
Mutual labels:  data-mining
influxdb-ha
High-availability and horizontal scalability for InfluxDB
Stars: ✭ 45 (-64.57%)
Mutual labels:  clustering
Quran-and-Arabic-Language-Repository
Projects & Libraries related to Quran & Arabic Language
Stars: ✭ 26 (-79.53%)
Mutual labels:  text-mining
pathpy
pathpy is an OpenSource python package for the modeling and analysis of pathways and temporal networks using higher-order and multi-order graphical models
Stars: ✭ 124 (-2.36%)
Mutual labels:  data-mining
T-CorEx
Implementation of linear CorEx and temporal CorEx.
Stars: ✭ 31 (-75.59%)
Mutual labels:  clustering
sacred
📖 Sacred texts in R
Stars: ✭ 19 (-85.04%)
Mutual labels:  text-mining
Spectre
A computational toolkit in R for the integration, exploration, and analysis of high-dimensional single-cell cytometry and imaging data.
Stars: ✭ 31 (-75.59%)
Mutual labels:  clustering
DigitalCellSorter
Digital Cell Sorter (DCS): single cell RNA-seq analysis toolkit.
Stars: ✭ 19 (-85.04%)
Mutual labels:  clustering
civicmine
Text mining cancer biomarkers for the CIVIC database
Stars: ✭ 19 (-85.04%)
Mutual labels:  text-mining
SpectralClustering.jl
Spectral clustering algorithms written in Julia
Stars: ✭ 46 (-63.78%)
Mutual labels:  clustering
Introduction-to-text-mining-with-Python
Lectures in Urban Data Science Lab, Seoul
Stars: ✭ 25 (-80.31%)
Mutual labels:  text-mining
Network-Embedding-Resources
Network Embedding Survey and Resources
Stars: ✭ 43 (-66.14%)
Mutual labels:  data-mining
impfuzzy
Fuzzy Hash calculated from import API of PE files
Stars: ✭ 67 (-47.24%)
Mutual labels:  clustering
Tencent2017 Final Rank28 code
Rank 28 code from the first Tencent Social Ads College Algorithm Competition (2017)
Stars: ✭ 85 (-33.07%)
Mutual labels:  data-mining
Instagram-Comments-Scraper
Instagram comment scraper using Python and Selenium. Saves the comments to Excel.
Stars: ✭ 73 (-42.52%)
Mutual labels:  data-mining
lda2vec
Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019
Stars: ✭ 27 (-78.74%)
Mutual labels:  text-mining
Sampled-MinHashing
A method to mine beyond-pairwise relationships using Min-Hashing for large-scale pattern discovery
Stars: ✭ 24 (-81.1%)
Mutual labels:  clustering
mousetrap
Process and Analyze Mouse-Tracking Data
Stars: ✭ 33 (-74.02%)
Mutual labels:  clustering
BLUELAY
Searches online paste sites for certain search terms which can indicate a possible data breach.
Stars: ✭ 24 (-81.1%)
Mutual labels:  data-mining
G-SimCLR
This is the code base for paper "G-SimCLR : Self-Supervised Contrastive Learning with Guided Projection via Pseudo Labelling" by Souradip Chakraborty, Aritra Roy Gosthipaty and Sayak Paul.
Stars: ✭ 69 (-45.67%)
Mutual labels:  clustering
LinearCorex
Fast, linear version of CorEx for covariance estimation, dimensionality reduction, and subspace clustering with very under-sampled, high-dimensional data
Stars: ✭ 39 (-69.29%)
Mutual labels:  clustering
Bankruptcy-Prediction
Mining the Polish Bankruptcy Data
Stars: ✭ 21 (-83.46%)
Mutual labels:  data-mining
DBSCANSD
Java implementation for DBSCANSD, a trajectory clustering algorithm.
Stars: ✭ 35 (-72.44%)
Mutual labels:  clustering