All Projects → SparseLSH → Similar Projects or Alternatives

748 Open source projects that are alternatives of or similar to SparseLSH

emperor-os
(new released v2.5 LTS.2022-06-25) It has focused on developing an All in One operating system for programming, designing and data science.Emperor-OS has over 500 apps and important tools
Stars: ✭ 32 (-74.8%)
Mutual labels:  data-mining
R Text Data
List of textual data sources to be used for text mining in R
Stars: ✭ 85 (-33.07%)
Mutual labels:  text-mining
PySPOD
A Python package for spectral proper orthogonal decomposition (SPOD).
Stars: ✭ 50 (-60.63%)
Mutual labels:  data-mining
Python nlp tutorial
This repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)
Stars: ✭ 72 (-43.31%)
Mutual labels:  text-mining
T-CorEx
Implementation of linear CorEx and temporal CorEx.
Stars: ✭ 31 (-75.59%)
Mutual labels:  clustering
How To Mine Newsfeed Data And Extract Interactive Insights In Python
A practical guide to topic mining and interactive visualizations
Stars: ✭ 61 (-51.97%)
Mutual labels:  text-mining
LabelPropagation
A NetworkX implementation of Label Propagation from a "Near Linear Time Algorithm to Detect Community Structures in Large-Scale Networks" (Physical Review E 2008).
Stars: ✭ 101 (-20.47%)
Mutual labels:  clustering
Konlpy
Python package for Korean natural language processing.
Stars: ✭ 1,098 (+764.57%)
Mutual labels:  text-mining
reader
Distant Reader, a tool for using & understanding a corpus
Stars: ✭ 18 (-85.83%)
Mutual labels:  text-mining
Ngram
Fast n-Gram Tokenization
Stars: ✭ 55 (-56.69%)
Mutual labels:  text-mining
tsam
A python-based time series aggregation module (tsam) which can be used to reduce the number of time steps using typical periods or by decreasing the temporal resolution
Stars: ✭ 112 (-11.81%)
Mutual labels:  clustering
sacred
📖 Sacred texts in R
Stars: ✭ 19 (-85.04%)
Mutual labels:  text-mining
Gsoc2018 3gm
💫 Automated codification of Greek Legislation with NLP
Stars: ✭ 36 (-71.65%)
Mutual labels:  text-mining
AILA-Artificial-Intelligence-for-Legal-Assistance
Python implementations of the various methods used in FIRE 2019 conference.
Stars: ✭ 39 (-69.29%)
Mutual labels:  data-mining
TabInOut
Framework for information extraction from tables
Stars: ✭ 37 (-70.87%)
Mutual labels:  text-mining
Tidy Text Mining
Manuscript of the book "Tidy Text Mining with R" by Julia Silge and David Robinson
Stars: ✭ 961 (+656.69%)
Mutual labels:  text-mining
2018-Tencent-Lookalike
2018-腾讯广告算法大赛-相似人群拓展(初赛):10th/1563 (Top 0.64%)
Stars: ✭ 46 (-63.78%)
Mutual labels:  data-mining
Spider
A configurable web spider with a easy-to-use web console
Stars: ✭ 954 (+651.18%)
Mutual labels:  text-mining
SpectralClustering.jl
Spectral clustering algorithms written in Julia
Stars: ✭ 46 (-63.78%)
Mutual labels:  clustering
rabbitmq-clusterer
This project is ABANDONWARE. Use https://www.rabbitmq.com/cluster-formation.html instead.
Stars: ✭ 72 (-43.31%)
Mutual labels:  clustering
simon-frontend
💹 SIMON is powerful, flexible, open-source and easy to use machine learning knowledge discovery platform 💻
Stars: ✭ 114 (-10.24%)
Mutual labels:  data-mining
Rake Nltk
Python implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.
Stars: ✭ 793 (+524.41%)
Mutual labels:  text-mining
candis
🎀 A data mining suite for gene expression data.
Stars: ✭ 28 (-77.95%)
Mutual labels:  data-mining
Text2vec
Fast vectorization, topic modeling, distances and GloVe word embeddings in R.
Stars: ✭ 715 (+462.99%)
Mutual labels:  text-mining
Data-Analyst-Nanodegree
This repo consists of the projects that I completed as a part of the Udacity's Data Analyst Nanodegree's curriculum.
Stars: ✭ 13 (-89.76%)
Mutual labels:  data-mining
Nlp Notebooks
A collection of notebooks for Natural Language Processing from NLP Town
Stars: ✭ 513 (+303.94%)
Mutual labels:  text-mining
Introduction-to-text-mining-with-Python
Lectures in Urban Data Science Lab, Seoul
Stars: ✭ 25 (-80.31%)
Mutual labels:  text-mining
Ldavis
R package for web-based interactive topic model visualization.
Stars: ✭ 466 (+266.93%)
Mutual labels:  text-mining
website-to-json
Converts website to json using jQuery selectors
Stars: ✭ 37 (-70.87%)
Mutual labels:  data-mining
Pyshorttextcategorization
Various Algorithms for Short Text Mining
Stars: ✭ 429 (+237.8%)
Mutual labels:  text-mining
rosette-elasticsearch-plugin
Document Enrichment plugin for Elasticsearch
Stars: ✭ 25 (-80.31%)
Mutual labels:  text-mining
gosquito
gosquito ("go" + "mosquito") is a pluggable tool for data gathering, data processing and data transmitting to various destinations.
Stars: ✭ 25 (-80.31%)
Mutual labels:  data-mining
FixedEffectjlr
R interface for Fixed Effect Models
Stars: ✭ 20 (-84.25%)
Mutual labels:  clustering
MAL-Map
Cluster and visualize relationships between anime on MyAnimeList
Stars: ✭ 201 (+58.27%)
Mutual labels:  clustering
impfuzzy
Fuzzy Hash calculated from import API of PE files
Stars: ✭ 67 (-47.24%)
Mutual labels:  clustering
mongo-replica-with-docker
How to deploy a MongoDB Replica Set using Docker
Stars: ✭ 105 (-17.32%)
Mutual labels:  clustering
R.TeMiS
R.TeMiS: R Text Mining Solution
Stars: ✭ 21 (-83.46%)
Mutual labels:  text-mining
Udacity-Data-Analyst-Nanodegree
Repository for the projects needed to complete the Data Analyst Nanodegree.
Stars: ✭ 31 (-75.59%)
Mutual labels:  text-mining
imbalanced-ensemble
Class-imbalanced / Long-tailed ensemble learning in Python. Modular, flexible, and extensible. | 模块化、灵活、易扩展的类别不平衡/长尾机器学习库
Stars: ✭ 199 (+56.69%)
Mutual labels:  data-mining
Loan-Approval-Prediction
Loan Application Data Analysis
Stars: ✭ 61 (-51.97%)
Mutual labels:  data-mining
Nlpython
This repository contains the code related to Natural Language Processing using python scripting language. All the codes are related to my book entitled "Python Natural Language Processing"
Stars: ✭ 265 (+108.66%)
Mutual labels:  text-mining
xgboost-smote-detect-fraud
Can we predict accurately on the skewed data? What are the sampling techniques that can be used. Which models/techniques can be used in this scenario? Find the answers in this code pattern!
Stars: ✭ 59 (-53.54%)
Mutual labels:  data-mining
Tencent2017 Final Rank28 code
2017第一届腾讯社交广告高校算法大赛Rank28_code
Stars: ✭ 85 (-33.07%)
Mutual labels:  data-mining
snorkeling
Extracting biomedical relationships from literature with Snorkel 🏊
Stars: ✭ 56 (-55.91%)
Mutual labels:  text-mining
sparse
Sparse matrix formats for linear algebra supporting scientific and machine learning applications
Stars: ✭ 136 (+7.09%)
Mutual labels:  sparse-matrices
kwx
BERT, LDA, and TFIDF based keyword extraction in Python
Stars: ✭ 33 (-74.02%)
Mutual labels:  text-mining
FEATHER
The reference implementation of FEATHER from the CIKM '20 paper "Characteristic Functions on Graphs: Birds of a Feather, from Statistical Descriptors to Parametric Models".
Stars: ✭ 34 (-73.23%)
Mutual labels:  data-mining
support-tickets-classification
This case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en
Stars: ✭ 142 (+11.81%)
Mutual labels:  text-mining
sugarcube
Monoidal data processes.
Stars: ✭ 32 (-74.8%)
Mutual labels:  data-mining
DaDengAndHisPython
【微信公众号:大邓和他的python】, Python语法快速入门https://www.bilibili.com/video/av44384851 Python网络爬虫快速入门https://www.bilibili.com/video/av72010301, 我的联系邮箱[email protected]
Stars: ✭ 59 (-53.54%)
Mutual labels:  text-mining
Sampled-MinHashing
A method to mine beyond-pairwise relationships using Min-Hashing for large-scale pattern discovery
Stars: ✭ 24 (-81.1%)
Mutual labels:  clustering
deduce
Deduce: de-identification method for Dutch medical text
Stars: ✭ 40 (-68.5%)
Mutual labels:  text-mining
TrajectoryTracking
Trajectory Tracking Project
Stars: ✭ 16 (-87.4%)
Mutual labels:  clustering
readability
Fast readability scores for text data
Stars: ✭ 22 (-82.68%)
Mutual labels:  text-mining
faythe
An experimental cluster brings Prometheus and OpenStack together
Stars: ✭ 18 (-85.83%)
Mutual labels:  clustering
TurboDataMiner
The objective of this Burp Suite extension is the flexible and dynamic extraction, correlation, and structured presentation of information from the Burp Suite project as well as the flexible and dynamic on-the-fly modification of outgoing or incoming HTTP requests using Python scripts. Thus, Turbo Data Miner shall aid in gaining a better and fas…
Stars: ✭ 46 (-63.78%)
Mutual labels:  data-mining
NNet
algorithm for study: multi-layer-perceptron, cluster-graph, cnn, rnn, restricted boltzmann machine, bayesian network
Stars: ✭ 24 (-81.1%)
Mutual labels:  clustering
Semantic-Bus
object flow treatment, data transformation
Stars: ✭ 49 (-61.42%)
Mutual labels:  data-mining
Machine-learning
This repository will contain all the stuffs required for beginners in ML and DL do follow and star this repo for regular updates
Stars: ✭ 27 (-78.74%)
Mutual labels:  clustering
tsp-essay
A fun study of some heuristics for the Travelling Salesman Problem.
Stars: ✭ 15 (-88.19%)
Mutual labels:  clustering
301-360 of 748 similar projects