FxtA large scale feature extraction tool for text-based machine learning
Stars: ✭ 25 (-88.15%)
Haystack🔍 Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.
Stars: ✭ 3,409 (+1515.64%)
VectorsinsearchDice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Searching with Vectors' talk from Haystack 2019 (US). Builds upon my conceptual search and semantic search work from 2015
Stars: ✭ 71 (-66.35%)
PisaPISA: Performant Indexes and Search for Academia
Stars: ✭ 489 (+131.75%)
Rated Ranking EvaluatorSearch Quality Evaluation Tool for Apache Solr & Elasticsearch search-based infrastructures
Stars: ✭ 134 (-36.49%)
PkePython Keyphrase Extraction module
Stars: ✭ 855 (+305.21%)
Sf1r LiteSearch Formula-1——A distributed high performance massive data engine for enterprise/vertical search
Stars: ✭ 158 (-25.12%)
AnseriniA Lucene toolkit for replicable information retrieval research
Stars: ✭ 573 (+171.56%)
Ds2iA library of inverted index data structures
Stars: ✭ 104 (-50.71%)
Pyndripyndri is a Python interface to the Indri search engine.
Stars: ✭ 85 (-59.72%)
Sequence Semantic EmbeddingTools and recipes to train deep learning models and build services for NLP tasks such as text classification, semantic search ranking and recall fetching, cross-lingual information retrieval, and question answering etc.
Stars: ✭ 435 (+106.16%)
GaanaapiUnofficial Gaana API
Stars: ✭ 59 (-72.04%)
BooksBooks worth spreading
Stars: ✭ 161 (-23.7%)
Domain discovery toolThis repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better understand a domain (or topic) as it is represented on the Web.
Stars: ✭ 33 (-84.36%)
Dan Jurafsky Chris Manning NlpMy solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.
Stars: ✭ 124 (-41.23%)
Date InfoAPI to let user fetch the events that happen(ed) on a specific date
Stars: ✭ 7 (-96.68%)
Vec4irWord Embeddings for Information Retrieval
Stars: ✭ 188 (-10.9%)
Pytrec evalpytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.
Stars: ✭ 114 (-45.97%)
Deep Semantic Similarity ModelMy Keras implementation of the Deep Semantic Similarity Model (DSSM)/Convolutional Latent Semantic Model (CLSM) described here: http://research.microsoft.com/pubs/226585/cikm2014_cdssm_final.pdf.
Stars: ✭ 509 (+141.23%)
Telegram Scrapertelegram group scraper tool. fetch all information about group members
Stars: ✭ 450 (+113.27%)
FlexneuartFlexible classic and NeurAl Retrieval Toolkit
Stars: ✭ 99 (-53.08%)
SolrpluginsDice Solr Plugins from Simon Hughes Dice.com
Stars: ✭ 86 (-59.24%)
Ip TracerTrack any ip address with IP-Tracer. IP-Tracer is developed for Linux and Termux. you can retrieve any ip address information using IP-Tracer.
Stars: ✭ 399 (+89.1%)
InvoicenetDeep neural network to extract intelligent information from invoice documents.
Stars: ✭ 1,886 (+793.84%)
Textrank Keyword ExtractionKeyword extraction using TextRank algorithm after pre-processing the text with lemmatization, filtering unwanted parts-of-speech and other techniques.
Stars: ✭ 79 (-62.56%)
RankingLearning to Rank in TensorFlow
Stars: ✭ 2,362 (+1019.43%)
Wordtokenizers.jlHigh performance tokenizers for natural language processing and other related tasks
Stars: ✭ 63 (-70.14%)
EasyocrReady-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Stars: ✭ 13,379 (+6240.76%)
FreediscoveryWeb Service for E-Discovery Analytics
Stars: ✭ 59 (-72.04%)
OpenmatchAn Open-Source Package for Information Retrieval.
Stars: ✭ 186 (-11.85%)
ScdvText classification with Sparse Composite Document Vectors.
Stars: ✭ 54 (-74.41%)
FoundryThe Cognitive Foundry is an open-source Java library for building intelligent systems using machine learning
Stars: ✭ 124 (-41.23%)
NprfNPRF: A Neural Pseudo Relevance Feedback Framework for Ad-hoc Information Retrieval
Stars: ✭ 31 (-85.31%)
Bm25A Python implementation of the BM25 ranking function.
Stars: ✭ 159 (-24.64%)
Knowledge GraphsA collection of research on knowledge graphs
Stars: ✭ 845 (+300.47%)
Drl4nlp.scratchpadNotes on Deep Reinforcement Learning for Natural Language Processing papers
Stars: ✭ 26 (-87.68%)
HdltexHDLTex: Hierarchical Deep Learning for Text Classification
Stars: ✭ 191 (-9.48%)
RelevancyfeedbackDice.com's relevancy feedback solr plugin created by Simon Hughes (Dice). Contains request handlers for doing MLT style recommendations, conceptual search, semantic search and personalized search
Stars: ✭ 19 (-91%)
Scilla🏴☠️ Information Gathering tool 🏴☠️ DNS / Subdomains / Ports / Directories enumeration
Stars: ✭ 116 (-45.02%)
TalismanStraightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Stars: ✭ 584 (+176.78%)
GensimTopic Modelling for Humans
Stars: ✭ 12,763 (+5948.82%)
ResinHardware-accelerated vector-based search engine. Available as a HTTP service or as an embedded library.
Stars: ✭ 529 (+150.71%)
VtextSimple NLP in Rust with Python bindings
Stars: ✭ 108 (-48.82%)
Cdqa⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.
Stars: ✭ 500 (+136.97%)
NeuralqaNeuralQA: A Usable Library for Question Answering on Large Datasets with BERT
Stars: ✭ 185 (-12.32%)
Awesome Persian Nlp IrCurated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
Stars: ✭ 460 (+118.01%)
SertSemantic Entity Retrieval Toolkit
Stars: ✭ 100 (-52.61%)
Lucene SolrApache Lucene and Solr open-source search software
Stars: ✭ 4,217 (+1898.58%)
PyseriniPython interface to the Anserini IR toolkit built on Lucene
Stars: ✭ 148 (-29.86%)
PwnbackBurp Extender plugin that generates a sitemap of a website using Wayback Machine
Stars: ✭ 203 (-3.79%)
Rank bm25A Collection of BM25 Algorithms in Python
Stars: ✭ 187 (-11.37%)
K NrmK-NRM: End-to-End Neural Ad-hoc Ranking with Kernel Pooling
Stars: ✭ 183 (-13.27%)
Tutorial Utilizing KgResources for Tutorial on "Utilizing Knowledge Graphs in Text-centric Information Retrieval"
Stars: ✭ 148 (-29.86%)
ForteForte is a flexible and powerful NLP builder FOR TExt. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 89 (-57.82%)