RelevancyTuning
Dice.com tutorial on using black box optimization algorithms to do relevancy tuning on your Solr Search Engine Configuration from Simon Hughes Dice.com
Stars: ✭ 28 (-54.84%)
Solrplugins
Dice Solr Plugins from Simon Hughes Dice.com
Stars: ✭ 86 (+38.71%)
Vectorsinsearch
Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Searching with Vectors' talk from Haystack 2019 (US). Builds upon my conceptual search and semantic search work from 2015
Stars: ✭ 71 (+14.52%)
SolrConfigExamples
Examples of Solr configuration entries for Solr plugins and Conceptual Search\Semantic Search from Simon Hughes Dice.com
Stars: ✭ 26 (-58.06%)
lucene
Apache Lucene open-source search software
Stars: ✭ 1,009 (+1527.42%)
Lucene Solr
Apache Lucene and Solr open-source search software
Stars: ✭ 4,217 (+6701.61%)
Anserini
A Lucene toolkit for replicable information retrieval research
Stars: ✭ 573 (+824.19%)
solr
Apache Solr open-source search software
Stars: ✭ 651 (+950%)
Bm25
A Python implementation of the BM25 ranking function.
Stars: ✭ 159 (+156.45%)
ComposeAE
Official code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval
Stars: ✭ 49 (-20.97%)
Pyserini
Python interface to the Anserini IR toolkit built on Lucene
Stars: ✭ 148 (+138.71%)
Ranking
Learning to Rank in TensorFlow
Stars: ✭ 2,362 (+3709.68%)
cloud-note
Wudao Cloud Notes, a Youdao Cloud Notes clone project built with native JSP
Stars: ✭ 66 (+6.45%)
Gensim
Topic Modelling for Humans
Stars: ✭ 12,763 (+20485.48%)
RedisDirectory
🔒 A simple Redis storage engine for Lucene - a Redis-based Lucene index storage engine - Star me if you like it!
Stars: ✭ 18 (-70.97%)
Conceptualsearch
Train a Word2Vec model or LSA model, and Implement Conceptual Search\Semantic Search in Solr\Lucene - Simon Hughes Dice.com, Dice Tech Jobs
Stars: ✭ 245 (+295.16%)
Invoicenet
Deep neural network to extract intelligent information from invoice documents.
Stars: ✭ 1,886 (+2941.94%)
Easyocr
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Stars: ✭ 13,379 (+21479.03%)
Foundry
The Cognitive Foundry is an open-source Java library for building intelligent systems using machine learning
Stars: ✭ 124 (+100%)
lqt
Lucene Query Tool
Stars: ✭ 19 (-69.35%)
query-wellformedness
25,100 queries from the Paralex corpus (Fader et al., 2013) annotated with human ratings of whether they are well-formed natural language questions.
Stars: ✭ 80 (+29.03%)
Catalyst
Accelerated deep learning R&D
Stars: ✭ 2,804 (+4422.58%)
Scilla
🏴‍☠️ Information Gathering tool 🏴‍☠️ DNS / Subdomains / Ports / Directories enumeration
Stars: ✭ 116 (+87.1%)
Ranknet
My (slightly modified) Keras implementation of RankNet and PyTorch implementation of LambdaRank.
Stars: ✭ 211 (+240.32%)
Vtext
Simple NLP in Rust with Python bindings
Stars: ✭ 108 (+74.19%)
K Nrm
K-NRM: End-to-End Neural Ad-hoc Ranking with Kernel Pooling
Stars: ✭ 183 (+195.16%)
IR-exercises
Solutions to the various test exams of the Information Retrieval course
Stars: ✭ 28 (-54.84%)
Books
Books worth spreading
Stars: ✭ 161 (+159.68%)
LogiEM
A zero-intrusion, multi-tenant Elasticsearch GUI management platform built around clusters and indices, for Elasticsearch developers and operations staff
Stars: ✭ 209 (+237.1%)
Sf1r Lite
Search Formula-1 - A distributed high performance massive data engine for enterprise/vertical search
Stars: ✭ 158 (+154.84%)
sigir19-neural-ir
Source code for: On the Effect of Low-Frequency Terms on Neural-IR Models, SIGIR'19
Stars: ✭ 44 (-29.03%)
ConvDR
Code repo for SIGIR 2021 paper "Few-Shot Conversational Dense Retrieval"
Stars: ✭ 36 (-41.94%)
Tutorial Utilizing Kg
Resources for Tutorial on "Utilizing Knowledge Graphs in Text-centric Information Retrieval"
Stars: ✭ 148 (+138.71%)
hermes
A library and microservice implementing the health and care terminology SNOMED CT with support for cross-maps, inference, fast full-text search, autocompletion, compositional grammar and the expression constraint language.
Stars: ✭ 131 (+111.29%)
lucene-demo
A full-text search demo built on lucene-5.5.4
Stars: ✭ 70 (+12.9%)
Rated Ranking Evaluator
Search Quality Evaluation Tool for Apache Solr & Elasticsearch search-based infrastructures
Stars: ✭ 134 (+116.13%)
Trinity
Trinity IR Infrastructure
Stars: ✭ 227 (+266.13%)
Dan Jurafsky Chris Manning Nlp
My solution to the Natural Language Processing course taught by Dan Jurafsky and Chris Manning in Winter 2012.
Stars: ✭ 124 (+100%)
Valley-eCommerce-prototype
An eCommerce website prototype with a layered architecture and MVC using Spring Boot v1.2, Spring Security, Hibernate, and Apache Lucene for full-text searching. Front end: Bootstrap, Typeahead.js and Graph.js, using Thymeleaf as RE.
Stars: ✭ 28 (-54.84%)
Haystack
🔍 Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.
Stars: ✭ 3,409 (+5398.39%)
Aquiladb
Drop-in solution for Decentralized Neural Information Retrieval. Index latent vectors along with JSON metadata and do efficient k-NN search.
Stars: ✭ 222 (+258.06%)
Pytrec eval
pytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.
Stars: ✭ 114 (+83.87%)
gpl
Powerful unsupervised domain adaptation method for dense retrieval. Requires only an unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577
Stars: ✭ 216 (+248.39%)
Pwnback
Burp Extender plugin that generates a sitemap of a website using Wayback Machine
Stars: ✭ 203 (+227.42%)
Ds2i
A library of inverted index data structures
Stars: ✭ 104 (+67.74%)
Sert
Semantic Entity Retrieval Toolkit
Stars: ✭ 100 (+61.29%)
Flexneuart
Flexible classic and NeurAl Retrieval Toolkit
Stars: ✭ 99 (+59.68%)
perke
A keyphrase extractor for Persian
Stars: ✭ 60 (-3.23%)
patzilla
PatZilla is a modular patent information research platform and data integration toolkit with a modern user interface and access to multiple data sources.
Stars: ✭ 71 (+14.52%)
Hdltex
HDLTex: Hierarchical Deep Learning for Text Classification
Stars: ✭ 191 (+208.06%)
Rank bm25
A Collection of BM25 Algorithms in Python
Stars: ✭ 187 (+201.61%)
Forte
Forte is a flexible and powerful NLP builder FOR TExt. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 89 (+43.55%)
Pyndri
pyndri is a Python interface to the Indri search engine.
Stars: ✭ 85 (+37.1%)
pqlite
⚡ A fast embedded library for approximate nearest neighbor search
Stars: ✭ 141 (+127.42%)
Openmatch
An Open-Source Package for Information Retrieval.
Stars: ✭ 186 (+200%)
Textrank Keyword Extraction
Keyword extraction using TextRank algorithm after pre-processing the text with lemmatization, filtering unwanted parts-of-speech and other techniques.
Stars: ✭ 79 (+27.42%)