AquiladbDrop in solution for Decentralized Neural Information Retrieval. Index latent vectors along with JSON metadata and do efficient k-NN search.
Stars: ✭ 222 (+39.62%)
query-wellformedness25,100 queries from the Paralex corpus (Fader et al., 2013) annotated with human ratings of whether they are well-formed natural language questions.
Stars: ✭ 80 (-49.69%)
Rated Ranking EvaluatorSearch Quality Evaluation Tool for Apache Solr & Elasticsearch search-based infrastructures
Stars: ✭ 134 (-15.72%)
Haystack🔍 Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.
Stars: ✭ 3,409 (+2044.03%)
solrApache Solr open-source search software
Stars: ✭ 651 (+309.43%)
PisaPISA: Performant Indexes and Search for Academia
Stars: ✭ 489 (+207.55%)
VectorsinsearchDice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Searching with Vectors' talk from Haystack 2019 (US). Builds upon my conceptual search and semantic search work from 2015
Stars: ✭ 71 (-55.35%)
seeSearch Engine in Erlang
Stars: ✭ 27 (-83.02%)
patzillaPatZilla is a modular patent information research platform and data integration toolkit with a modern user interface and access to multiple data sources.
Stars: ✭ 71 (-55.35%)
SparklerSpark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Stars: ✭ 362 (+127.67%)
ResinHardware-accelerated vector-based search engine. Available as a HTTP service or as an embedded library.
Stars: ✭ 529 (+232.7%)
evildorkEvildork targeting your fiancee👁️
Stars: ✭ 46 (-71.07%)
ConceptualsearchTrain a Word2Vec model or LSA model, and Implement Conceptual Search\Semantic Search in Solr\Lucene - Simon Hughes Dice.com, Dice Tech Jobs
Stars: ✭ 245 (+54.09%)
Lucene SolrApache Lucene and Solr open-source search software
Stars: ✭ 4,217 (+2552.2%)
luceneApache Lucene open-source search software
Stars: ✭ 1,009 (+534.59%)
RelevancyfeedbackDice.com's relevancy feedback solr plugin created by Simon Hughes (Dice). Contains request handlers for doing MLT style recommendations, conceptual search, semantic search and personalized search
Stars: ✭ 19 (-88.05%)
Sf1r LiteSearch Formula-1——A distributed high performance massive data engine for enterprise/vertical search
Stars: ✭ 158 (-0.63%)
EasyocrReady-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Stars: ✭ 13,379 (+8314.47%)
Dirty JsonA parser for invalid JSON
Stars: ✭ 141 (-11.32%)
Node CsvtojsonBlazing fast and Comprehensive CSV Parser for Node.JS / Browser / Command Line.
Stars: ✭ 1,760 (+1006.92%)
SlapPainless shell argument parsing and dependency check.
Stars: ✭ 130 (-18.24%)
SlangSystemVerilog compiler and language services
Stars: ✭ 145 (-8.81%)
Ambar🔍 Ambar: Document Search Engine
Stars: ✭ 1,829 (+1050.31%)
FoundryThe Cognitive Foundry is an open-source Java library for building intelligent systems using machine learning
Stars: ✭ 124 (-22.01%)
Olefileolefile is a Python package to parse, read and write Microsoft OLE2 files (also called Structured Storage, Compound File Binary Format or Compound Document File Format), such as Microsoft Office 97-2003 documents, vbaProject.bin in MS Office 2007+ files, Image Composer and FlashPix files, Outlook messages, StickyNotes, several Microscopy file formats, McAfee antivirus quarantine files, etc.
Stars: ✭ 142 (-10.69%)
Json AutotypeAutomatic Haskell type inference from JSON input
Stars: ✭ 139 (-12.58%)
Dan Jurafsky Chris Manning NlpMy solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.
Stars: ✭ 124 (-22.01%)
MdfreaderRead Measurement Data Format (MDF) versions 3.x and 4.x file formats in python
Stars: ✭ 131 (-17.61%)
Tutorial Utilizing KgResources for Tutorial on "Utilizing Knowledge Graphs in Text-centric Information Retrieval"
Stars: ✭ 148 (-6.92%)
Collector HttpNorconex HTTP Collector is a flexible web crawler for collecting, parsing, and manipulating data from the Internet (or Intranet) to various data repositories such as search engines.
Stars: ✭ 130 (-18.24%)
Awesome Deep Learning Papers For Search Recommendation AdvertisingAwesome Deep Learning papers for industrial Search, Recommendation and Advertising. They focus on Embedding, Matching, Ranking (CTR prediction, CVR prediction), Post Ranking, Transfer, Reinforcement Learning, Self-supervised Learning and so on.
Stars: ✭ 136 (-14.47%)
Instantsearch AndroidA library of widgets and helpers to build instant-search applications on Android.
Stars: ✭ 129 (-18.87%)
Swift Selection SearchSwift Selection Search (SSS) is a simple Firefox add-on that lets you quickly search for some text in a page using your favorite search engines.
Stars: ✭ 125 (-21.38%)
SearchAn Open Source Search Engine
Stars: ✭ 139 (-12.58%)
Downloadsearchsearch for any kinds of files to download
Stars: ✭ 124 (-22.01%)
TorrentinimA very low memory-footprint, self hosted API-only torrent search engine. Sonarr + Radarr Compatible, native support for Linux, Mac and Windows.
Stars: ✭ 123 (-22.64%)
QuerqyQuery preprocessor for Java-based search engines (Querqy Core and Solr implementation)
Stars: ✭ 122 (-23.27%)
Tis Solran enterprise search engine base on Apache Solr
Stars: ✭ 158 (-0.63%)
Html Agility PackHtml Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.
Stars: ✭ 2,014 (+1166.67%)
LibnmeaLightweight C library for parsing NMEA 0183 sentences
Stars: ✭ 146 (-8.18%)
PoseidonA search engine which can hold 100 trillion lines of log data.
Stars: ✭ 1,793 (+1027.67%)
Dato.rssThe best RSS Search experience you can find
Stars: ✭ 122 (-23.27%)
React Csv ReaderReact component that handles csv file input and its parsing
Stars: ✭ 138 (-13.21%)
Whoogle SearchA self-hosted, ad-free, privacy-respecting metasearch engine
Stars: ✭ 4,645 (+2821.38%)
SrchxA standalone lightweight full-text search engine built on top of blevesearch and Go with multiple storage (scorch, boltdb, leveldb, badger)
Stars: ✭ 118 (-25.79%)
DekuDeclarative binary reading and writing: bit-level, symmetric, serialization/deserialization
Stars: ✭ 136 (-14.47%)
WikimanWikiman is an offline search engine for manual pages, Arch Wiki, Gentoo Wiki and other documentation.
Stars: ✭ 117 (-26.42%)
Gray MatterContributing
Pull requests and stars are always welcome. For bugs and feature requests, please create an issue.
Stars: ✭ 2,105 (+1223.9%)
Parse EnglishEnglish (natural language) parser
Stars: ✭ 137 (-13.84%)
Laravel ParseA Parse SDK bridge for Laravel 5
Stars: ✭ 116 (-27.04%)
Xinahn Client一个开源,高隐私,自架自用的聚合搜索引擎。https://xinahn.com
Stars: ✭ 116 (-27.04%)
Bash ParserParses bash into an AST
Stars: ✭ 151 (-5.03%)
Genieparsersub-component of Genie that parse the device output into structured datastructure
Stars: ✭ 146 (-8.18%)