VectorsinsearchDice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Searching with Vectors' talk from Haystack 2019 (US). Builds upon my conceptual search and semantic search work from 2015
Stars: ✭ 71 (+273.68%)
ConceptualsearchTrain a Word2Vec model or LSA model, and Implement Conceptual Search\Semantic Search in Solr\Lucene - Simon Hughes Dice.com, Dice Tech Jobs
Stars: ✭ 245 (+1189.47%)
SparklerSpark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Stars: ✭ 362 (+1805.26%)
Lucene SolrApache Lucene and Solr open-source search software
Stars: ✭ 4,217 (+22094.74%)
solrApache Solr open-source search software
Stars: ✭ 651 (+3326.32%)
PisaPISA: Performant Indexes and Search for Academia
Stars: ✭ 489 (+2473.68%)
Sf1r LiteSearch Formula-1——A distributed high performance massive data engine for enterprise/vertical search
Stars: ✭ 158 (+731.58%)
Awesome SolrA curated list of Awesome Apache Solr links and resources.
Stars: ✭ 69 (+263.16%)
Tis Solran enterprise search engine base on Apache Solr
Stars: ✭ 158 (+731.58%)
xCommerce Search & Discovery frontend web components
Stars: ✭ 54 (+184.21%)
QuerqyQuery preprocessor for Java-based search engines (Querqy Core and Solr implementation)
Stars: ✭ 122 (+542.11%)
AquiladbDrop in solution for Decentralized Neural Information Retrieval. Index latent vectors along with JSON metadata and do efficient k-NN search.
Stars: ✭ 222 (+1068.42%)
patzillaPatZilla is a modular patent information research platform and data integration toolkit with a modern user interface and access to multiple data sources.
Stars: ✭ 71 (+273.68%)
seeSearch Engine in Erlang
Stars: ✭ 27 (+42.11%)
Ik Analyzer支持Lucene5/6/7/8+版本, 长期维护。
Stars: ✭ 112 (+489.47%)
Haystack🔍 Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.
Stars: ✭ 3,409 (+17842.11%)
SolrpluginsDice Solr Plugins from Simon Hughes Dice.com
Stars: ✭ 86 (+352.63%)
SolrConfigExamplesExamples of Solr configuration entries for Solr plugins and Conceptual Search\Semantic Search from Simon Hughes Dice.com
Stars: ✭ 26 (+36.84%)
ResinHardware-accelerated vector-based search engine. Available as a HTTP service or as an embedded library.
Stars: ✭ 529 (+2684.21%)
SrchxA standalone lightweight full-text search engine built on top of blevesearch and Go with multiple storage (scorch, boltdb, leveldb, badger)
Stars: ✭ 118 (+521.05%)
Bm25A Python implementation of the BM25 ranking function.
Stars: ✭ 159 (+736.84%)
Rated Ranking EvaluatorSearch Quality Evaluation Tool for Apache Solr & Elasticsearch search-based infrastructures
Stars: ✭ 134 (+605.26%)
luceneApache Lucene open-source search software
Stars: ✭ 1,009 (+5210.53%)
evildorkEvildork targeting your fiancee👁️
Stars: ✭ 46 (+142.11%)
query-wellformedness25,100 queries from the Paralex corpus (Fader et al., 2013) annotated with human ratings of whether they are well-formed natural language questions.
Stars: ✭ 80 (+321.05%)
RelevancyTuningDice.com tutorial on using black box optimization algorithms to do relevancy tuning on your Solr Search Engine Configuration from Simon Hughes Dice.com
Stars: ✭ 28 (+47.37%)
Search EngineA math-aware search engine.
Stars: ✭ 278 (+1363.16%)
Taoshop开源电子商务项目,SpringBoot+Dubbo技术栈实现微服务,实现一款分布式集群的电商系统. 项目releases链接:https://github.com/u014427391/taoshop/releases (开发中...)
Stars: ✭ 491 (+2484.21%)
PysolrPysolr — Python Solr client
Stars: ✭ 582 (+2963.16%)
Awesome Privacy💡Limiting personal data leaks on the internet
Stars: ✭ 488 (+2468.42%)
Search copSearch engine like fulltext query support for ActiveRecord
Stars: ✭ 660 (+3373.68%)
AnseriniA Lucene toolkit for replicable information retrieval research
Stars: ✭ 573 (+2915.79%)
Pdf编程电子书,电子书,编程书籍,包括C,C#,Docker,Elasticsearch,Git,Hadoop,HeadFirst,Java,Javascript,jvm,Kafka,Linux,Maven,MongoDB,MyBatis,MySQL,Netty,Nginx,Python,RabbitMQ,Redis,Scala,Solr,Spark,Spring,SpringBoot,SpringCloud,TCPIP,Tomcat,Zookeeper,人工智能,大数据类,并发编程,数据库类,数据挖掘,新面试题,架构设计,算法系列,计算机类,设计模式,软件测试,重构优化,等更多分类
Stars: ✭ 12,009 (+63105.26%)
FlycmsFlyCms 是一个类似知乎以问答为基础的完全开源的JAVA语言开发的社交网络建站程序,基于 Spring Boot+Bootstrap3+MyBatis+MySql+Solr +Ehcache应用架构,专注于社区内容的整理、归类和检索,它集合了问答,digg,wiki 等多个程序的优点,帮助用户轻松搭建专业的知识库和在线问答社区。业务模块包括:权限管理,会员管理,角色管理,定时任务管理(调度管理),问答管理,文章管理,分享管理,短信接口管理和邮件系统发送(注册、找回密码、邮件订阅),跨域登录,消息推送,全文检索、前端国际化等等众多模块,等您自己来体验!
Stars: ✭ 472 (+2384.21%)
FilemastaA search application to explore, discover and share online files
Stars: ✭ 571 (+2905.26%)
Awesome Persian Nlp IrCurated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
Stars: ✭ 460 (+2321.05%)
Telegram Scrapertelegram group scraper tool. fetch all information about group members
Stars: ✭ 450 (+2268.42%)
BertsearchElasticsearch with BERT for advanced document search.
Stars: ✭ 684 (+3500%)
ElasticsuiteSmile ElasticSuite - Magento 2 merchandising and search engine built on ElasticSearch
Stars: ✭ 647 (+3305.26%)
Algoliasearch Client Php⚡️ A fully-featured and blazing-fast PHP API client to interact with Algolia.
Stars: ✭ 565 (+2873.68%)
PickyPicky is an easy to use and fast Ruby semantic search engine that helps your users find what they are looking for.
Stars: ✭ 441 (+2221.05%)
FessFess is very powerful and easily deployable Enterprise Search Server.
Stars: ✭ 561 (+2852.63%)
SisSimple image search engine
Stars: ✭ 438 (+2205.26%)
Sequence Semantic EmbeddingTools and recipes to train deep learning models and build services for NLP tasks such as text classification, semantic search ranking and recall fetching, cross-lingual information retrieval, and question answering etc.
Stars: ✭ 435 (+2189.47%)
QuarkStay happy while offline | World's first offline search engine.
Stars: ✭ 561 (+2852.63%)
RsolrA Ruby client for Apache Solr
Stars: ✭ 416 (+2089.47%)
Spark SolrTools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ.
Stars: ✭ 411 (+2063.16%)
NboostNBoost is a scalable, search-api-boosting platform for deploying transformer models to improve the relevance of search results on different platforms (i.e. Elasticsearch)
Stars: ✭ 549 (+2789.47%)
Devops Python Tools80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (+2036.84%)
OpensearchserverOpen-source Enterprise Grade Search Engine Software
Stars: ✭ 408 (+2047.37%)
FunpyspidersearchengineWord2vec 千人千面 个性化搜索 + Scrapy2.3.0(爬取数据) + ElasticSearch7.9.1(存储数据并提供对外Restful API) + Django3.1.1 搜索
Stars: ✭ 782 (+4015.79%)
RiotGo Open Source, Distributed, Simple and efficient Search Engine; Warning: This is V1 and beta version, because of big memory consume, and the V2 will be rewrite all code.
Stars: ✭ 6,025 (+31610.53%)
MetaA Modern C++ Data Sciences Toolkit
Stars: ✭ 600 (+3057.89%)
Python Seo AnalyzerAn SEO tool that analyzes the structure of a site, crawls the site, count words in the body of the site and warns of any technical SEO issues.
Stars: ✭ 529 (+2684.21%)
JanusgraphJanusGraph: an open-source, distributed graph database
Stars: ✭ 4,277 (+22410.53%)
Ip TracerTrack any ip address with IP-Tracer. IP-Tracer is developed for Linux and Termux. you can retrieve any ip address information using IP-Tracer.
Stars: ✭ 399 (+2000%)
ParaOpen source back-end server for web, mobile and IoT. The backend for busy developers. (self-hosted or hosted)
Stars: ✭ 389 (+1947.37%)
Open Semantic SearchOpen Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)
Stars: ✭ 386 (+1931.58%)