discoverygarden / basic-solr-config

Licence: other
A starting point for solr schema, config and xslt.

This repo holds a basic Solr config, schema and XSLT to use as a starting point for future projects.

It now depends on the discoverygarden GSearch extensions, which include the Joda-Time library.

To index Drupal content and users, process the conf/data-import-config.xml.erb template into conf/data-import-config.xml. The template takes three parameters:

  • drupal_dbname
  • drupal_db_username
  • drupal_db_password
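
As a rough illustration of that templating step, here is a minimal Python sketch. It assumes the template refers to its parameters through plain ERB output tags such as `<%= drupal_dbname %>`; that tag style, the `render_erb_like` helper, and the sample fragment are hypothetical and are not taken from the real conf/data-import-config.xml.erb. Values are XML-escaped so that a password containing `&` or `"` still yields well-formed XML, which plain ERB would not do by itself.

```python
import re
from xml.sax.saxutils import escape

# Matches simple ERB output tags: <%= name %> or <%= @name %>.
# Assumption: the real template uses nothing more complex than this.
ERB_TAG = re.compile(r"<%=\s*@?(\w+)\s*%>")


def render_erb_like(template: str, params: dict) -> str:
    """Replace each simple ERB output tag with its XML-escaped value."""

    def substitute(match):
        name = match.group(1)
        if name not in params:
            raise KeyError(f"missing template parameter: {name}")
        # Escape &, <, > and double quotes so attribute values stay valid.
        return escape(str(params[name]), {'"': "&quot;"})

    return ERB_TAG.sub(substitute, template)


# Made-up fragment for demonstration only, not the real template contents.
sample = (
    '<dataSource url="jdbc:mysql://localhost/<%= drupal_dbname %>" '
    'user="<%= drupal_db_username %>" '
    'password="<%= drupal_db_password %>"/>'
)

print(
    render_erb_like(
        sample,
        {
            "drupal_dbname": "drupal",
            "drupal_db_username": "drupal_user",
            "drupal_db_password": "change-me",
        },
    )
)
```

The .erb extension suggests the file is meant to be rendered with Ruby's ERB, typically by a deployment tool; the sketch above only stands in for that step and will not handle Ruby logic inside the template.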

General Installation

See the wiki page for installation details.

Custom Parameters

As of version 2.9.0 of our GSearch fork, an additional custom_parameters.properties file may be placed beside the foxmlToSolr.xslt file (or whatever the "top-level" XSLT is named when deployed).

| Parameter | Default | Description |
| --- | --- | --- |
| index_ancestors | false | Boolean flag: true to produce an ancestors_ms field (as used by the islandora_collection_search module); false to avoid generating it. Note: when objects are migrated between collections, all descendants must be updated so that their lists of ancestors reflect the current state. We do this in the islandora_collection_search module when migrating, instead of reindexing all the descendants whenever an object is indexed (updating a collection label would be fairly expensive if we blindly reindexed). |
| index_ancestors_models | false | Boolean flag: true to produce an ancestors_models_ms field (as used by the islandora_child_filter module); false to avoid generating it. NOTE: A triplestore with SPARQL 1.1 support is required for the query to work. |
| maintain_dataset_latest_version_flag | false | Boolean flag: true to produce a mmv_is_latest_b field, as used by the islandora_research_data module; however, we are not given the opportunity to promote another version should the latest be purged (as similarly described in the README.md for islandora_research_data). NOTE: Due to how this is calculated, it requires a resource index (RI) which supports SPARQL 1.1 (such as Blazegraph). |
| index_compound_sequence | true | Boolean flag: false to remove RELS_EXT_isSequenceNumberOf<pid>_literal* and RELS_EXT_http://islandora.ca/ontology/relsext#isSequenceNumberOf<pid>_literal_* fields, as used by the islandora_solr_table_of_contents module. NOTE: On large repositories these fields are known to cause significant performance issues with the Luke Request Handler, because multiple Solr fields are created for each compound relationship. |
| index_checksums | false | Boolean flag: true to produce fields used by the islandora_checksum_duplicate_files module; false to avoid generating them. NOTE: Checksums must be enabled either at the repository level or via the islandora_checksum module for this to have any effect. |
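
To make the parameters above more concrete, the following Python sketch generates a custom_parameters.properties file from the documented defaults plus any overrides. It assumes the file uses the usual Java properties `key=value` layout with lowercase `true`/`false` values; the README does not show the file's syntax, so treat that layout and the `build_properties` and `write_properties` helpers as hypothetical.

```python
from pathlib import Path

# Defaults as listed in the parameter descriptions above.
DEFAULTS = {
    "index_ancestors": False,
    "index_ancestors_models": False,
    "maintain_dataset_latest_version_flag": False,
    "index_compound_sequence": True,
    "index_checksums": False,
}


def build_properties(overrides=None) -> str:
    """Return the file contents, one key=value line per parameter.

    Assumption: Java properties syntax with lowercase true/false.
    """
    overrides = overrides or {}
    unknown = sorted(set(overrides) - set(DEFAULTS))
    if unknown:
        raise ValueError(f"unknown parameter(s): {', '.join(unknown)}")
    merged = {**DEFAULTS, **overrides}
    return "".join(
        f"{name}={'true' if value else 'false'}\n"
        for name, value in merged.items()
    )


def write_properties(xslt_dir, overrides=None) -> Path:
    """Write custom_parameters.properties beside the top-level XSLT."""
    target = Path(xslt_dir) / "custom_parameters.properties"
    target.write_text(build_properties(overrides), encoding="utf-8")
    return target


# Example: enable ancestor indexing and leave everything else at its default.
print(build_properties({"index_ancestors": True}))
```

Rejecting unknown keys is a deliberate choice in this sketch: a misspelled parameter name would otherwise be written out and have no visible effect.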