Lucene SolrApache Lucene and Solr open-source search software
Stars: ✭ 4,217 (+1064.92%)
ResinHardware-accelerated vector-based search engine. Available as a HTTP service or as an embedded library.
Stars: ✭ 529 (+46.13%)
bigdata-funA complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-96.13%)
VectorsinsearchDice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Searching with Vectors' talk from Haystack 2019 (US). Builds upon my conceptual search and semantic search work from 2015
Stars: ✭ 71 (-80.39%)
RelevancyfeedbackDice.com's relevancy feedback solr plugin created by Simon Hughes (Dice). Contains request handlers for doing MLT style recommendations, conceptual search, semantic search and personalized search
Stars: ✭ 19 (-94.75%)
ConceptualsearchTrain a Word2Vec model or LSA model, and Implement Conceptual Search\Semantic Search in Solr\Lucene - Simon Hughes Dice.com, Dice Tech Jobs
Stars: ✭ 245 (-32.32%)
Rated Ranking EvaluatorSearch Quality Evaluation Tool for Apache Solr & Elasticsearch search-based infrastructures
Stars: ✭ 134 (-62.98%)
LogislandScalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-73.2%)
Haystack🔍 Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.
Stars: ✭ 3,409 (+841.71%)
solrApache Solr open-source search software
Stars: ✭ 651 (+79.83%)
PisaPISA: Performant Indexes and Search for Academia
Stars: ✭ 489 (+35.08%)
Awesome SolrA curated list of Awesome Apache Solr links and resources.
Stars: ✭ 69 (-80.94%)
Downloadsearchsearch for any kinds of files to download
Stars: ✭ 124 (-65.75%)
Instantsearch AndroidA library of widgets and helpers to build instant-search applications on Android.
Stars: ✭ 129 (-64.36%)
MinsqlHigh-performance log search engine.
Stars: ✭ 356 (-1.66%)
Awesome Deep Learning Papers For Search Recommendation AdvertisingAwesome Deep Learning papers for industrial Search, Recommendation and Advertising. They focus on Embedding, Matching, Ranking (CTR prediction, CVR prediction), Post Ranking, Transfer, Reinforcement Learning, Self-supervised Learning and so on.
Stars: ✭ 136 (-62.43%)
Sonic🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
Stars: ✭ 12,347 (+3310.77%)
KeyviKeyvi - the key value index. It is an in-memory FST-based data structure highly optimized for size and lookup performance.
Stars: ✭ 161 (-55.52%)
RusticsearchLightweight Elasticsearch compatible search server.
Stars: ✭ 171 (-52.76%)
ScoutRESTful search server written in Python, powered by SQLite.
Stars: ✭ 213 (-41.16%)
Search Engine ParserLightweight package to query popular search engines and scrape for result titles, links and descriptions
Stars: ✭ 216 (-40.33%)
patzillaPatZilla is a modular patent information research platform and data integration toolkit with a modern user interface and access to multiple data sources.
Stars: ✭ 71 (-80.39%)
KeyviKeyvi - a key value index that powers Cliqz search engine. It is an in-memory FST-based data structure highly optimized for size and lookup performance.
Stars: ✭ 171 (-52.76%)
query-wellformedness25,100 queries from the Paralex corpus (Fader et al., 2013) annotated with human ratings of whether they are well-formed natural language questions.
Stars: ✭ 80 (-77.9%)
xCommerce Search & Discovery frontend web components
Stars: ✭ 54 (-85.08%)
VespaThe open big data serving engine. https://vespa.ai
Stars: ✭ 3,747 (+935.08%)
Whoogle SearchA self-hosted, ad-free, privacy-respecting metasearch engine
Stars: ✭ 4,645 (+1183.15%)
ElassandraElassandra = Elasticsearch + Apache Cassandra
Stars: ✭ 1,610 (+344.75%)
Ext SolrA TYPO3 extension that integrates the Apache Solr search server with TYPO3 CMS. dkd Internet Service GmbH is developing the extension. Community contributions are welcome. See CONTRIBUTING.md for details.
Stars: ✭ 118 (-67.4%)
Ambar🔍 Ambar: Document Search Engine
Stars: ✭ 1,829 (+405.25%)
SearchAn Open Source Search Engine
Stars: ✭ 139 (-61.6%)
Query TranslatorQuery Translator is a search query translator with AST representation
Stars: ✭ 165 (-54.42%)
openverse-catalogIdentifies and collects data on cc-licensed content across web crawl data and public apis.
Stars: ✭ 27 (-92.54%)
TntsearchA fully featured full text search engine written in PHP
Stars: ✭ 2,693 (+643.92%)
VectoraiVector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.
Stars: ✭ 195 (-46.13%)
TrinityTrinity IR Infrastructure
Stars: ✭ 227 (-37.29%)
Lolcate RsLolcate -- A comically fast way of indexing and querying your filesystem. Replaces locate / mlocate / updatedb. Written in Rust.
Stars: ✭ 191 (-47.24%)
shiftingA privacy-focused list of alternatives to mainstream services to help the competition.
Stars: ✭ 31 (-91.44%)
Search Ads Web ServiceOnline search advertisement platform & Realtime Campaign Monitoring [Maybe Deprecated]
Stars: ✭ 30 (-91.71%)
leaflet heatmap简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
Stars: ✭ 13 (-96.41%)
nebulaA distributed, fast open-source graph database featuring horizontal scalability and high availability
Stars: ✭ 8,196 (+2164.09%)
HypertagKnowledge Management for Humans using Machine Learning & Tags
Stars: ✭ 116 (-67.96%)
awesome-AI-kubernetes❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++ etc
Stars: ✭ 95 (-73.76%)
seeSearch Engine in Erlang
Stars: ✭ 27 (-92.54%)
spark-acidACID Data Source for Apache Spark based on Hive ACID
Stars: ✭ 91 (-74.86%)
evildorkEvildork targeting your fiancee👁️
Stars: ✭ 46 (-87.29%)
SolrConfigExamplesExamples of Solr configuration entries for Solr plugins and Conceptual Search\Semantic Search from Simon Hughes Dice.com
Stars: ✭ 26 (-92.82%)
autThe Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (-69.34%)
luceneApache Lucene open-source search software
Stars: ✭ 1,009 (+178.73%)
RelevancyTuningDice.com tutorial on using black box optimization algorithms to do relevancy tuning on your Solr Search Engine Configuration from Simon Hughes Dice.com
Stars: ✭ 28 (-92.27%)
indieweb-searchSource code for the IndieWeb search engine.
Stars: ✭ 16 (-95.58%)
SuccinctEnabling queries on compressed data.
Stars: ✭ 257 (-29.01%)
Go CyberYour 🔵 Superintelligence
Stars: ✭ 270 (-25.41%)
RedisearchA query and indexing engine for Redis, providing secondary indexing, full-text search, and aggregations.
Stars: ✭ 3,393 (+837.29%)
BitfunnelA signature-based search engine
Stars: ✭ 313 (-13.54%)
Searchcode ServerThe offical home of searchcode-server where you can run searchcode locally. Note that master is generally unstable in the sense that it is not a release. Check releases for release versions https://github.com/boyter/searchcode-server/releases
Stars: ✭ 262 (-27.62%)
TrinoOfficial repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+1165.47%)