flow-indexer: Flow-Indexer indexes flows found in chunked log files from bro, nfdump, syslog, or pcap files
Stars: ✭ 43 (+207.14%)
hohser: Highlight or Hide Search Engine Results
Stars: ✭ 89 (+535.71%)
iresearch: IResearch is a cross-platform, high-performance, document-oriented search engine library written entirely in C++, with a focus on pluggability of different ranking/similarity models
Stars: ✭ 121 (+764.29%)
evildork: Evildork targeting your fiancee👁️
Stars: ✭ 46 (+228.57%)
PoetryCorpus: Poetic corpus of the Russian language
Stars: ✭ 40 (+185.71%)
openverse-api: The Openverse API allows programmatic access to search for CC-licensed and public domain digital media.
Stars: ✭ 41 (+192.86%)
Horizon: A ZeroNet search engine
Stars: ✭ 15 (+7.14%)
james: Fast and extendable modern launcher for Windows
Stars: ✭ 32 (+128.57%)
gosearch: A fast, real-time file searching program for Linux
Stars: ✭ 68 (+385.71%)
Free-Internet-Plugin: A free Internet is a better Internet. This Chrome browser plugin removes paywalled content from Google search results.
Stars: ✭ 121 (+764.29%)
odcrawler-frontend: A frontend for ODCrawler, an Open Directory search engine.
Stars: ✭ 20 (+42.86%)
OpenDialog: An Open-Source Package for Chinese Open-domain Conversational Chatbot (a Chinese chit-chat dialogue system with one-click deployment of a WeChat chit-chat bot)
Stars: ✭ 94 (+571.43%)
indexer4j: Simple full text indexing and searching library for Java
Stars: ✭ 47 (+235.71%)
imsearch: Framework to build your own reverse image search engine
Stars: ✭ 64 (+357.14%)
app-search-flask-app: This is an example of a Python Flask app with Elasticsearch/Elastic App Search and their respective Python clients
Stars: ✭ 17 (+21.43%)
lafzi-web: Web interface for Lafzi, a search engine for lafadz (spoken wording) in the Al-Quran
Stars: ✭ 25 (+78.57%)
thai-language: Computer tools for the Thai language
Stars: ✭ 20 (+42.86%)
PeARS-orchard: This is the decentralised version of PeARS, the people's search engine, to be taken as Phase 1 of the fully distributed system.
Stars: ✭ 34 (+142.86%)
cljs-corpus: A greppable archive of ClojureScript code
Stars: ✭ 37 (+164.29%)
vim-www: Toolbox to open & search URLs from vim
Stars: ✭ 32 (+128.57%)
google-this: 🔎 A simple yet powerful module to retrieve organic search results and much more from Google.
Stars: ✭ 88 (+528.57%)
Search Ads Web Service: Online search advertisement platform & Realtime Campaign Monitoring [Maybe Deprecated]
Stars: ✭ 30 (+114.29%)
KWDLC: Kyoto University Web Document Leads Corpus
Stars: ✭ 64 (+357.14%)
elliotforwater.com: Webapp which runs the https://elliotforwater.com/ website
Stars: ✭ 15 (+7.14%)
hsploit: An advanced command-line search engine for Exploit-DB
Stars: ✭ 16 (+14.29%)
openverse-catalog: Identifies and collects data on CC-licensed content across web crawl data and public APIs.
Stars: ✭ 27 (+92.86%)
starter: Create a vertical search web application in minutes with a generator (based on ItemsAPI)
Stars: ✭ 21 (+50%)
ISeeNN: A CNN feature based image retrieval website
Stars: ✭ 15 (+7.14%)
bible-corpus: A multilingual parallel corpus created from translations of the Bible.
Stars: ✭ 115 (+721.43%)
lupyne: Pythonic search engine based on PyLucene.
Stars: ✭ 61 (+335.71%)
Filipino-Text-Benchmarks: Open-source benchmark datasets and pretrained transformer models in the Filipino language.
Stars: ✭ 22 (+57.14%)
code-compass: A contextual search engine for software packages built on import2vec embeddings (https://www.code-compass.com)
Stars: ✭ 33 (+135.71%)
bing-ip2hosts: bingip2hosts is a Bing.com web scraper that discovers websites by IP address
Stars: ✭ 99 (+607.14%)
Sitemap: Bolt Sitemap extension - create XML sitemaps for your Bolt website.
Stars: ✭ 19 (+35.71%)
folia: FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for proces…
Stars: ✭ 56 (+300%)
lucene: Apache Lucene open-source search software
Stars: ✭ 1,009 (+7107.14%)
sonar-tantivy: Search engine based on tantivy with a Node.js frontend
Stars: ✭ 30 (+114.29%)
domhttpx: domhttpx is a Google search engine dorker with an HTTP toolkit, built with Python, that makes it easier to find many URLs/IPs at once and quickly.
Stars: ✭ 59 (+321.43%)
collector-filesystem: Norconex Filesystem Collector is a flexible crawler for collecting, parsing, and manipulating data ranging from local hard drives to network locations into various data repositories such as search engines.
Stars: ✭ 17 (+21.43%)
nlp-lt: Natural Language Processing for the Lithuanian language
Stars: ✭ 17 (+21.43%)
dialogue-datasets: A collection of open dialog corpora and some useful data processing utilities.
Stars: ✭ 24 (+71.43%)
BDExamenes: Database of exams from the ETSIIT
Stars: ✭ 23 (+64.29%)
gosearch: Web crawler and Search engine in Golang.
Stars: ✭ 19 (+35.71%)
SpiCE-Corpus: An open-access corpus of conversational bilingual speech in Cantonese and English
Stars: ✭ 33 (+135.71%)
see: Search Engine in Erlang
Stars: ✭ 27 (+92.86%)
open-discourse: Open Discourse is the first fully comprehensive corpus of the plenary proceedings of the federal German Parliament (Bundestag).
Stars: ✭ 47 (+235.71%)
DeepSentiPers: Repository for the experiments described in the paper "DeepSentiPers: Novel Deep Learning Models Trained Over Proposed Augmented Persian Sentiment Corpus"
Stars: ✭ 17 (+21.43%)
JASSv2: Experimental search engine in C/C++17 - still in early development.
Stars: ✭ 22 (+57.14%)
bitshift: A semantic search engine for source code
Stars: ✭ 30 (+114.29%)