JivesearchA search engine that doesn't track you.
Stars: ✭ 364 (-80.1%)
MaryamMaryam: Open-source Intelligence(OSINT) Framework
Stars: ✭ 371 (-79.72%)
Lieucommunity search engine
Stars: ✭ 76 (-95.84%)
RemarksExtract highlights, scribbles, and annotations from PDFs marked with the reMarkable tablet. Export to Markdown, PDF, PNG, and SVG
Stars: ✭ 94 (-94.86%)
Darksearch🔍 Search engine for hidden material. Scraping dark web onions, irc logs, deep web etc...
Stars: ✭ 260 (-85.78%)
OcrmypdfOCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Stars: ✭ 5,549 (+203.39%)
FessFess is very powerful and easily deployable Enterprise Search Server.
Stars: ✭ 561 (-69.33%)
MyboxEasy tools of document, image, file, network, location, color, and media.
Stars: ✭ 45 (-97.54%)
MagneticoAutonomous (self-hosted) BitTorrent DHT search engine suite.
Stars: ✭ 2,626 (+43.58%)
Open Semantic EtlPython based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
Stars: ✭ 165 (-90.98%)
PdftabextractA set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
Stars: ✭ 1,969 (+7.65%)
Open PaperlessScan, index, and archive all of your paper documents (acquired by Mayan EDMS)
Stars: ✭ 2,538 (+38.76%)
Lambda Text ExtractorAWS Lambda functions to extract text from various binary formats.
Stars: ✭ 159 (-91.31%)
Lolcate RsLolcate -- A comically fast way of indexing and querying your filesystem. Replaces locate / mlocate / updatedb. Written in Rust.
Stars: ✭ 191 (-89.56%)
RusticsearchLightweight Elasticsearch compatible search server.
Stars: ✭ 171 (-90.65%)
TntsearchA fully featured full text search engine written in PHP
Stars: ✭ 2,693 (+47.24%)
BlastBlast is a full text search and indexing server, written in Go, built on top of Bleve.
Stars: ✭ 934 (-48.93%)
Whoogle SearchA self-hosted, ad-free, privacy-respecting metasearch engine
Stars: ✭ 4,645 (+153.96%)
MagnetissimoWeb application that indexes all popular torrent sites, and saves it to the local database.
Stars: ✭ 2,551 (+39.48%)
PdfocrAdds text to PDF files using the cuneiform OCR software
Stars: ✭ 287 (-84.31%)
Search Ui🔍 A set of UI components to build a fully customized search!
Stars: ✭ 24 (-98.69%)
SearxPrivacy-respecting metasearch engine
Stars: ✭ 10,074 (+450.79%)
SimpleaudioindexerSearching for the occurrence seconds of words/phrases or arbitrary regex patterns within audio files
Stars: ✭ 100 (-94.53%)
Search Online🔍A simple extension for VSCode to search online easily using search engine.
Stars: ✭ 115 (-93.71%)
RapipdfPDF generation from OpenAPI / Swagger Spec
Stars: ✭ 132 (-92.78%)
FfindA sane replacement for find
Stars: ✭ 124 (-93.22%)
PinryThe open-source core of Pinry, a tiling image board system for people who want to save, tag, and share images, videos and webpages in an easy to skim through format.
Stars: ✭ 1,819 (-0.55%)
Transformer strPyTorch implementation of my new method for Scene Text Recognition (STR) based on Transformer,Equipped with Transformer, this method outperforms the best model of the aforementioned deep-text-recognition-benchmark by 7.6% on CUTE80.
Stars: ✭ 131 (-92.84%)
LucenenetApache Lucene.NET
Stars: ✭ 1,704 (-6.83%)
Search Engine Optimization🔍 A helpful checklist/collection of Search Engine Optimization (SEO) tips and techniques.
Stars: ✭ 1,798 (-1.69%)
Ptext ReleasepText is a library for reading, creating and manipulating PDF files in python.
Stars: ✭ 124 (-93.22%)
Alfred OcrOCR & Translate using multiple interfaces for Alfred Workflow
Stars: ✭ 136 (-92.56%)
Pdfview AndroidSmall Android library to show PDF files
Stars: ✭ 132 (-92.78%)
The Economist Ebooks经济学人(含音频)、纽约客、自然、新科学人、卫报、科学美国人、连线、大西洋月刊、新闻周刊、国家地理等英语杂志免费下载、订阅(kindle推送),支持epub、mobi、pdf格式, 每周更新. The Economist 、The New Yorker 、Nature、The Atlantic 、New Scientist、The Guardian、Scientific American、Wired、Newsweek magazines, free download and subscription for kindle, mobi、epub、pdf format.
Stars: ✭ 3,471 (+89.78%)
QuerqyQuery preprocessor for Java-based search engines (Querqy Core and Solr implementation)
Stars: ✭ 122 (-93.33%)
Documents收集的程序开发相关的书籍与文档,多数为 PDF 格式文件,欢迎 fork 和 star。
Stars: ✭ 130 (-92.89%)
Pdf2imageA utility for converting pdf to image and base64 format.
Stars: ✭ 122 (-93.33%)
Endesiveen-crypt, de-crypt, si-gn, ve-rify - smime, pdf, xades and plain files in pure python
Stars: ✭ 122 (-93.33%)
Vue Innersearch🔎 UI components built with Vue.js for ElasticSearch
Stars: ✭ 135 (-92.62%)
Algoliasearch Magento 2Algolia Search integration for Magento 2 - compatible with versions from 2.3.x to 2.4.x
Stars: ✭ 131 (-92.84%)
TypefontThe first open-source library that detects the font of a text in a image.
Stars: ✭ 1,575 (-13.89%)
PdfboxingNice wrapper of PDFBox in Clojure
Stars: ✭ 122 (-93.33%)
SpacextractExtraction and analysis of telemetry from rocket launch webcasts (from SpaceX and RocketLab)
Stars: ✭ 131 (-92.84%)
RobinRObust document image BINarization
Stars: ✭ 131 (-92.84%)
Collector HttpNorconex HTTP Collector is a flexible web crawler for collecting, parsing, and manipulating data from the Internet (or Intranet) to various data repositories such as search engines.
Stars: ✭ 130 (-92.89%)
Trienet.NET Implementations of Trie Data Structures for Substring Search, Auto-completion and Intelli-sense. Includes: patricia trie, suffix trie and a trie implementation using Ukkonen's algorithm.
Stars: ✭ 122 (-93.33%)
EasyadapterRecyclerview adapter library- Create adapter in just 3 lines of code
Stars: ✭ 122 (-93.33%)
EasyocrReady-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Stars: ✭ 13,379 (+631.49%)