shiftingA privacy-focused list of alternatives to mainstream services to help the competition.
Stars: ✭ 31 (-98.27%)
VespaThe open big data serving engine. https://vespa.ai
Stars: ✭ 3,747 (+108.98%)
pyparEfficient and scalable parallelism using the message passing interface (MPI) to handle big data and highly computational problems.
Stars: ✭ 66 (-96.32%)
SparklerSpark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Stars: ✭ 362 (-79.81%)
Griffon VmGriffon Data Science Virtual Machine
Stars: ✭ 128 (-92.86%)
SrchxA standalone lightweight full-text search engine built on top of blevesearch and Go with multiple storage (scorch, boltdb, leveldb, badger)
Stars: ✭ 118 (-93.42%)
HypertagKnowledge Management for Humans using Machine Learning & Tags
Stars: ✭ 116 (-93.53%)
AsakusafwAsakusa Framework
Stars: ✭ 114 (-93.64%)
HamaMirror of Apache Hama
Stars: ✭ 129 (-92.81%)
Downloadsearchsearch for any kinds of files to download
Stars: ✭ 124 (-93.08%)
Ik Analyzer支持Lucene5/6/7/8+版本, 长期维护。
Stars: ✭ 112 (-93.75%)
DrillApache Drill is a distributed MPP query layer for self describing data
Stars: ✭ 1,619 (-9.7%)
CmakCMAK is a tool for managing Apache Kafka clusters
Stars: ✭ 10,544 (+488.06%)
Collector HttpNorconex HTTP Collector is a flexible web crawler for collecting, parsing, and manipulating data from the Internet (or Intranet) to various data repositories such as search engines.
Stars: ✭ 130 (-92.75%)
Search Online🔍A simple extension for VSCode to search online easily using search engine.
Stars: ✭ 115 (-93.59%)
Swift Selection SearchSwift Selection Search (SSS) is a simple Firefox add-on that lets you quickly search for some text in a page using your favorite search engines.
Stars: ✭ 125 (-93.03%)
Pythondatarepo for code published on pythondata.com
Stars: ✭ 113 (-93.7%)
Sonic🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
Stars: ✭ 12,347 (+588.62%)
TorrentinimA very low memory-footprint, self hosted API-only torrent search engine. Sonarr + Radarr Compatible, native support for Linux, Mac and Windows.
Stars: ✭ 123 (-93.14%)
BayardA full-text search and indexing server written in Rust.
Stars: ✭ 1,555 (-13.27%)
BigdataclassTwo-day workshop that covers how to use R to interact databases and Spark
Stars: ✭ 110 (-93.87%)
GafferA large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (-8.42%)
Report自动化配置报表平台。演示地址http://58.87.112.247/report 账号 visitor密码123456
Stars: ✭ 123 (-93.14%)
Tennis Crystal BallUltimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-94.03%)
Hdfs ShellHDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS
Stars: ✭ 117 (-93.47%)
AzuredatalakeSamples and Docs for Azure Data Lake Store and Analytics
Stars: ✭ 128 (-92.86%)
Haystack🔍 Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.
Stars: ✭ 3,409 (+90.13%)
WikimanWikiman is an offline search engine for manual pages, Arch Wiki, Gentoo Wiki and other documentation.
Stars: ✭ 117 (-93.47%)
FeastFeature Store for Machine Learning
Stars: ✭ 2,576 (+43.67%)
Xinahn Client一个开源,高隐私,自架自用的聚合搜索引擎。https://xinahn.com
Stars: ✭ 116 (-93.53%)
AcceleratorThe Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (-92.36%)
Tinysearch🔍 Tiny, full-text search engine for static websites built with Rust and Wasm
Stars: ✭ 1,705 (-4.91%)
RichdemHigh-performance Terrain and Hydrology Analysis
Stars: ✭ 127 (-92.92%)
Amazon S3 Find And ForgetAmazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
Stars: ✭ 115 (-93.59%)
Just Dashboard📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (-15.73%)
Mobydq🐳 Tool to automate data quality checks on data pipelines
Stars: ✭ 123 (-93.14%)
AmbariMirror of Apache Ambari
Stars: ✭ 1,576 (-12.1%)
Cosmos Search🌱 The next generation unbiased real-time privacy and user focused code search engine for everyone; Join us at https://discourse.opengenus.org/
Stars: ✭ 137 (-92.36%)
GenieDistributed Big Data Orchestration Service
Stars: ✭ 1,544 (-13.89%)
Search PluginsSearch plugins for the search feature
Stars: ✭ 1,860 (+3.74%)
Instantsearch AndroidA library of widgets and helpers to build instant-search applications on Android.
Stars: ✭ 129 (-92.81%)
Spark R Notebooks R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-93.92%)
QuerqyQuery preprocessor for Java-based search engines (Querqy Core and Solr implementation)
Stars: ✭ 122 (-93.2%)
Rated Ranking EvaluatorSearch Quality Evaluation Tool for Apache Solr & Elasticsearch search-based infrastructures
Stars: ✭ 134 (-92.53%)
Dato.rssThe best RSS Search experience you can find
Stars: ✭ 122 (-93.2%)
Text SherlockText (source code) search engine with indexer and a front end web interface to search. Uses Python 3.
Stars: ✭ 103 (-94.26%)
MahaA framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.
Stars: ✭ 101 (-94.37%)
Whoogle SearchA self-hosted, ad-free, privacy-respecting metasearch engine
Stars: ✭ 4,645 (+159.06%)
FsearchA fast file search utility for Unix-like systems based on GTK+3
Stars: ✭ 1,370 (-23.59%)
SimpleaudioindexerSearching for the occurrence seconds of words/phrases or arbitrary regex patterns within audio files
Stars: ✭ 100 (-94.42%)
Datasets🎁 3,000,000+ Unsplash images made available for research and machine learning
Stars: ✭ 1,805 (+0.67%)