Norconex HTTP Collector is a flexible web crawler for collecting, parsing, and manipulating data from the Internet (or Intranet) to various data repositories such as search engines.

Stars: ✭ 130 (-92.75%)

Mutual labels: search-engine

Search Online

🔍A simple extension for VSCode to search online easily using search engine.

Stars: ✭ 115 (-93.59%)

Mutual labels: search-engine

Swift Selection Search

Swift Selection Search (SSS) is a simple Firefox add-on that lets you quickly search for some text in a page using your favorite search engines.

Stars: ✭ 125 (-93.03%)

Mutual labels: search-engine

Pythondata

repo for code published on pythondata.com

Stars: ✭ 113 (-93.7%)

Mutual labels: big-data

Sonic

🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.

Stars: ✭ 12,347 (+588.62%)

Mutual labels: search-engine

Torrentinim

A very low memory-footprint, self hosted API-only torrent search engine. Sonarr + Radarr Compatible, native support for Linux, Mac and Windows.

Stars: ✭ 123 (-93.14%)

Mutual labels: search-engine

Bayard

A full-text search and indexing server written in Rust.

Stars: ✭ 1,555 (-13.27%)

Mutual labels: search-engine

Bigdataclass

Two-day workshop that covers how to use R to interact databases and Spark

Stars: ✭ 110 (-93.87%)

Mutual labels: big-data

Covoiturage Libre

UNMAINTAINED

Stars: ✭ 109 (-93.92%)

Mutual labels: search-engine

Gaffer

A large-scale entity and relation database supporting aggregation of properties

Stars: ✭ 1,642 (-8.42%)

Mutual labels: big-data

Report

自动化配置报表平台。演示地址http://58.87.112.247/report 账号 visitor密码123456

Stars: ✭ 123 (-93.14%)

Mutual labels: big-data

Tennis Crystal Ball

Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction

Stars: ✭ 107 (-94.03%)

Mutual labels: big-data

Hdfs Shell

HDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS

Stars: ✭ 117 (-93.47%)

Mutual labels: big-data

Azuredatalake

Samples and Docs for Azure Data Lake Store and Analytics

Stars: ✭ 128 (-92.86%)

Mutual labels: big-data

Haystack

🔍 Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.

Stars: ✭ 3,409 (+90.13%)

Mutual labels: search-engine

Open Source Handbook

⭐️ Open source projects for all skill levels

Stars: ✭ 131 (-92.69%)

Mutual labels: big-data

Wikiman

Wikiman is an offline search engine for manual pages, Arch Wiki, Gentoo Wiki and other documentation.

Stars: ✭ 117 (-93.47%)

Mutual labels: search-engine

Feast

Feature Store for Machine Learning

Stars: ✭ 2,576 (+43.67%)

Mutual labels: big-data

Xinahn Client

一个开源，高隐私，自架自用的聚合搜索引擎。https://xinahn.com

Stars: ✭ 116 (-93.53%)

Mutual labels: search-engine

Accelerator

The Accelerator is a tool for fast and reproducible processing of large amounts of data.

Stars: ✭ 137 (-92.36%)

Mutual labels: big-data

Tinysearch

🔍 Tiny, full-text search engine for static websites built with Rust and Wasm

Stars: ✭ 1,705 (-4.91%)

Mutual labels: search-engine

Richdem

High-performance Terrain and Hydrology Analysis

Stars: ✭ 127 (-92.92%)

Mutual labels: big-data

Amazon S3 Find And Forget

Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)

Stars: ✭ 115 (-93.59%)

Mutual labels: big-data

Calcite Avatica

Mirror of Apache Calcite - Avatica

Stars: ✭ 130 (-92.75%)

Mutual labels: big-data

Just Dashboard

📊 📋 Dashboards using YAML or JSON files

Stars: ✭ 1,511 (-15.73%)

Mutual labels: big-data

Mobydq

🐳 Tool to automate data quality checks on data pipelines

Stars: ✭ 123 (-93.14%)

Mutual labels: big-data

Ambari

Mirror of Apache Ambari

Stars: ✭ 1,576 (-12.1%)

Mutual labels: big-data

Cosmos Search

🌱 The next generation unbiased real-time privacy and user focused code search engine for everyone; Join us at https://discourse.opengenus.org/

Stars: ✭ 137 (-92.36%)

Mutual labels: search-engine

Genie

Distributed Big Data Orchestration Service

Stars: ✭ 1,544 (-13.89%)

Mutual labels: big-data

Hazelcast Nodejs Client

Hazelcast IMDG Node.js Client

Stars: ✭ 124 (-93.08%)

Mutual labels: big-data

Search Plugins

Search plugins for the search feature

Stars: ✭ 1,860 (+3.74%)

Mutual labels: search-engine

Instantsearch Android

A library of widgets and helpers to build instant-search applications on Android.

Stars: ✭ 129 (-92.81%)

Mutual labels: search-engine

Spark R Notebooks

R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks

Stars: ✭ 109 (-93.92%)

Mutual labels: big-data

Querqy

Query preprocessor for Java-based search engines (Querqy Core and Solr implementation)

Stars: ✭ 122 (-93.2%)

Mutual labels: search-engine

Attic Predictionio Sdk Java

PredictionIO Java SDK

Stars: ✭ 107 (-94.03%)

Mutual labels: big-data

Rated Ranking Evaluator

Search Quality Evaluation Tool for Apache Solr & Elasticsearch search-based infrastructures

Stars: ✭ 134 (-92.53%)

Mutual labels: search-engine

Dato.rss

The best RSS Search experience you can find

Stars: ✭ 122 (-93.2%)

Mutual labels: search-engine

Mysql perf analyzer

MySQL performance monitoring and analysis.

Stars: ✭ 1,423 (-20.64%)

Mutual labels: big-data

Text Sherlock

Text (source code) search engine with indexer and a front end web interface to search. Uses Python 3.

Stars: ✭ 103 (-94.26%)

Mutual labels: search-engine

Maha

A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.

Stars: ✭ 101 (-94.37%)

Mutual labels: big-data

Couchdb Documentation

Apache CouchDB Documentation

Stars: ✭ 128 (-92.86%)

Mutual labels: big-data

Whoogle Search

A self-hosted, ad-free, privacy-respecting metasearch engine

Stars: ✭ 4,645 (+159.06%)

Mutual labels: search-engine

Fsearch

A fast file search utility for Unix-like systems based on GTK+3

Stars: ✭ 1,370 (-23.59%)

Mutual labels: search-engine

Simpleaudioindexer

Searching for the occurrence seconds of words/phrases or arbitrary regex patterns within audio files

Stars: ✭ 100 (-94.42%)

Mutual labels: search-engine

Scala Spark Tutorial

Project for James' Apache Spark with Scala course

Stars: ✭ 121 (-93.25%)

Mutual labels: big-data

Datasets

🎁 3,000,000+ Unsplash images made available for research and machine learning

Stars: ✭ 1,805 (+0.67%)

Mutual labels: search-engine

1-60 of 659 similar projects

›

next*5