All Projects → dadoonet → Fscrawler

dadoonet / Fscrawler

Licence: apache-2.0
Elasticsearch File System Crawler (FS Crawler)

Programming Languages

java
68154 projects - #9 most used programming language

Projects that are alternatives of or similar to Fscrawler

Gopa
[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (-69.43%)
Mutual labels:  crawler, elasticsearch
Diskover
File system crawler, disk space usage, file search engine and file system analytics powered by Elasticsearch
Stars: ✭ 977 (+7.84%)
Mutual labels:  crawler, elasticsearch
News Please
news-please - an integrated web crawler and information extractor for news that just works.
Stars: ✭ 969 (+6.95%)
Mutual labels:  crawler, elasticsearch
Fess
Fess is very powerful and easily deployable Enterprise Search Server.
Stars: ✭ 561 (-38.08%)
Mutual labels:  crawler, elasticsearch
Zhihu Crawler
zhihu-crawler是一个基于Java的高性能、支持免费http代理池、支持横向扩展、分布式爬虫项目
Stars: ✭ 890 (-1.77%)
Mutual labels:  crawler
Instagram Profilecrawl
📝 quickly crawl the information (e.g. followers, tags etc...) of an instagram profile.
Stars: ✭ 816 (-9.93%)
Mutual labels:  crawler
Demo Scene
👾Scripts and samples to support Confluent Demos and Talks. ⚠️Might be rough around the edges ;-) 👉For automated tutorials and QA'd code, see https://github.com/confluentinc/examples/
Stars: ✭ 806 (-11.04%)
Mutual labels:  elasticsearch
Springbootexamples
Spring Boot 学习教程
Stars: ✭ 794 (-12.36%)
Mutual labels:  elasticsearch
Hugo Elasticsearch
Generate Elasticsearch indexes for Hugo static sites by parsing front matter
Stars: ✭ 19 (-97.9%)
Mutual labels:  elasticsearch
Laravel Docker Elasticsearch
This is a simple repo for practicing elasticsearch with laravel and docker.
Stars: ✭ 18 (-98.01%)
Mutual labels:  elasticsearch
Complete Guide To Elasticsearch
Contains all of the queries used within the Complete Guide to Elasticsearch course.
Stars: ✭ 829 (-8.5%)
Mutual labels:  elasticsearch
Py3 scripts
Life is short, *****.
Stars: ✭ 5 (-99.45%)
Mutual labels:  crawler
Scalable Image Matching
This is a image matching system for scalable and efficient matching of images from a large database. The basic idea is to compute perceptural hash value for each image and compare the similarity based on the pHash computed. Searching are scalable with the elasticsearch as the backend database.
Stars: ✭ 17 (-98.12%)
Mutual labels:  elasticsearch
Datastream.io
An open-source framework for real-time anomaly detection using Python, ElasticSearch and Kibana
Stars: ✭ 814 (-10.15%)
Mutual labels:  elasticsearch
Elasticsearch Query Builder
Build query for an ElasticSearch client using a fluent interface
Stars: ✭ 18 (-98.01%)
Mutual labels:  elasticsearch
Serverless Appsync Plugin
serverless plugin for appsync
Stars: ✭ 804 (-11.26%)
Mutual labels:  elasticsearch
Psi Report
Crawls a website, gets PageSpeed Insights data for each page, and exports an HTML report.
Stars: ✭ 6 (-99.34%)
Mutual labels:  crawler
Great Big Example Application
A full-stack example app built with JHipster, Spring Boot, Kotlin, Angular 4, ngrx, and Webpack
Stars: ✭ 899 (-0.77%)
Mutual labels:  elasticsearch
Szt Bigdata
深圳地铁大数据客流分析系统🚇🚄🌟
Stars: ✭ 826 (-8.83%)
Mutual labels:  elasticsearch
Python
Python脚本。模拟登录知乎, 爬虫,操作excel,微信公众号,远程开机
Stars: ✭ 7,355 (+711.81%)
Mutual labels:  crawler

File System Crawler for Elasticsearch

Welcome to the FS Crawler for Elasticsearch

This crawler helps to index binary documents such as PDF, Open Office, MS Office.

Main features:

  • Local file system (or a mounted drive) crawling and index new files, update existing ones and removes old ones.
  • Remote file system over SSH crawling.
  • REST interface to let you "upload" your binary documents to elasticsearch.

You need to install a version matching your Elasticsearch version:

Elasticsearch FS Crawler Released Docs
6.x, 7.x 2.7-SNAPSHOT 2.7-SNAPSHOT
2.x, 5.x, 6.x 2.6 2019-01-09 2.6
2.x, 5.x, 6.x 2.5 2018-08-04 2.5
2.x, 5.x, 6.x 2.4 2017-08-11 2.4
2.x, 5.x, 6.x 2.3 2017-07-10 2.3
1.x, 2.x, 5.x 2.2 2017-02-03 2.2
1.x, 2.x, 5.x 2.1 2016-07-26 2.1
es-2.0 2.0.0 2015-10-30 2.0.0

Build and Quality Status

Maven Central Travis Documentation Status Code Quality: Java Total Alerts

Lines Duplicated Lines Maintainability Technical Debt Reliability

Vulnerabilities Bugs Quality Gate Code Smells Coverage

The guide has been moved to ReadTheDocs.

License

Read more about the License.

Thanks

Thanks to JetBrains for the IntelliJ IDEA License!

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].