hedii / Php Crawler

License: MIT
A PHP crawler that finds emails on the internet

php-crawler

A crawler application with a PHP backend built on Laravel and a JavaScript frontend built with Vue.js. It finds email addresses on the internet.

Given an entry point URL, the crawler searches for email addresses on every URL reachable from that entry point under the same domain name. The emails it finds can be downloaded as a text file.
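
To make the idea concrete, here is a minimal, language-neutral sketch in Python of a same-domain email crawl. It is not the project's PHP implementation: the names crawl_emails, LinkParser, fetch and max_pages, the email regular expression, and the example.test pages are all assumptions made for illustration. The fetch callable is injected so the sketch runs without network access.

```python
import re
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse

# Deliberately simple pattern (an assumption); real-world address
# matching is more involved.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


class LinkParser(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def crawl_emails(entry_url, fetch, max_pages=50):
    """Breadth-first crawl restricted to the entry point's host.

    `fetch` is a hypothetical callable: URL -> HTML string, or None on failure.
    Returns the sorted list of distinct email addresses found.
    """
    host = urlparse(entry_url).netloc
    queue = deque([entry_url])
    seen = {entry_url}
    emails = set()
    pages_fetched = 0
    while queue and pages_fetched < max_pages:
        url = queue.popleft()
        html = fetch(url)
        pages_fetched += 1
        if html is None:
            continue
        emails.update(EMAIL_RE.findall(html))
        parser = LinkParser()
        parser.feed(html)
        for href in parser.hrefs:
            # Resolve relative links and drop any #fragment.
            link, _fragment = urldefrag(urljoin(url, href))
            target = urlparse(link)
            same_host = target.scheme in ("http", "https") and target.netloc == host
            if same_host and link not in seen:
                seen.add(link)
                queue.append(link)
    return sorted(emails)


# Fake site used instead of real HTTP requests.
pages = {
    "http://example.test/": '<a href="/about">About</a> '
                            '<a href="http://other.test/">Other</a> info@example.test',
    "http://example.test/about": '<a href="mailto:jane@example.test">Jane</a> '
                                 '<a href="/">Home</a>',
    "http://other.test/": "spam@other.test",
}

# One address per line, like a downloadable text file.
print("\n".join(crawl_emails("http://example.test/", pages.get)))
# info@example.test
# jane@example.test
```

The link to other.test is never followed because its host differs from the entry point's host, so spam@other.test does not appear in the result.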

Multiple users can run email searches without being able to see each other's searches (each search belongs to a single user).

Server requirements

  • PHP >= 7.2.0
  • OpenSSL PHP Extension
  • PDO PHP Extension
  • Mbstring PHP Extension
  • Tokenizer PHP Extension
  • XML PHP Extension

Installation

  • Create a MySQL database (default name: crawler)
  • Install the project with Composer:
      composer create-project hedii/php-crawler crawler
      cd crawler
  • Open the .env file, check the database credentials, and modify them if needed:
      DB_CONNECTION=mysql
      DB_HOST=127.0.0.1
      DB_PORT=3306
      DB_DATABASE=crawler
      DB_USERNAME=root
      DB_PASSWORD=your_password_here
  • In the .env file, set the application URL:
      APP_URL=http://crawler.localhost
  • Build the crawler application:
      php artisan crawler:build
  • Point your web server's document root (web root) at the public directory: /some/path/crawler/public. The index.php file in this directory serves as the front controller for all HTTP requests entering your application. See the Laravel documentation. I highly recommend using Laravel Valet if you are using a Mac. Otherwise, check out Laravel Homestead.
  • Done
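
As an optional sanity check for the .env step above, the following Python sketch reports which of the settings named in this section are missing or empty. It is a standalone helper written for illustration and is not part of php-crawler; the names parse_env, missing_keys and REQUIRED_KEYS are hypothetical, and the parser only handles plain KEY=VALUE lines.

```python
# Keys taken from the installation steps in this README.
REQUIRED_KEYS = (
    "APP_URL",
    "DB_CONNECTION",
    "DB_HOST",
    "DB_PORT",
    "DB_DATABASE",
    "DB_USERNAME",
    "DB_PASSWORD",
)


def parse_env(text):
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Strip surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(text, required=REQUIRED_KEYS):
    """Return the required keys that are absent or have an empty value."""
    values = parse_env(text)
    return [key for key in required if not values.get(key)]


sample = """\
APP_URL=http://crawler.localhost
DB_CONNECTION=mysql
DB_HOST=127.0.0.1
DB_PORT=3306
DB_DATABASE=crawler
DB_USERNAME=root
DB_PASSWORD=
"""

print(missing_keys(sample))
# ['DB_PASSWORD']
```

To check a real file, pass its contents instead of the sample, for example missing_keys(open(".env").read()). An empty DB_PASSWORD can be valid on some local setups, so treat the result as a prompt to double-check rather than as an error.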

Usage

  • Navigate to your php-crawler website
  • Register a new account
  • Create a new search
  • Create more searches
  • Download the found emails

Testing

composer test

Contributing

All contributions are welcome :)

Please write some tests if you are adding or modifying features.

License

php-crawler is open-sourced software licensed under the MIT license.