spekulatius / Phpscraper

Licence: gpl-3.0

PHP Scraper - a highly opinionated web interface for PHP

Labels

scraper scraping web-scraping web-scraper

Projects that are alternatives to or similar to Phpscraper

Scrape Linkedin Selenium

`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.

Stars: ✭ 239 (+61.49%)

Mutual labels: scraper, scraping, web-scraping, web-scraper

Linkedin-Client

Web scraper for grabbing data from Linkedin profiles or company pages (personal project)

Stars: ✭ 42 (-71.62%)

Mutual labels: scraper, web-scraper, web-scraping

A framework for creating semi-automatic web content extractors

Stars: ✭ 464 (+213.51%)

Mutual labels: scraping, web-scraping, web-scraper

PHP Library for detecting CMS

Stars: ✭ 78 (-47.3%)

Mutual labels: scraping, web-scraping, web-scraper

📻 An OLX scraper using Scrapy + MongoDB. It scrapes recent ads posted for a requested product and dumps them to MongoDB (NoSQL).

Stars: ✭ 15 (-89.86%)

Mutual labels: scraper, web-scraper, web-scraping

top-github-scraper

Scrape top GitHub repositories and users based on keywords

Stars: ✭ 40 (-72.97%)

Mutual labels: scraping, web-scraper, web-scraping

Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.

Stars: ✭ 15 (-89.86%)

Mutual labels: scraper, scraping, web-scraping

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

Stars: ✭ 4,077 (+2654.73%)

Mutual labels: scraper, scraping, web-scraping

A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

Stars: ✭ 656 (+343.24%)

Mutual labels: scraper, web-scraping, web-scraper

Zillow Scraper for Python using Selenium

Stars: ✭ 141 (-4.73%)

Mutual labels: scraper, web-scraping

Go cascadia package command line CSS selector

Stars: ✭ 67 (-54.73%)

Mutual labels: web-scraping, web-scraper

Powerful web scraping framework for Crystal

Stars: ✭ 68 (-54.05%)

Mutual labels: web-scraping, web-scraper

Social Media Profile Scrapers

Fetch user's data across social media

Stars: ✭ 60 (-59.46%)

Mutual labels: web-scraping, web-scraper

Scrapy Craigslist

Web Scraping Craigslist's Engineering Jobs in NY with Scrapy

Stars: ✭ 54 (-63.51%)

Mutual labels: web-scraping, web-scraper

MetaData html scraper and parser for Node.js (supports Promises and callback style)

Stars: ✭ 129 (-12.84%)

Mutual labels: web-scraping, web-scraper

A Router WiFi key recovery/cracking tool with a twist.

Stars: ✭ 52 (-64.86%)

Mutual labels: web-scraping, web-scraper

Django Dynamic Scraper

Creating Scrapy scrapers via the Django admin interface

Stars: ✭ 1,024 (+591.89%)

Mutual labels: scraper, scraping

Email Extractor

The main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de una o varias Url

Stars: ✭ 81 (-45.27%)

Mutual labels: scraper, scraping

Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.

Stars: ✭ 1,246 (+741.89%)

Mutual labels: scraper, scraping

Simple Query Scraping with CSS and Go Reflection (MOVED to Gitlab)

Stars: ✭ 144 (-2.7%)

Mutual labels: scraping, web-scraping

PHP Scraper

An opinionated & limited way to scrape the web using PHP. The main goal is to get stuff done instead of getting distracted with XPath selectors, preparing data structures, etc. Instead, you can just "go to a website" and get an array with all details relevant to your scraping project.

Under the hood, it uses Goutte and a few other packages. See composer.json.

Sponsors

This project is sponsored by:

Want to sponsor this project? Contact me.

Examples

Here are a few impressions of how the library works. More examples are on the project website.

Get the Title of a Website

All scraping functionality can be accessed either as a function call or a property call. Taking title scraping as an example, this looks as follows:

$web = new \spekulatius\phpscraper();

$web->go('https://google.com');

// Returns "Google"
echo $web->title;

// Also returns "Google"
echo $web->title();
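
The dual access shown above (property or method) could be provided in PHP through the `__get` magic method. The following is only a minimal sketch of that general technique, assuming PHP 8.0 or newer. The class `DualAccessPage` and its regex-based `title()` are hypothetical stand-ins and do not reflect how PHPScraper is actually implemented.

```php
<?php

declare(strict_types=1);

/**
 * Hypothetical stand-in class, NOT the real PHPScraper implementation.
 * It only illustrates how a property read can be forwarded to a method.
 */
final class DualAccessPage
{
    public function __construct(private string $html)
    {
    }

    /** Returns the text of the first <title> element, or null if none is found. */
    public function title(): ?string
    {
        if (preg_match('/<title[^>]*>(.*?)<\/title>/is', $this->html, $matches) === 1) {
            return trim(html_entity_decode($matches[1]));
        }

        return null;
    }

    /** Forwards reads of inaccessible properties (e.g. $page->title) to the method of the same name. */
    public function __get(string $name): mixed
    {
        if (method_exists($this, $name)) {
            return $this->$name();
        }

        throw new LogicException("Unknown property: {$name}");
    }
}

$page = new DualAccessPage('<html><head><title>Example Page</title></head></html>');

echo $page->title, PHP_EOL;   // property-style access, routed through __get()
echo $page->title(), PHP_EOL; // method-style access, called directly
```

Both `echo` statements print the same string, because the property read ends up calling the method. A real library would more likely parse the DOM than use a regular expression; the regex here just keeps the sketch self-contained.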

Scrape the Images from a Website

Scraping the images including the attributes of the img-tags:

$web = new \spekulatius\phpscraper();

/**
 * Navigate to the test page.
 *
 * This page contains the image "cat.jpg" twice:
 * once with a relative path and once with an absolute path.
 */
$web->go('https://test-pages.phpscraper.de/meta/lorem-ipsum.html');

var_dump($web->imagesWithDetails);
/**
 * Contains:
 *
 * [
 *     'url' => 'https://test-pages.phpscraper.de/assets/cat.jpg',
 *     'alt' => 'absolute path',
 *     'width' => null,
 *     'height' => null,
 * ],
 * [
 *     'url' => 'https://test-pages.phpscraper.de/assets/cat.jpg',
 *     'alt' => 'relative path',
 *     'width' => null,
 *     'height' => null,
 * ]
 */
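
Both entries in the result above carry the same absolute URL, even though one image is referenced with a relative path. A rough sketch of how such a path can be resolved against the page URL is shown below. The helper `resolveUrl()` is hypothetical and not part of PHPScraper, the relative path `../assets/cat.jpg` is an assumption about the test page's markup, and the logic is deliberately simplified (no ports, query strings, or trailing slashes). It assumes PHP 8.0 or newer.

```php
<?php

declare(strict_types=1);

/**
 * Hypothetical helper, not part of PHPScraper: resolves a path against a page URL.
 * Simplified: handles absolute URLs, root-relative paths, and document-relative
 * paths with "." and ".." segments. Ports, queries, and fragments are ignored.
 */
function resolveUrl(string $baseUrl, string $path): string
{
    // A path that already has a scheme is absolute: return it unchanged.
    if (preg_match('#^[a-z][a-z0-9+.-]*://#i', $path) === 1) {
        return $path;
    }

    $base = parse_url($baseUrl);
    $origin = $base['scheme'] . '://' . $base['host'];

    if (str_starts_with($path, '/')) {
        // Root-relative: the path replaces the base path entirely.
        $combined = $path;
    } else {
        // Document-relative: append to the directory of the base path.
        $basePath = $base['path'] ?? '/';
        $directory = substr($basePath, 0, strrpos($basePath, '/') + 1);
        $combined = $directory . $path;
    }

    // Collapse "." and ".." segments.
    $resolved = [];
    foreach (explode('/', $combined) as $segment) {
        if ($segment === '..') {
            array_pop($resolved);
        } elseif ($segment !== '.' && $segment !== '') {
            $resolved[] = $segment;
        }
    }

    return $origin . '/' . implode('/', $resolved);
}

$pageUrl = 'https://test-pages.phpscraper.de/meta/lorem-ipsum.html';

// Assumed relative reference; the actual markup of the test page may differ.
echo resolveUrl($pageUrl, '../assets/cat.jpg'), PHP_EOL;
echo resolveUrl($pageUrl, 'https://test-pages.phpscraper.de/assets/cat.jpg'), PHP_EOL;
```

Under these assumptions, both calls yield `https://test-pages.phpscraper.de/assets/cat.jpg`, which matches the two identical `url` values in the dump above.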

See the full documentation for more information and examples.

Stars: ✭ 148