All Projects → jadbin → serlist

jadbin / serlist

Licence: Apache-2.0 license
Search engine results page scraper

Programming Languages

python
139335 projects - #7 most used programming language
Makefile
30231 projects

Projects that are alternatives of or similar to serlist

Pyquery
A jquery-like library for python
Stars: ✭ 2,058 (+17050%)
Mutual labels:  lxml
Requests Html
Pythonic HTML Parsing for Humans™
Stars: ✭ 12,268 (+102133.33%)
Mutual labels:  lxml
LBDuoDian
No description or website provided.
Stars: ✭ 21 (+75%)
Mutual labels:  lxml
codechef-rank-comparator
Web application hosted on Heroku cloud platform based on web scraping in python using lxml library (XML Path Language).
Stars: ✭ 23 (+91.67%)
Mutual labels:  lxml
terrain generator
A wizard that generates terrains for Gazebo using height maps.
Stars: ✭ 46 (+283.33%)
Mutual labels:  lxml
github-trending
GitHub trending API powered by Python Tornado.
Stars: ✭ 36 (+200%)
Mutual labels:  lxml
crawler
requests+lxml爬虫,简单爬虫架构
Stars: ✭ 72 (+500%)
Mutual labels:  lxml
dnevnikru
dnevnik.ru parser
Stars: ✭ 20 (+66.67%)
Mutual labels:  lxml
WorkAggregation
招聘岗位信息聚合系统,拥有爬虫爬取、数据分析、可视化、互动等功能
Stars: ✭ 258 (+2050%)
Mutual labels:  lxml
python3-mal
Python interface to MyAnimeList
Stars: ✭ 18 (+50%)
Mutual labels:  lxml
Instagram Stalker Scraper
(UNMAINTAINED) Fetch data of any public Instagram profile, without using api
Stars: ✭ 39 (+225%)
Mutual labels:  lxml
chopper
Chopper is a tool to extract elements from HTML by preserving ancestors and CSS rules
Stars: ✭ 22 (+83.33%)
Mutual labels:  lxml
yellowpages-scraper
Yellowpages.com Web Scraper written in Python and LXML to extract business details available based on a particular category and location.
Stars: ✭ 56 (+366.67%)
Mutual labels:  lxml
iHealth crawler
iHealth 项目的内容爬虫(一个基于 python 和 MongoDB 的医疗咨询爬虫)
Stars: ✭ 24 (+100%)
Mutual labels:  lxml

SERList

https://travis-ci.org/jadbin/serlist.svg?branch=master https://coveralls.io/repos/github/jadbin/serlist/badge.svg?branch=master

Overview

SERList is used to scrape the information from a search engine results page including:

  • title
  • link
  • description

Now, SERList can well deal with the results from these search engines without setting anything (e.g. XPath):

Installation

Install using pip:

pip install serlist

Basic Usage

from serlist import SerpScraper

SerpScraper().scrape(text)

The variable text is the HTML text of a search engine results page.

Documentation

https://serlist.readthedocs.io/

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].