
mediacloud / ultimate-sitemap-parser

Licence: other
Ultimate Website Sitemap Parser

Programming Languages

python

Projects that are alternatives of or similar to ultimate-sitemap-parser

X.Web.Sitemap
Simple sitemap generator for .NET
Stars: ✭ 66 (-44.07%)
Mutual labels:  sitemap, sitemap-xml
sitemapper
parses sitemaps for Node.JS
Stars: ✭ 70 (-40.68%)
Mutual labels:  sitemap, sitemap-xml
jsitemapgenerator
Java sitemap generator. This library generates a web sitemap, can ping Google, generate RSS feed, robots.txt and more with friendly, easy to use Java 8 functional style of programming
Stars: ✭ 38 (-67.8%)
Mutual labels:  sitemap, robots-txt
express-sitemap-xml
Serve sitemap.xml from a list of URLs in Express
Stars: ✭ 56 (-52.54%)
Mutual labels:  sitemap, sitemap-xml
grav-plugin-sitemap
Grav Sitemap Plugin
Stars: ✭ 34 (-71.19%)
Mutual labels:  sitemap, sitemap-xml
Sitemap
Bolt Sitemap extension - create XML sitemaps for your Bolt website.
Stars: ✭ 19 (-83.9%)
Mutual labels:  sitemap, xml-sitemap
SitemapParser
XML Sitemap parser class compliant with the Sitemaps.org protocol.
Stars: ✭ 57 (-51.69%)
Mutual labels:  sitemap, xml-sitemap-parser
SitemapTools
A sitemap (sitemap.xml) querying and parsing library for .NET
Stars: ✭ 19 (-83.9%)
Mutual labels:  sitemap, sitemap-xml
Go Sitemap Generator
go-sitemap-generator is the easiest way to generate Sitemaps in Go
Stars: ✭ 152 (+28.81%)
Mutual labels:  sitemap
Bard
Developer friendly Bard that writes sitemap poetry in xml.
Stars: ✭ 47 (-60.17%)
Mutual labels:  sitemap
Sitemap
PHP XML Sitemap Generation
Stars: ✭ 128 (+8.47%)
Mutual labels:  sitemap
Sitemap Generator Crawler
Script that generates a sitemap by crawling a given URL
Stars: ✭ 169 (+43.22%)
Mutual labels:  sitemap
django-qartez
The missing XML sitemaps for Django: images sitemap, static sitemaps, alternate hreflang sitemaps.
Stars: ✭ 13 (-88.98%)
Mutual labels:  xml-sitemap
Craft Seomatic
SEOmatic facilitates modern SEO best practices & implementation for Craft CMS 3. It is a turnkey SEO system that is comprehensive, powerful, and flexible.
Stars: ✭ 135 (+14.41%)
Mutual labels:  sitemap
Vue Router Sitemap
Generate sitemap.xml by vue-router configuration
Stars: ✭ 102 (-13.56%)
Mutual labels:  sitemap
Craft Sitemap
Craft plugin to generate a sitemap.
Stars: ✭ 105 (-11.02%)
Mutual labels:  sitemap
robotify-netcore
Provides robots.txt middleware for .NET core
Stars: ✭ 15 (-87.29%)
Mutual labels:  robots-txt
classicpress-seo
Classic SEO is the first SEO plugin built specifically to work with ClassicPress. A fork of Rank Math, the plugin contains many essential SEO tools to help optimize your website.
Stars: ✭ 18 (-84.75%)
Mutual labels:  sitemap
Python Sitemap
Mini website crawler to make sitemap from a website.
Stars: ✭ 246 (+108.47%)
Mutual labels:  sitemap
Sitemap
Google sitemap builder for Laravel
Stars: ✭ 243 (+105.93%)
Mutual labels:  sitemap

Website sitemap parser for Python 3.5+.

Features

Installation

pip install ultimate-sitemap-parser

Usage

from usp.tree import sitemap_tree_for_homepage

tree = sitemap_tree_for_homepage('https://www.nytimes.com/')
print(tree)

sitemap_tree_for_homepage() returns a tree of AbstractSitemap subclass objects that represents the sitemap hierarchy found on the website; see the reference documentation for the AbstractSitemap subclasses.
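
To inspect the hierarchy, one option is to walk the tree recursively. The following is a minimal sketch, not part of the library: describe_tree is a hypothetical helper, and it assumes that every sitemap object exposes a url attribute and that index sitemaps additionally expose a sub_sitemaps list. Stand-in objects are used so the example runs without network access.

```python
from types import SimpleNamespace


def describe_tree(sitemap, depth=0):
    """Return indented lines describing a sitemap and its children.

    Hypothetical helper: assumes `sitemap.url` exists and that index
    sitemaps expose `sub_sitemaps`; nodes without that attribute are
    treated as leaves.
    """
    lines = ["  " * depth + "{}: {}".format(type(sitemap).__name__, sitemap.url)]
    for child in getattr(sitemap, "sub_sitemaps", None) or []:
        lines.extend(describe_tree(child, depth + 1))
    return lines


# Stand-in objects mimicking the assumed shape of a sitemap tree.
leaf = SimpleNamespace(url="https://example.com/sitemap-news.xml")
root = SimpleNamespace(url="https://example.com/sitemap.xml", sub_sitemaps=[leaf])

for line in describe_tree(root):
    print(line)
```

With a real tree, the same helper would be called as describe_tree(tree) on the object returned by sitemap_tree_for_homepage(), provided the assumptions above hold for the installed version.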

To list every page found in all of the website's sitemaps, use the all_pages() method:

# all_pages() returns an Iterator
for page in tree.all_pages():
    print(page)

The all_pages() method returns an iterator that yields SitemapPage objects; see the reference documentation for SitemapPage.
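
Because the same page can appear in more than one sitemap, it may be useful to deduplicate the results. The sketch below is an illustration only: unique_page_urls is a hypothetical helper, and it assumes that each yielded page object has a url attribute. Stand-in objects replace real SitemapPage instances so the example runs offline.

```python
from types import SimpleNamespace


def unique_page_urls(pages):
    """Return page URLs in first-seen order, skipping duplicates.

    Hypothetical helper: assumes each page object has a `url` attribute.
    """
    seen = set()
    urls = []
    for page in pages:
        if page.url not in seen:
            seen.add(page.url)
            urls.append(page.url)
    return urls


# Stand-ins for the objects yielded by tree.all_pages().
pages = iter([
    SimpleNamespace(url="https://example.com/a"),
    SimpleNamespace(url="https://example.com/b"),
    SimpleNamespace(url="https://example.com/a"),
])
print(unique_page_urls(pages))
```

With a real tree, the call would be unique_page_urls(tree.all_pages()). The helper consumes the iterator once and keeps only the URLs in memory, not the page objects.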
