All Projects → flother → htmltab

flother / htmltab

Licence: MIT license
Command-line utility to convert HTML tables into CSV files

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to htmltab

ioweb
Web Scraping Framework
Stars: ✭ 31 (+138.46%)
Mutual labels:  scraping
crawling-framework
Easily crawl news portals or blog sites using Storm Crawler.
Stars: ✭ 22 (+69.23%)
Mutual labels:  scraping
scrapman
Retrieve real (with Javascript executed) HTML code from an URL, ultra fast and supports multiple parallel loading of webs
Stars: ✭ 21 (+61.54%)
Mutual labels:  scraping
oversmash
Overwatch API library for player details and career stats
Stars: ✭ 42 (+223.08%)
Mutual labels:  scraping
PrawWallpaperDownloader
Download images from reddit
Stars: ✭ 18 (+38.46%)
Mutual labels:  scraping
LInkedIn-Reverese-Lookup
🔎Search LinkedIn profile by email address📧
Stars: ✭ 20 (+53.85%)
Mutual labels:  scraping
scrapy-fieldstats
A Scrapy extension to log items coverage when the spider shuts down
Stars: ✭ 17 (+30.77%)
Mutual labels:  scraping
browser-automation-api
Browser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other things like capture a screenshot, generate pdf, extract content or execute custom Puppeteer, Playwright functions.
Stars: ✭ 24 (+84.62%)
Mutual labels:  scraping
reason-rust-scraper
🦀 Scraping & crawling websites using Rust, and ReasonML
Stars: ✭ 21 (+61.54%)
Mutual labels:  scraping
html-table-extractor
extract data from html table
Stars: ✭ 74 (+469.23%)
Mutual labels:  scraping
node-red-contrib-nbrowser
Provides a virtual web browser (a.k.a. "headless browser") appearing as a node.
Stars: ✭ 31 (+138.46%)
Mutual labels:  scraping
docker-selenium-lambda
The simplest demo of chrome automation by python and selenium in AWS Lambda
Stars: ✭ 172 (+1223.08%)
Mutual labels:  scraping
yttrex
youtube & tiktok analysis + youchoose recommendation custmizer. backend, extensions, and tooling
Stars: ✭ 31 (+138.46%)
Mutual labels:  scraping
asyncio-hn
Python (asyncio) wrapper for hackernews api
Stars: ✭ 27 (+107.69%)
Mutual labels:  scraping
ksoup
Kotlin Wrapper for Jsoup
Stars: ✭ 59 (+353.85%)
Mutual labels:  scraping
shorter.recipes
A website dedicated to making recipes from any website easy to read.
Stars: ✭ 27 (+107.69%)
Mutual labels:  scraping
selectorlib
A library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them
Stars: ✭ 53 (+307.69%)
Mutual labels:  scraping
ha-multiscrape
Home Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for each value. Support for (login) form-submit functionality.
Stars: ✭ 103 (+692.31%)
Mutual labels:  scraping
zcrawl
An open source web crawling platform
Stars: ✭ 21 (+61.54%)
Mutual labels:  scraping
puppeteer-botcheck
🕵‍♂ Bot detection tests for Puppeteer. Hide and seek!
Stars: ✭ 42 (+223.08%)
Mutual labels:  scraping

HTMLTab

Results from latest push build Code coverage report

HTMLTab is a command-line utility to select a table within an HTML document and convert it to CSV. Here we can get the foreign-born population of Edinburgh from Wikipedia:

$ htmltab --select p+table.wikitable.plainrowheaders https://en.wikipedia.org/wiki/Edinburgh
Place of birth,Estimated resident population (2011)[117]
Poland,11651
India,4888
Ireland,4743
Mainland China [A],4188
United States,3700
Germany,3500
Pakistan,2472
Australia,2100
France,2000
Spain,2000
South Africa,1800
Canada,1800
Hong Kong,1600

For further details on installation and usage, see HTMLTab's online documentation. If you find a bug or have a feature request, please create an issue.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].