ab-anand / Scrapping

Licence: other
Mastering the art of scrapping 🎓

Programming Languages

python
PHP
javascript

Projects that are alternatives of or similar to Scrapping

reason-rust-scraper
🦀 Scraping & crawling websites using Rust, and ReasonML
Stars: ✭ 21 (-12.5%)
Mutual labels:  scraping
ksoup
Kotlin Wrapper for Jsoup
Stars: ✭ 59 (+145.83%)
Mutual labels:  scraping
proxycrawl-python
ProxyCrawl Python library for scraping and crawling
Stars: ✭ 51 (+112.5%)
Mutual labels:  scraping
selectorlib
A library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them
Stars: ✭ 53 (+120.83%)
Mutual labels:  scraping
html-table-extractor
extract data from html table
Stars: ✭ 74 (+208.33%)
Mutual labels:  scraping
browser-automation-api
Browser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other things like capture a screenshot, generate pdf, extract content or execute custom Puppeteer, Playwright functions.
Stars: ✭ 24 (+0%)
Mutual labels:  scraping
docker-selenium-lambda
The simplest demo of chrome automation by python and selenium in AWS Lambda
Stars: ✭ 172 (+616.67%)
Mutual labels:  scraping
copycat
A PHP Scraping Class
Stars: ✭ 70 (+191.67%)
Mutual labels:  scraping
scrapman
Retrieve real (with Javascript executed) HTML code from an URL, ultra fast and supports multiple parallel loading of webs
Stars: ✭ 21 (-12.5%)
Mutual labels:  scraping
Instagram-to-discord
Monitor instagram user account and automatically post new images to discord channel via a webhook. Working 2022!
Stars: ✭ 113 (+370.83%)
Mutual labels:  scraping
LInkedIn-Reverese-Lookup
🔎 Search LinkedIn profile by email address 📧
Stars: ✭ 20 (-16.67%)
Mutual labels:  scraping
puppeteer-botcheck
🕵️‍♂️ Bot detection tests for Puppeteer. Hide and seek!
Stars: ✭ 42 (+75%)
Mutual labels:  scraping
ha-multiscrape
Home Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for each value. Support for (login) form-submit functionality.
Stars: ✭ 103 (+329.17%)
Mutual labels:  scraping
crawling-framework
Easily crawl news portals or blog sites using Storm Crawler.
Stars: ✭ 22 (-8.33%)
Mutual labels:  scraping
scrap
Scrapping Facebook with JavaScript.
Stars: ✭ 25 (+4.17%)
Mutual labels:  scraping
PrawWallpaperDownloader
Download images from reddit
Stars: ✭ 18 (-25%)
Mutual labels:  scraping
zcrawl
An open source web crawling platform
Stars: ✭ 21 (-12.5%)
Mutual labels:  scraping
anime-scraper
[partially working] Scrape and add anime episode stream URLs to uGet (Linux) or IDM (Windows) ~ Python3
Stars: ✭ 21 (-12.5%)
Mutual labels:  scraping
InstaBot
Simple and friendly Bot for Instagram, using Selenium and Scrapy with Python.
Stars: ✭ 32 (+33.33%)
Mutual labels:  scraping
htmltab
Command-line utility to convert HTML tables into CSV files
Stars: ✭ 13 (-45.83%)
Mutual labels:  scraping

What is scraping?

Data scraping is a technique in which a computer program extracts data from human-readable output coming from another program.
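
To make the definition concrete, here is a minimal sketch that pulls structured data (link text and target) out of human-readable HTML. It uses only the Python standard library and is not taken from this repository; the names LinkExtractor, extract_links and SAMPLE_HTML, as well as the sample markup and example.com URLs, are assumptions made up for illustration.

```python
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collect (text, href) pairs for every <a href=...> element."""

    def __init__(self):
        super().__init__()
        self.links = []      # finished (text, href) pairs
        self._href = None    # href of the <a> currently open, if any
        self._text = []      # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        # Anchors without an href are ignored.
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append(("".join(self._text).strip(), self._href))
            self._href = None


def extract_links(html):
    """Return a list of (text, href) tuples found in an HTML string."""
    parser = LinkExtractor()
    parser.feed(html)
    parser.close()
    return parser.links


# Hypothetical sample page, standing in for a downloaded document.
SAMPLE_HTML = """
<ul>
  <li><a href="https://example.com/a">First story</a></li>
  <li><a href="https://example.com/b">Second <b>story</b></a></li>
</ul>
"""

if __name__ == "__main__":
    for text, href in extract_links(SAMPLE_HTML):
        print(f"{text} -> {href}")
```

A real scraper would first download the page and would likely use a more forgiving parser; the sketch only shows the extraction step.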

Note

This repository contains programs that perform web scraping. The programs are written in JavaScript and Python.

  • epl_team.py - EPL team scraper
  • hacker_news.py - Hacker News scraper
  • snapdeal_snapdeal.py - scrapes Snapdeal and extracts product details
  • scrape_flipkart.py - collects data from Flipkart
  • webcrawler_stack.py - crawls the most relevant and highest-voted questions for a specific term on Stack Overflow
  • webcrawler_g2.py - a web crawler for the review forum G2Crowd.com
  • snapdeal_spider.js - scrapes Snapdeal using Cheerio.js
  • scrape_myanimelist - scrapes details about anime (rank, episodes, etc.)
  • 21cinema.py - scrapes theater showtimes from 21cinema.com
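
The scripts themselves are not reproduced here. As a rough illustration of the kind of work several of them describe (turning a listing page into records), the sketch below converts an HTML table into a list of dictionaries using only the Python standard library. It is an assumption-laden example, not the repository's code: TableExtractor, table_to_dicts, SAMPLE_TABLE and the table contents are all hypothetical placeholders.

```python
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []      # completed rows, each a list of cell strings
        self._row = None    # cells of the row currently open
        self._cell = None   # text fragments of the cell currently open

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def table_to_dicts(html):
    """Treat the first row as the header and map later rows onto it."""
    parser = TableExtractor()
    parser.feed(html)
    parser.close()
    if not parser.rows:
        return []
    header, *body = parser.rows
    return [dict(zip(header, row)) for row in body]


# Hypothetical placeholder table; the values are made up.
SAMPLE_TABLE = """
<table>
  <tr><th>name</th><th>price</th></tr>
  <tr><td>Widget</td><td>10</td></tr>
  <tr><td>Gadget</td><td>25</td></tr>
</table>
"""

if __name__ == "__main__":
    for record in table_to_dicts(SAMPLE_TABLE):
        print(record)
```

Returning dictionaries keyed by the header row keeps the records easy to filter, sort, or write out as CSV or JSON afterwards.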