All Projects → roofman2008 → Pahe.ph-Scraper

roofman2008 / Pahe.ph-Scraper

Licence: MIT license
Pahe.ph [Pahe.in] Movies Website Scraper

Programming Languages

C#
18002 projects

Projects that are alternatives of or similar to Pahe.ph-Scraper

Seleniumcrawler
An example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site
Stars: ✭ 117 (+105.26%)
Mutual labels:  scraper, scraping
Serpscrap
SEO python scraper to extract data from major searchengine result pages. Extract data like url, title, snippet, richsnippet and the type from searchresults for given keywords. Detect Ads or make automated screenshots. You can also fetch text content of urls provided in searchresults or by your own. It's usefull for SEO and business related research tasks.
Stars: ✭ 153 (+168.42%)
Mutual labels:  scraper, scraping
Udemycoursegrabber
Your will to enroll in Udemy course is here, but the money isn't? Search no more! This python program searches for your desired course in more than [insert big number here] websites, compares the last updated date, and gives you the download link of the latest one back, but you also have the choice to see the other ones as well!
Stars: ✭ 137 (+140.35%)
Mutual labels:  scraper, scraping
Django Dynamic Scraper
Creating Scrapy scrapers via the Django admin interface
Stars: ✭ 1,024 (+1696.49%)
Mutual labels:  scraper, scraping
Colly
Elegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+27154.39%)
Mutual labels:  scraper, scraping
Email Extractor
The main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de una o varias Url
Stars: ✭ 81 (+42.11%)
Mutual labels:  scraper, scraping
google-scraper
This class can retrieve search results from Google.
Stars: ✭ 33 (-42.11%)
Mutual labels:  scraper, scraping
Headless Chrome Crawler
Distributed crawler powered by Headless Chrome
Stars: ✭ 5,129 (+8898.25%)
Mutual labels:  scraper, scraping
Jsonframe Cheerio
simple multi-level scraper json input/output for Cheerio
Stars: ✭ 196 (+243.86%)
Mutual labels:  scraper, scraping
Anime Dl
Anime-dl is a command-line program to download anime from CrunchyRoll and Funimation.
Stars: ✭ 190 (+233.33%)
Mutual labels:  scraper, scraping
Pypatent
Search for and retrieve US Patent and Trademark Office Patent Data
Stars: ✭ 31 (-45.61%)
Mutual labels:  scraper, scraping
Scrapysharp
reborn of https://bitbucket.org/rflechner/scrapysharp
Stars: ✭ 226 (+296.49%)
Mutual labels:  scraper, scraping
Lulu
[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (+1284.21%)
Mutual labels:  scraper, scraping
Geziyor
Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Stars: ✭ 1,246 (+2085.96%)
Mutual labels:  scraper, scraping
Imagescraper
✂️ High performance, multi-threaded image scraper
Stars: ✭ 630 (+1005.26%)
Mutual labels:  scraper, scraping
Phpscraper
PHP Scraper - an highly opinionated web-interface for PHP
Stars: ✭ 148 (+159.65%)
Mutual labels:  scraper, scraping
Dataflowkit
Extract structured data from web sites. Web sites scraping.
Stars: ✭ 456 (+700%)
Mutual labels:  scraper, scraping
Ferret
Declarative web scraping
Stars: ✭ 4,837 (+8385.96%)
Mutual labels:  scraper, scraping
Linkedin Profile Scraper
🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (+200%)
Mutual labels:  scraper, scraping
Goose Parser
Universal scrapping tool, which allows you to extract data using multiple environments
Stars: ✭ 211 (+270.18%)
Mutual labels:  scraper, scraping

Pahe.ph-Scraper (Postponed For While)

Pahe.ph Movies Website Scraper https://pahe.ph/

This project i have done from 2 years ago, but i'm reviving the project again cause i'm evil.

Full Documentation (Article)

https://roofman.me/2021/09/03/breaking-scraping-pahe-in/

Features

  • Bypass Surcuri WAF
  • Handle SoraLink & Extract Direct Links
  • Extract Posts in Full Details
  • Decode Download Section Obfuscation
  • Stateful Scraper which support Resume, Failsafe, Looping operations.

ToDo

  • Implementing Cloudflare Bypassing Automatic Mechansims [Not Needed Now]
  • Full Site Test [In Progress]
  • Move From .NET Framework To .NET Core [Not Started]

Contribution

https://github.com/roofman2008/Pahe.ph-Scraper/blob/main/CONTRIBUTING.md

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].