turtleDev / web-scraping-101

Licence: MIT license
An Introduction to Web Scraping

Programming Languages

python
shell

Web Scraping 101

Web Scraping (aka web mining) is the process of collecting information from a network resource. This document is a (WIP) introduction to the different aspects and techniques involved in a web scraping system.

Most of this text will be language-agnostic, but code snippets will probably break that rule.
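
To make the definition above concrete, here is a minimal sketch in Python of the two steps most scrapers share: fetching a resource and extracting something from it. It uses only the standard library. The names `LinkCollector`, `extract_links`, `fetch` and the sample markup are made up for this illustration and are not taken from the rest of this guide.

```python
from html.parser import HTMLParser
from urllib.request import urlopen


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag seen while parsing."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value may be None.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_links(html):
    """Return all link targets found in an HTML string, in document order."""
    parser = LinkCollector()
    parser.feed(html)
    return parser.links


def fetch(url):
    """Download a page and decode it to text (hypothetical helper, not called below)."""
    with urlopen(url, timeout=10) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


# Offline demonstration on a hand-written snippet, so no network access is needed.
sample = '<p><a href="/about">About</a> and <a href="https://example.com/">home</a></p>'
print(extract_links(sample))
```

To run it against a live page, pass the result of `fetch(url)` to `extract_links`. A real scraper would also need error handling, politeness delays and a check of the site's robots.txt, all of which this sketch leaves out.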

Begin reading here

Contributing

Pull Requests are always welcome :)

Things that could use a hand are:

  • more code examples.
  • content flow.
  • any spelling or grammatical errors.
  • references to external resources.

Questions?

If you'd like to ask a question, open up an issue. I'll try to get back to you as soon as I can.

License

MIT
