All Projects → CuriousLearner → Geeksforgeeksscrapper

CuriousLearner / Geeksforgeeksscrapper

Licence: mit
Scrapes g4g and creates PDF

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Geeksforgeeksscrapper

Openpdf
OpenPDF is a free Java library for creating and editing PDF files with a LGPL and MPL open source license. OpenPDF is based on a fork of iText. We welcome contributions from other developers. Please feel free to submit pull-requests and bugreports to this GitHub repository. ⛺
Stars: ✭ 2,174 (+1653.23%)
Mutual labels:  hacktoberfest, pdf
Nytimes App
🗽 A Simple Demonstration of the New York Times App 📱 using Jsoup web crawler with MVVM Architecture 🔥
Stars: ✭ 246 (+98.39%)
Mutual labels:  hacktoberfest, webscraping
React Native Pdfview
📚 PDF viewer for React Native
Stars: ✭ 198 (+59.68%)
Mutual labels:  hacktoberfest, pdf
Educative.io Downloader
📖 This tool is to download course from educative.io for offline usage. It uses your login credentials and download the course.
Stars: ✭ 139 (+12.1%)
Mutual labels:  hacktoberfest, pdf
Laravel Dompdf
A DOMPDF Wrapper for Laravel
Stars: ✭ 4,978 (+3914.52%)
Mutual labels:  hacktoberfest, pdf
Svg2pdf.js
A javascript-only SVG to PDF conversion utility that runs in the browser. Brought to you by yWorks - the diagramming experts
Stars: ✭ 231 (+86.29%)
Mutual labels:  hacktoberfest, pdf
Parsr
Transforms PDF, Documents and Images into Enriched Structured Data
Stars: ✭ 2,736 (+2106.45%)
Mutual labels:  hacktoberfest, pdf
Rst2pdf
Use a text editor. Make a PDF.
Stars: ✭ 404 (+225.81%)
Mutual labels:  hacktoberfest, pdf
Pdfvuer
A PDF viewer for Vue using Mozilla's PDF.js
Stars: ✭ 443 (+257.26%)
Mutual labels:  hacktoberfest, pdf
Suckit
Suck the InTernet
Stars: ✭ 429 (+245.97%)
Mutual labels:  hacktoberfest, webscraping
Images To Pdf
An app to convert images to PDF file!
Stars: ✭ 602 (+385.48%)
Mutual labels:  hacktoberfest, pdf
Gazpacho
🥫 The simple, fast, and modern web scraping library
Stars: ✭ 525 (+323.39%)
Mutual labels:  hacktoberfest, webscraping
Backslide
💦 CLI tool for making HTML presentations with Remark.js using Markdown
Stars: ✭ 679 (+447.58%)
Mutual labels:  hacktoberfest, pdf
Youtubeexplode
The ultimate dirty YouTube library
Stars: ✭ 1,775 (+1331.45%)
Mutual labels:  hacktoberfest
Awesome Web Monetization
🕶️ Stuffs about Web Monetization. Packages, articles, documentation links and others tools.
Stars: ✭ 125 (+0.81%)
Mutual labels:  hacktoberfest
Tuist
🚀 Create, maintain, and interact with Xcode projects at scale
Stars: ✭ 2,234 (+1701.61%)
Mutual labels:  hacktoberfest
Colore
A powerful C# library for Razer Chroma's SDK
Stars: ✭ 121 (-2.42%)
Mutual labels:  hacktoberfest
Cross Domain Utils
Cross Domain utilities
Stars: ✭ 124 (+0%)
Mutual labels:  hacktoberfest
Alt F4
Alternative Factorio Friday Fan Facts, also known as Alt-F4
Stars: ✭ 124 (+0%)
Mutual labels:  hacktoberfest
Ros1 bridge
ROS 2 package that provides bidirectional communication between ROS 1 and ROS 2
Stars: ✭ 123 (-0.81%)
Mutual labels:  hacktoberfest

GeeksForGeeksScrapper

Scrapes GeeksForGeeks and creates html & PDF for chosen category along with syntax highlighting for the code.

Screenshots

Example of articles of C category as html:

alt tag

Example of articles of C category as pdf:

alt tag

Installation

To use the scrapper, install the following:

$ sudo apt-get install wkhtmltopdf

Then create venv

$ virtualenv /path/to/g4g-env

Switch to venv

$ source /path/to/g4g-env/bin/activate

Now install BeautifulSoup as:

$ pip install beautifulsoup4

or via package manager as:

$ sudo apt-get install python-bs4

or for Python dependencies, you can just install via requirements.txt inside the virtual environment.

$ pip install -r requirements.txt

Run the G4G_Scrapper

$ python g4g.py

Choose the category you want to scrape from the menu and wait for the magic to happen :)

You can find the output as G4G_<category_name>.html and G4G_<category_name>.pdf in the same directory.

Disclaimer: This is strictly for educational purpose only. Author will not be liable for anything, what so ever; you do with this script.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].