Analyze your copy of your Facebook data with Ruby. Download the zip file from Facebook and get info about friend rankings by message count, vocabulary, contacts, friends-added statistics, and more.

✭ 515

ruby script data-science data-visualization statistics facebook scraping ruby-gem conversation

Facebook Scraper

Scrape Facebook public pages without an API key

✭ 499

python hacktoberfest facebook scraping

Nickjs

Web scraping library made by the Phantombuster team. Modern, simple & works on all websites. (Deprecated)

✭ 494

javascript automation browser deprecated scraping headless-chrome phantomjs

Geeksforgeeks.pdf

Topic-wise PDFs of GeeksforGeeks articles. (Last updated in October 2018)

✭ 489

python pdf download scraping

Ferret

Declarative web scraping

✭ 4,837

go HTML javascript hacktoberfest cli library chrome tool crawler scraper data-mining scraping crawling query-language scraping-websites cdp hacktoberfest2021

Scrapple

A framework for creating semi-automatic web content extractors

✭ 464

python tutorial crawler scrapy scraping web-scraping selector web-scraper beautifulsoup css-selector

Dataflowkit

Extract structured data from websites. Website scraping.

✭ 456

go golang scraper scraping headless crawling

Crawly

Crawly, a high-level web crawling & scraping framework for Elixir.

✭ 440

elixir erlang crawler spider scraper scraping crawling

Jekyll

Jekyll-based static site for The Programming Historian

✭ 387

python html api data-mining mapping scraping network-analysis text-analysis data-management

Lookyloo

Lookyloo is a web interface that allows users to capture a website page and then display a tree of domains that call each other.

✭ 381

python privacy scraping dfir capture information-security web-security

Undetected Chromedriver

Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva / DataDome / Cloudflare IUAM)

✭ 365

python python3 testing automation chrome browser selenium scraping cloudflare webdriver captcha navigator chromedriver

Data Science

Collection of useful data science topics along with code and articles

✭ 315

python jupyter-notebook machine-learning data-science natural-language-processing artificial-intelligence data-visualization data-analysis time-series scraping articles

Coronadatascraper

COVID-19 Coronavirus data scraped from government and curated data sources.

✭ 372

html scraping

Post Tuto Deployment

Build and deploy a machine learning app from scratch 🚀

✭ 368

python machine-learning docker pytorch api aws deployment selenium scrapy scraping

Comic Dl

Comic-dl is a command-line tool to download manga and comics from various comic and manga sites. Supported sites: readcomiconline.to, mangafox.me, Comic Naver and many more.

✭ 365

python web automation debian scraping youtube-dl manga python-script phantomjs comics

Katana

A Python tool for Google hacking

✭ 355

python python3 security proxy hacking google security-tools scraper hacking-tool tor scraping scada sqli

Socialreaper

Social media scraping / data collection library for Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs

✭ 338

python api youtube twitter facebook scraping reddit social-media pinterest tumblr

Autoscraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

✭ 4,077

python machine-learning automation artificial-intelligence ai crawler scraper scraping web-scraping webscraping scrape webautomation

Tinking

🧶 Extract data from any website without code, just clicks.

✭ 331

typescript puppeteer scraping

Social Media Profiles Regexs

📇 Extract social media profiles and more with regular expressions

✭ 324

python github twitter telegram facebook instagram regex scraping phone linkedin regular-expressions snapchat hackernews skype

Spidermon

Scrapy extension for monitoring spider execution.

✭ 309

python hacktoberfest testing monitoring scraping crawling monitoring-tool

LinkedIn scraper using Selenium WebDriver, headless Chromium, Docker and Scrapy

✭ 309

python docker bot docker-compose scraper scrapy scraping selenium-webdriver linkedin

Elixir Scrape

Scrape any website, article or RSS/Atom Feed with ease!

✭ 306

elixir html data-science rss scraping feed information-retrieval readability

Edu Mail Generator

Generate Free Edu Mail(s) within minutes

✭ 301

python python3 selenium scraping mail

Sasila

A flexible, friendly crawler framework

✭ 286

python framework http crawler scraping requests crawling

Clean Text

🧹 Python package for text cleaning

✭ 284

python nlp natural-language-processing scraping

Scrapy Crawlera

Crawlera middleware for Scrapy

✭ 281

python plugin proxy crawler scrapy scraping

Lambdasoup

Functional HTML scraping and rewriting with CSS in OCaml

✭ 280

ocaml html css scraping

Gopa

[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn

✭ 277

go elasticsearch crawler spider lightweight scraping web-scraping crawling web-crawler

Apify Js

Apify SDK — The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

✭ 3,154

javascript automation npm javascript-library puppeteer scraping headless-chrome web-scraping crawling web-crawling rpa apify

Mechanize

Mechanize is a Ruby library that makes automated web interaction easy.

✭ 4,158

ruby HTML web scraping

schedule-tweet

Schedules tweets using TweetDeck

✭ 14

python shell automation twitter scraping selenium webscraping selenium-python

instagram explorer

📷 An app to scrape Instagram posts and analyze data.

✭ 17

python HTML CSS javascript scraping instagram-scraper web-mining vader-sentiment-analysis

facebook-discussion-tk

A collection of tools to (semi-)automatically collect and analyze data from online discussions on Facebook groups and pages.

✭ 33

python PHP scraper facebook scraping

jazz

The Scripting Engine that Combines Speed, Safety, and Simplicity

✭ 132

rust shell android linux markdown data-science cryptography crypto database web embeddable jinja2 scraping development-environment chromeos jazz witness actix

ARGUS

ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9

✭ 68

python Jupyter Notebook Batchfile scraping crawling scrapy webscraping scrapyd webcrawling

bots-zoo

No description or website provided.

✭ 59

javascript python ruby go bot crawler scraper user-agent scraping crawling selenium useragent puppeteer playwright

61-120 of 229 scraping projects
