MontFerret / Ferret

License: Apache-2.0
Declarative web scraping

Programming Languages

Go
31211 projects - #10 most used programming language
HTML
75241 projects
JavaScript
184084 projects - #8 most used programming language

Projects that are alternatives of or similar to Ferret

Headless Chrome Crawler
Distributed crawler powered by Headless Chrome
Stars: ✭ 5,129 (+6.04%)
Mutual labels:  crawler, scraper, scraping, crawling, chrome
Crawly
Crawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (-90.9%)
Mutual labels:  crawler, scraper, scraping, crawling
Lulu
[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (-83.69%)
Mutual labels:  crawler, scraper, scraping, crawling
proxycrawl-python
ProxyCrawl Python library for scraping and crawling
Stars: ✭ 51 (-98.95%)
Mutual labels:  scraper, scraping, crawling, scraping-websites
Scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Stars: ✭ 42,343 (+775.4%)
Mutual labels:  hacktoberfest, crawler, scraping, crawling
Linkedin Profile Scraper
🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-96.46%)
Mutual labels:  crawler, scraper, scraping, crawling
Colly
Elegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+221.17%)
Mutual labels:  crawler, scraper, scraping, crawling
bots-zoo
No description or website provided.
Stars: ✭ 59 (-98.78%)
Mutual labels:  crawler, scraper, scraping, crawling
Npkill
List any node_modules directories in your system, as well as the space they take up. You can then select which ones you want to erase to free up space.
Stars: ✭ 5,325 (+10.09%)
Mutual labels:  cli, hacktoberfest, tool
LeetCode
At present contains scraped data from around 1500 problems present on the site. More to follow....
Stars: ✭ 45 (-99.07%)
Mutual labels:  data-mining, scraper, scraping-websites
Instagram-to-discord
Monitor instagram user account and automatically post new images to discord channel via a webhook. Working 2022!
Stars: ✭ 113 (-97.66%)
Mutual labels:  scraper, scraping, scraping-websites
scrapman
Retrieve the real HTML (with JavaScript executed) from a URL; ultra fast, with support for loading multiple pages in parallel
Stars: ✭ 21 (-99.57%)
Mutual labels:  scraper, scraping, scraping-websites
diffbot-php-client
[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Stars: ✭ 53 (-98.9%)
Mutual labels:  scraper, scraping, crawling
gochanges
[ARCHIVED] Website changes tracker 🔍
Stars: ✭ 12 (-99.75%)
Mutual labels:  scraper, scraping, scraping-websites
wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-98.92%)
Mutual labels:  scraper, scraping, crawling
document-dl
Command line program to download documents from web portals.
Stars: ✭ 14 (-99.71%)
Mutual labels:  scraper, scraping, scraping-websites
papercut
Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-99.69%)
Mutual labels:  crawler, scraper, scraping
Musoq
Use SQL on various data sources
Stars: ✭ 252 (-94.79%)
Mutual labels:  cli, scraping, tool
Sasila
A flexible, friendly crawler framework
Stars: ✭ 286 (-94.09%)
Mutual labels:  crawler, scraping, crawling
Gopa
[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (-94.27%)
Mutual labels:  crawler, scraping, crawling

Ferret

What is it?

Ferret is a web scraping system that aims to simplify data extraction from the web for UI testing, machine learning, analytics, and more.
Ferret lets users focus on the data: its own declarative language abstracts away the technical details and complexity of the underlying technologies. It is designed to be portable, extensible, and fast.

An introductory blog post about Ferret is also available.

Features

  • Declarative language
  • Support of both static and dynamic web pages
  • Embeddable
  • Extensible
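
To make the "declarative" idea concrete, here is a minimal Go sketch, not Ferret's language or API. It assumes a hypothetical `Field` type and `Extract` function, and uses regular expressions as a stand-in for real DOM or selector handling. The point it illustrates is only that the query is plain data describing what to extract, while a small engine decides how.

```go
package main

import (
	"fmt"
	"regexp"
)

// Field is one declarative extraction rule: a result name plus a pattern
// whose first capture group is the value to keep.
// Hypothetical type for illustration; it is not part of Ferret.
type Field struct {
	Name    string
	Pattern string
}

// Extract applies every rule to the page text and collects all matches
// per field. Patterns without a capture group contribute nothing.
func Extract(page string, fields []Field) (map[string][]string, error) {
	out := make(map[string][]string, len(fields))
	for _, f := range fields {
		re, err := regexp.Compile(f.Pattern)
		if err != nil {
			return nil, fmt.Errorf("field %q: %w", f.Name, err)
		}
		for _, m := range re.FindAllStringSubmatch(page, -1) {
			if len(m) > 1 {
				out[f.Name] = append(out[f.Name], m[1])
			}
		}
	}
	return out, nil
}

func main() {
	// A tiny static page; a real system would fetch and parse the document.
	page := `<h1>Ferret</h1><a href="/docs">Docs</a><a href="/cli">CLI</a>`

	// The "query" only states what is wanted, not how to walk the page.
	query := []Field{
		{Name: "title", Pattern: `<h1>([^<]*)</h1>`},
		{Name: "links", Pattern: `href="([^"]*)"`},
	}

	result, err := Extract(page, query)
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(result["title"], result["links"])
}
```

Regular expressions are fragile on real HTML, so a production tool would rely on a proper parser or a browser driver; the sketch keeps to the standard library so that it stays self-contained.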

Documentation is available at our website.

Different languages

  • Ferret for Python: Pyfer