All Projects β†’ baptisteArno β†’ Tinking

baptisteArno / Tinking

Licence: gpl-3.0
🧢 Extract data from any website without code, just clicks.

Programming Languages

typescript
32286 projects

Projects that are alternatives of or similar to Tinking

Linkedin Profile Scraper
πŸ•΅οΈβ€β™‚οΈ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-48.34%)
Mutual labels:  scraping, puppeteer
LInkedIn-Reverese-Lookup
πŸ”ŽSearch LinkedIn profile by email addressπŸ“§
Stars: ✭ 20 (-93.96%)
Mutual labels:  scraping, puppeteer
Thal
Getting started with Puppeteer and Chrome Headless for Web Scraping
Stars: ✭ 2,345 (+608.46%)
Mutual labels:  scraping, puppeteer
Awesome Puppeteer
A curated list of awesome puppeteer resources.
Stars: ✭ 1,728 (+422.05%)
Mutual labels:  scraping, puppeteer
naos
πŸ“‰ Uptime and error monitoring CLI
Stars: ✭ 30 (-90.94%)
Mutual labels:  scraping, puppeteer
Secret Agent
The web browser that's built for scraping.
Stars: ✭ 151 (-54.38%)
Mutual labels:  scraping, puppeteer
double-agent
A test suite of common scraper detection techniques. See how detectable your scraper stack is.
Stars: ✭ 123 (-62.84%)
Mutual labels:  scraping, puppeteer
Educative.io Downloader
πŸ“– This tool is to download course from educative.io for offline usage. It uses your login credentials and download the course.
Stars: ✭ 139 (-58.01%)
Mutual labels:  scraping, puppeteer
browser-pool
A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.
Stars: ✭ 71 (-78.55%)
Mutual labels:  scraping, puppeteer
browser-automation-api
Browser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other things like capture a screenshot, generate pdf, extract content or execute custom Puppeteer, Playwright functions.
Stars: ✭ 24 (-92.75%)
Mutual labels:  scraping, puppeteer
Headless Chrome Crawler
Distributed crawler powered by Headless Chrome
Stars: ✭ 5,129 (+1449.55%)
Mutual labels:  scraping, puppeteer
bots-zoo
No description or website provided.
Stars: ✭ 59 (-82.18%)
Mutual labels:  scraping, puppeteer
Whatsapp-Net
Generate a network graph of connections from your WhatsApp groups data
Stars: ✭ 75 (-77.34%)
Mutual labels:  scraping, puppeteer
puppeteer-botcheck
πŸ•΅β€β™‚ Bot detection tests for Puppeteer. Hide and seek!
Stars: ✭ 42 (-87.31%)
Mutual labels:  scraping, puppeteer
whatsapp-tracking
Scraping the status of WhatsApp contacts
Stars: ✭ 49 (-85.2%)
Mutual labels:  scraping, puppeteer
Apify Js
Apify SDK β€” The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.
Stars: ✭ 3,154 (+852.87%)
Mutual labels:  scraping, puppeteer
Scrapy Crawlera
Crawlera middleware for Scrapy
Stars: ✭ 281 (-15.11%)
Mutual labels:  scraping
Edu Mail Generator
Generate Free Edu Mail(s) within minutes
Stars: ✭ 301 (-9.06%)
Mutual labels:  scraping
Puppetcam
Export puppeteer tab as webm video
Stars: ✭ 279 (-15.71%)
Mutual labels:  puppeteer
Lambdasoup
Functional HTML scraping and rewriting with CSS in OCaml
Stars: ✭ 280 (-15.41%)
Mutual labels:  scraping

🧢 Tinking

Extract data from any website without code, just clicks

A Chrome extension that allows you to create a scraping recipe by directly selecting a page's elements with your mouse.

Tinking allows you to define stepwise rules for scraping a website's content.

Watch a video demo

This tool is under active development. The code could use some cleanup, and there may be bugs. PRs are very welcome!

πŸ’Œ Receive updates in your inbox

πŸ‘‡

Getting Started

Tinking is not yet available on the Chrome Web Store. Follow these steps to run the extension locally:

1. Clone the repository

HTTPS:

git clone https://github.com/baptisteArno/tinking.git

SSH:

git clone [email protected]:baptisteArno/tinking.git

2. Install dependencies, lint code, and build the extension

Using yarn:

yarn && yarn build

Or using npm:

npm i && npm run build

3. Install the extension in Google Chrome (or any Chromium browser)

Navigate to the following URL in Chrome:

chrome://extensions/

Make sure Developer Mode is turned on, and click the Load unpacked button. Select the build folder of the project.

You should now see Tinking listed among any other extensions that you have installed, and you can begin using it to scrape websites.

πŸ‘‰ How to use the generated code?

To-Do

  • [x] Basic scraping
  • [x] Infinite scroll
  • [x] Pagination
  • [x] Documentation on how to use the generated code
  • [x] Multiple export options (playwright, others?)
  • [x] Regex to filter data from scraped text
  • [x] Drag-and-drop steps
  • [ ] [DOC] Write examples of several popular sites
  • [ ] Unit tests
  • [ ] Cool logo
  • [ ] Ship extension to Chrome Web store (Waiting for Google review)

Couldn't scrape a particular website?

While Tinking aims to be a universal tool, it's still a work in progress, and you may run into issues scraping certain websites.

Please submit an issue, and we'll look into a potential fix.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].