All Projects → github → Lightcrawler

github / Lightcrawler

Licence: isc
Crawl a website and run it through Google lighthouse

Programming Languages

javascript
184084 projects - #8 most used programming language

Projects that are alternatives of or similar to Lightcrawler

Pychromeless
Python Lambda Chrome Automation (naming pending)
Stars: ✭ 219 (-83.64%)
Mutual labels:  crawler, chrome
Squidwarc
Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
Stars: ✭ 125 (-90.66%)
Mutual labels:  crawler, chrome
Headless Chrome Crawler
Distributed crawler powered by Headless Chrome
Stars: ✭ 5,129 (+283.05%)
Mutual labels:  crawler, chrome
Jvppeteer
Headless Chrome For Java (Java 爬虫)
Stars: ✭ 193 (-85.59%)
Mutual labels:  crawler, chrome
Awesome Java Crawler
本仓库收集整理爬虫相关资源,开发语言以Java为主
Stars: ✭ 228 (-82.97%)
Mutual labels:  crawler, chrome
Ferret
Declarative web scraping
Stars: ✭ 4,837 (+261.24%)
Mutual labels:  crawler, chrome
Puppeteer Walker
a puppeteer walker 🕷 🕸
Stars: ✭ 78 (-94.17%)
Mutual labels:  crawler, chrome
Catblock
CatBlock - An ad-blocking extension (with an ability to replace ads by pictures) for all major desktop browsers!
Stars: ✭ 84 (-93.73%)
Mutual labels:  chrome
Proxy Pool
爬虫代理IP池服务,可供其他爬虫程序通过restapi获取
Stars: ✭ 91 (-93.2%)
Mutual labels:  crawler
Geziyor
Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Stars: ✭ 1,246 (-6.95%)
Mutual labels:  crawler
Taiwan News Crawlers
Scrapy-based Crawlers for news of Taiwan
Stars: ✭ 83 (-93.8%)
Mutual labels:  crawler
Chrome Extension Guide
Chrome 扩展(插件) 开发官方文档中文版翻译
Stars: ✭ 85 (-93.65%)
Mutual labels:  chrome
Ktspeechcrawler
Automatically constructing corpus for automatic speech recognition from YouTube videos
Stars: ✭ 92 (-93.13%)
Mutual labels:  crawler
Weibo Album Crawler
新浪微博相册大图多线程爬虫。
Stars: ✭ 83 (-93.8%)
Mutual labels:  crawler
The Tab Of Words
A minimal Chrome / Firefox extension to help you learn Japanese words in each new tab.
Stars: ✭ 94 (-92.98%)
Mutual labels:  chrome
Tumblr crawler
tumblr解析网站
Stars: ✭ 83 (-93.8%)
Mutual labels:  crawler
Gf Secrets
Secret and/ credential patterns used for gf.
Stars: ✭ 96 (-92.83%)
Mutual labels:  crawler
Scrapoxy
Scrapoxy hides your scraper behind a cloud. It starts a pool of proxies to send your requests. Now, you can crawl without thinking about blacklisting!
Stars: ✭ 1,322 (-1.27%)
Mutual labels:  crawler
Docker Protractor Headless
Protractor end to end testing for AngularJS - dockerised and headless with real Chrome.
Stars: ✭ 90 (-93.28%)
Mutual labels:  chrome
Roam Research Private Api
Private API to enable API access for Roam Research. Now you can connect Roam to your other projects.
Stars: ✭ 88 (-93.43%)
Mutual labels:  chrome

lightcrawler

Crawl a website and run it through Google lighthouse

npm install --save-dev lightcrawler

lightcrawler --url https://atom.io/ --config lightcrawler-config.json

where lightcrawler-config.json looks something like this:

{
  "extends": "lighthouse:default",
  "settings": {
    "crawler": {
      "maxDepth": 2,
      "maxChromeInstances": 5
    },
    "onlyCategories": [
      "Accessibility",
      "Performance",
      "Best Practices"
    ],
    "onlyAudits": [
      "accesskeys",
      "aria-allowed-attr",
      "external-anchors-use-rel-noopener",
      "geolocation-on-start",
      "no-document-write",
      "no-mutation-events",
      "no-old-flexbox",
      "time-to-interactive",
      "user-timings",
      "viewport",
      "without-javascript"
    ]
  }
}

Enjoy!

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].