All Projects → Docbao → Similar Projects or Alternatives

134 Open source projects that are alternatives of or similar to Docbao

Decapitated
Headless 'Chrome' Orchestration in R
Stars: ✭ 65 (-71.74%)
Mutual labels:  web-scraping
Webmiddle
Node.js framework for modular web scraping and data extraction
Stars: ✭ 13 (-94.35%)
Mutual labels:  web-scraping
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+559.13%)
Mutual labels:  web-scraping
Reader
Extract clean(er), readable text from web pages via Mercury Web Parser.
Stars: ✭ 75 (-67.39%)
Mutual labels:  web-scraping
User Agents
A JavaScript library for generating random user agents with data that's updated daily.
Stars: ✭ 485 (+110.87%)
Mutual labels:  web-scraping
Html Metadata
MetaData html scraper and parser for Node.js (supports Promises and callback style)
Stars: ✭ 129 (-43.91%)
Mutual labels:  web-scraping
Project Tauro
A Router WiFi key recovery/cracking tool with a twist.
Stars: ✭ 52 (-77.39%)
Mutual labels:  web-scraping
Scrapy Training
Scrapy Training companion code
Stars: ✭ 157 (-31.74%)
Mutual labels:  web-scraping
Faster Than Requests
Faster requests on Python 3
Stars: ✭ 639 (+177.83%)
Mutual labels:  web-scraping
Pulsar
Turn large Web sites into tables and charts using simple SQLs.
Stars: ✭ 100 (-56.52%)
Mutual labels:  web-scraping
Rvest
Simple web scraping for R
Stars: ✭ 1,253 (+444.78%)
Mutual labels:  web-scraping
Selectolax
Python binding to Modest engine (fast HTML5 parser with CSS selectors).
Stars: ✭ 368 (+60%)
Mutual labels:  web-scraping
Sqrape
Simple Query Scraping with CSS and Go Reflection (MOVED to Gitlab)
Stars: ✭ 144 (-37.39%)
Mutual labels:  web-scraping
Arachnid
Powerful web scraping framework for Crystal
Stars: ✭ 68 (-70.43%)
Mutual labels:  web-scraping
Web Database Analytics
Web scrapping and related analytics using Python tools
Stars: ✭ 175 (-23.91%)
Mutual labels:  web-scraping
Instago
Download/access photos, videos, stories, story highlights, postlives, following and followers of Instagram
Stars: ✭ 59 (-74.35%)
Mutual labels:  web-scraping
30 Days Of Python
Learn Python for the next 30 (or so) Days.
Stars: ✭ 1,748 (+660%)
Mutual labels:  web-scraping
Uc Davis Cs Exams Analysis
📈 Regression and Classification with UC Davis student quiz data and exam data
Stars: ✭ 33 (-85.65%)
Mutual labels:  web-scraping
Trump Lies
Tutorial: Web scraping in Python with Beautiful Soup
Stars: ✭ 201 (-12.61%)
Mutual labels:  web-scraping
Youtube tutorials
Collection of scripts corresponding to LucidProgramming YouTube tutorials
Stars: ✭ 769 (+234.35%)
Mutual labels:  web-scraping
Scrapyd Cluster On Heroku
Set up free and scalable Scrapyd cluster for distributed web-crawling with just a few clicks. DEMO 👉
Stars: ✭ 106 (-53.91%)
Mutual labels:  web-scraping
Pythoncode Tutorials
The Python Code Tutorials
Stars: ✭ 544 (+136.52%)
Mutual labels:  web-scraping
Web Scraping
Detailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, SHFE and news data crawlers on BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist
Stars: ✭ 153 (-33.48%)
Mutual labels:  web-scraping
Scrapple
A framework for creating semi-automatic web content extractors
Stars: ✭ 464 (+101.74%)
Mutual labels:  web-scraping
Splashr
💦 Tools to Work with the 'Splash' JavaScript Rendering Service in R
Stars: ✭ 93 (-59.57%)
Mutual labels:  web-scraping
Daftlistings
A library that enables programmatic interaction with daft.ie. Daft.ie has nationwide coverage and contains about 80% of the total available properties in Ireland.
Stars: ✭ 86 (-62.61%)
Mutual labels:  web-scraping
Autoscraper
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+1672.61%)
Mutual labels:  web-scraping
Phpscraper
PHP Scraper - an highly opinionated web-interface for PHP
Stars: ✭ 148 (-35.65%)
Mutual labels:  web-scraping
Detect Cms
PHP Library for detecting CMS
Stars: ✭ 78 (-66.09%)
Mutual labels:  web-scraping
Grab
Web Scraping Framework
Stars: ✭ 2,147 (+833.48%)
Mutual labels:  web-scraping
Ping Sm
Receive an email or Telegram message as soon as Migros Sanalmarket is available for delivery in your neighborhood.
Stars: ✭ 71 (-69.13%)
Mutual labels:  web-scraping
Zillow
Zillow Scraper for Python using Selenium
Stars: ✭ 141 (-38.7%)
Mutual labels:  web-scraping
Cascadia
Go cascadia package command line CSS selector
Stars: ✭ 67 (-70.87%)
Mutual labels:  web-scraping
R Web Scraping Cheat Sheet
Guide, reference and cheatsheet on web scraping using rvest, httr and Rselenium.
Stars: ✭ 207 (-10%)
Mutual labels:  web-scraping
Social Media Profile Scrapers
Fetch user's data across social media
Stars: ✭ 60 (-73.91%)
Mutual labels:  web-scraping
Actor Page Analyzer
Apify actor that opens a web page in headless Chrome and analyzes the HTML and JavaScript objects, looks for schema.org microdata and JSON-LD metadata, analyzes AJAX requests, etc.
Stars: ✭ 124 (-46.09%)
Mutual labels:  web-scraping
Scrapy Craigslist
Web Scraping Craigslist's Engineering Jobs in NY with Scrapy
Stars: ✭ 54 (-76.52%)
Mutual labels:  web-scraping
Learnpythonforresearch
This repository provides everything you need to get started with Python for (social science) research.
Stars: ✭ 163 (-29.13%)
Mutual labels:  web-scraping
Actor Google Search Scraper
Apify actor that crawls Google Search result pages (SERPs) and extracts a list of organic results, ads, related queries and more. It supports selection of custom country, language and location.
Stars: ✭ 38 (-83.48%)
Mutual labels:  web-scraping
Ayakashi
⚡️ Ayakashi.io - The next generation web scraping framework
Stars: ✭ 117 (-49.13%)
Mutual labels:  web-scraping
Snoop
Snoop — инструмент разведки на основе открытых данных (OSINT world)
Stars: ✭ 886 (+285.22%)
Mutual labels:  web-scraping
Selenium Python Helium
Selenium-python but lighter: Helium is the best Python library for web automation.
Stars: ✭ 2,732 (+1087.83%)
Mutual labels:  web-scraping
Letterboxd recommendations
Scraping publicly-accessible Letterboxd data and creating a movie recommendation model with it that can generate recommendations when provided with a Letterboxd username
Stars: ✭ 23 (-90%)
Mutual labels:  web-scraping
Save For Offline
Android app for saving webpages for offline reading.
Stars: ✭ 114 (-50.43%)
Mutual labels:  web-scraping
Spidr
A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+185.22%)
Mutual labels:  web-scraping
Netflix Clone
Netflix like full-stack application with SPA client and backend implemented in service oriented architecture
Stars: ✭ 156 (-32.17%)
Mutual labels:  web-scraping
Coolqlcool
Nextjs server to query websites with GraphQL
Stars: ✭ 623 (+170.87%)
Mutual labels:  web-scraping
Rod
A Devtools driver for web automation and scraping
Stars: ✭ 1,392 (+505.22%)
Mutual labels:  web-scraping
Scrapy Fake Useragent
Random User-Agent middleware based on fake-useragent
Stars: ✭ 520 (+126.09%)
Mutual labels:  web-scraping
Bet On Sibyl
Machine Learning Model for Sport Predictions (Football, Basketball, Baseball, Hockey, Soccer & Tennis)
Stars: ✭ 190 (-17.39%)
Mutual labels:  web-scraping
Rpa
UI.Vision: Open-Source RPA Software (formerly Kantu) - Modern Robotic Process Automation with Selenium IDE++
Stars: ✭ 477 (+107.39%)
Mutual labels:  web-scraping
Sillynium
Automate the creation of Python Selenium Scripts by drawing coloured boxes on webpage elements
Stars: ✭ 100 (-56.52%)
Mutual labels:  web-scraping
Awesome Web Scraping
List of libraries, tools and APIs for web scraping and data processing.
Stars: ✭ 4,510 (+1860.87%)
Mutual labels:  web-scraping
Helena
A Chrome extension for writing custom web scraping programs and web automation programs. Just demonstrate how to collect the first row of data, then let the extension write the program for collecting all rows.
Stars: ✭ 151 (-34.35%)
Mutual labels:  web-scraping
Hockey Scraper
Python Package for scraping NHL Play-by-Play and Shift data
Stars: ✭ 93 (-59.57%)
Mutual labels:  web-scraping
City Scrapers
Scrape, standardize and share public meetings from local government websites
Stars: ✭ 220 (-4.35%)
Mutual labels:  web-scraping
Short Jokes Dataset
Python scripts for building 'Short Jokes' dataset, featured on Kaggle
Stars: ✭ 215 (-6.52%)
Mutual labels:  web-scraping
Twitter Intelligence
Twitter Intelligence OSINT project performs tracking and analysis of the Twitter
Stars: ✭ 179 (-22.17%)
Mutual labels:  web-scraping
Juno crawler
Scrapy crawler to collect data on the back catalog of songs listed for sale.
Stars: ✭ 150 (-34.78%)
Mutual labels:  web-scraping
Humanoid
Node.js package to bypass CloudFlare's anti-bot JavaScript challenges
Stars: ✭ 88 (-61.74%)
Mutual labels:  web-scraping
1-60 of 134 similar projects