All Projects → Mechaml → Similar Projects or Alternatives

228 Open source projects that are alternatives of or similar to Mechaml

Tinking

🧶 Extract data from any website without code, just clicks.

Stars: ✭ 331 (+451.67%)

Mutual labels: scraping

Scrapy Crawlera

Crawlera middleware for Scrapy

Stars: ✭ 281 (+368.33%)

Mutual labels: scraping

Geeksforgeeks.pdf

Topic wise PDFs of Geeks for Geeks articles. (Last updated in October 2018)

Stars: ✭ 489 (+715%)

Mutual labels: scraping

Comic Dl

Comic-dl is a command line tool to download manga and comics from various comic and manga sites. Supported sites : readcomiconline.to, mangafox.me, comic naver and many more.

Stars: ✭ 365 (+508.33%)

Mutual labels: scraping

ARGUS

ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9

Stars: ✭ 68 (+13.33%)

Mutual labels: scraping

Tools for various online judges. Downloading sample cases, generating additional test cases, testing your code, and submitting it.

Stars: ✭ 517 (+761.67%)

Mutual labels: scraping

Elixir Scrape

Scrape any website, article or RSS/Atom Feed with ease!

Stars: ✭ 306 (+410%)

Mutual labels: scraping

Lulu

[Unmaintained] A simple and clean video/music/image downloader 👾

Stars: ✭ 789 (+1215%)

Mutual labels: scraping

schedule-tweet

Schedules tweets using TweetDeck

Stars: ✭ 14 (-76.67%)

Mutual labels: scraping

Crawly

Crawly, a high-level web crawling & scraping framework for Elixir.

Stars: ✭ 440 (+633.33%)

Mutual labels: scraping

Coronadatascraper

COVID-19 Coronavirus data scraped from government and curated data sources.

Stars: ✭ 372 (+520%)

Mutual labels: scraping

raspagem-de-dados-fatec

📓 Minicurso de raspagem de dados web com Python ministrado na Semana de Tecnologia da FATEC Jundiaí

Stars: ✭ 22 (-63.33%)

Mutual labels: scraping

Headless Chrome Crawler

Distributed crawler powered by Headless Chrome

Stars: ✭ 5,129 (+8448.33%)

Mutual labels: scraping

Socialreaper

Social media scraping / data collection library for Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs

Stars: ✭ 338 (+463.33%)

Mutual labels: scraping

Instagram Scraper

Scrape the Instagram frontend. Inspired from twitter-scraper by @kennethreitz.

Stars: ✭ 903 (+1405%)

Mutual labels: scraping

Spidermon

Scrapy Extension for monitoring spiders execution.

Stars: ✭ 309 (+415%)

Mutual labels: scraping

Facebook Scraper

Scrape Facebook public pages without an API key

Stars: ✭ 499 (+731.67%)

Mutual labels: scraping

Sasila

一个灵活、友好的爬虫框架

Stars: ✭ 286 (+376.67%)

Mutual labels: scraping

Pge Outages

Tracking PG&E outages

Stars: ✭ 43 (-28.33%)

Mutual labels: scraping

Gopa

[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn

Stars: ✭ 277 (+361.67%)

Mutual labels: scraping

Scrapple

A framework for creating semi-automatic web content extractors

Stars: ✭ 464 (+673.33%)

Mutual labels: scraping

facebook-discussion-tk

A collection of tools to (semi-)automatically collect and analyze data from online discussions on Facebook groups and pages.

Stars: ✭ 33 (-45%)

Mutual labels: scraping

Parsel

Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

Stars: ✭ 628 (+946.67%)

Mutual labels: scraping

policy-data-analyzer

Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.

Stars: ✭ 22 (-63.33%)

Mutual labels: scraping

Jekyll

Jekyll-based static site for The Programming Historian

Stars: ✭ 387 (+545%)

Mutual labels: scraping

Data Science

Collection of useful data science topics along with code and articles

Stars: ✭ 315 (+425%)

Mutual labels: scraping

memes-api

API for scrapping common meme sites

Stars: ✭ 17 (-71.67%)

Mutual labels: scraping

Tabula

Tabula is a tool for liberating data tables trapped inside PDF files

Stars: ✭ 5,420 (+8933.33%)

Mutual labels: scraping

Post Tuto Deployment

Build and deploy a machine learning app from scratch 🚀

Stars: ✭ 368 (+513.33%)

Mutual labels: scraping

Scrapy Cluster

This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.

Stars: ✭ 921 (+1435%)

Mutual labels: scraping

Katana

A Python Tool For google Hacking

Stars: ✭ 355 (+491.67%)

Mutual labels: scraping

Gazpacho

🥫 The simple, fast, and modern web scraping library

Stars: ✭ 525 (+775%)

Mutual labels: scraping

Autoscraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

Stars: ✭ 4,077 (+6695%)

Mutual labels: scraping

Django Dynamic Scraper

Creating Scrapy scrapers via the Django admin interface

Stars: ✭ 1,024 (+1606.67%)

Mutual labels: scraping

Social Media Profiles Regexs

📇 Extract social media profiles and more with regular expressions

Stars: ✭ 324 (+440%)

Mutual labels: scraping

Facebook data analyzer

Analyze facebook copy of your data with ruby language. Download zip file from facebook and get info about friends ranking by message, vocabulary, contacts, friends added statistics and more

Stars: ✭ 515 (+758.33%)

Mutual labels: scraping

Linkedin Scraper using Selenium Web Driver, Chromium headless, Docker and Scrapy

Stars: ✭ 309 (+415%)

Mutual labels: scraping

Webhere

HTML scraping for Objective-C.

Stars: ✭ 16 (-73.33%)

Mutual labels: scraping

Edu Mail Generator

Generate Free Edu Mail(s) within minutes

Stars: ✭ 301 (+401.67%)

Mutual labels: scraping

Nickjs

Web scraping library made by the Phantombuster team. Modern, simple & works on all websites. (Deprecated)

Stars: ✭ 494 (+723.33%)

Mutual labels: scraping

Clean Text

🧹 Python package for text cleaning

Stars: ✭ 284 (+373.33%)

Mutual labels: scraping

Mtnt

Code for the collection and analysis of the MTNT dataset

Stars: ✭ 48 (-20%)

Mutual labels: scraping

Lambdasoup

Functional HTML scraping and rewriting with CSS in OCaml

Stars: ✭ 280 (+366.67%)

Mutual labels: scraping

Ferret

Declarative web scraping

Stars: ✭ 4,837 (+7961.67%)

Mutual labels: scraping

Apify Js

Apify SDK — The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

Stars: ✭ 3,154 (+5156.67%)

Mutual labels: scraping

Imagescraper

✂️ High performance, multi-threaded image scraper

Stars: ✭ 630 (+950%)

Mutual labels: scraping

instagram explorer

📷 An app to scrap instagram posts and analyze data.

Stars: ✭ 17 (-71.67%)

Mutual labels: scraping

Dataflowkit

Extract structured data from web sites. Web sites scraping.

Stars: ✭ 456 (+660%)

Mutual labels: scraping

jazz

The Scripting Engine that Combines Speed, Safety, and Simplicity

Stars: ✭ 132 (+120%)

Mutual labels: scraping

Configs

Public, free to use, repository with diggers configs for scraping / extracting data from various e-commerce websites and online stores

Stars: ✭ 37 (-38.33%)

Mutual labels: scraping

bots-zoo

No description or website provided.

Stars: ✭ 59 (-1.67%)

Mutual labels: scraping

Mechanize

Mechanize is a ruby library that makes automated web interaction easy.

Stars: ✭ 4,158 (+6830%)

Mutual labels: scraping

scraper

Nodejs web scraper. Contains a command line, docker container, terraform module and ansible roles for distributed cloud scraping. Supported databases: SQLite, MySQL, PostgreSQL. Supported headless clients: Puppeteer, Playwright, Cheerio, JSdom.

Stars: ✭ 37 (-38.33%)

Mutual labels: scraping

Newcrawler

Free Web Scraping Tool with Java

Stars: ✭ 589 (+881.67%)

Mutual labels: scraping

Lookyloo

Lookyloo is a web interface that allows users to capture a website page and then display a tree of domains that call each other.

Stars: ✭ 381 (+535%)

Mutual labels: scraping

Awesome Python Primer

自学入门 Python 优质中文资源索引，包含书籍 / 文档 / 视频，适用于爬虫 / Web / 数据分析 / 机器学习方向