gaojiuli / Toapi

Licence: other

Every web site provides APIs.

Programming Languages

139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Toapi

微信公众号爬虫，公众号历史文章，文章评论，文章阅读及在看数据，可视化web页面，可部署于Windows服务器。基于Python3之flask/mysql/redis/mitmproxy/pywin32等实现，高效微信爬虫，微信公众号爬虫，历史文章，文章评论，数据更新。

Stars: ✭ 287 (-91.06%)

Mutual labels: api, crawler, spider, flask

Awesome Python Primer

自学入门 Python 优质中文资源索引，包含书籍 / 文档 / 视频，适用于爬虫 / Web / 数据分析 / 机器学习方向

Stars: ✭ 57 (-98.22%)

Mutual labels: crawler, spider, flask

Ok ip proxy pool

🍿爬虫代理IP池(proxy pool) python🍟一个还ok的IP代理池

Stars: ✭ 196 (-93.89%)

Mutual labels: crawler, spider, flask

Proxy pool

Python爬虫代理IP池(proxy pool)

Stars: ✭ 13,964 (+335.15%)

Mutual labels: crawler, spider, flask

Linkedin Profile Scraper

🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.

Stars: ✭ 171 (-94.67%)

Mutual labels: json, crawler, spider

Flask Restx

Fork of Flask-RESTPlus: Fully featured framework for fast, easy and documented API development with Flask

Stars: ✭ 1,050 (-67.28%)

Mutual labels: api, json, flask

Ncov2019 data crawler

疫情数据爬虫，2019新型冠状病毒数据仓库，轨迹数据，同乘数据，报道

Stars: ✭ 175 (-94.55%)

Mutual labels: api, crawler, spider

Flask Restplus

Fully featured framework for fast, easy and documented API development with Flask

Stars: ✭ 2,585 (-19.45%)

Mutual labels: api, json, flask

Crawlertutorial

爬蟲極簡教學（fetch, parse, search, multiprocessing, API）- PTT 為例

Stars: ✭ 282 (-91.21%)

Mutual labels: api, crawler, spider

arachnod

High performance crawler for Nodejs

Stars: ✭ 17 (-99.47%)

Mutual labels: crawler, spider

slime

🍰 一个可视化的爬虫平台

Stars: ✭ 27 (-99.16%)

Mutual labels: crawler, spider

ZhengFang System Spider

🐛一只登录正方教务管理系统，爬取数据的小爬虫

Stars: ✭ 21 (-99.35%)

Mutual labels: crawler, spider

flink-crawler

Continuous scalable web crawler built on top of Flink and crawler-commons

Stars: ✭ 48 (-98.5%)

Mutual labels: crawler, spider

crawler

A simple and flexible web crawler framework for java.

Stars: ✭ 20 (-99.38%)

Mutual labels: crawler, spider

WebCrawler

一个轻量级、快速、多线程、多管道、灵活配置的网络爬虫。

Stars: ✭ 39 (-98.78%)

Mutual labels: crawler, spider

Httpie

As easy as /aitch-tee-tee-pie/ 🥧 Modern, user-friendly command-line HTTP client for the API era. JSON support, colors, sessions, downloads, plugins & more. https://twitter.com/httpie

Stars: ✭ 53,052 (+1553.23%)

Mutual labels: api, json

Safrs

SqlAlchemy Flask-Restful Swagger Json:API OpenAPI

Stars: ✭ 255 (-92.05%)

Mutual labels: json, flask

Php Curl Class

PHP Curl Class makes it easy to send HTTP requests and integrate with web APIs

Stars: ✭ 2,903 (-9.54%)

Mutual labels: api, json

Http Fake Backend

Build a fake backend by providing the content of JSON files or JavaScript objects through configurable routes.

Stars: ✭ 253 (-92.12%)

Mutual labels: api, json

galer

A fast tool to fetch URLs from HTML attributes by crawl-in.

Stars: ✭ 138 (-95.7%)

Mutual labels: crawler, spider

View All Similar Projects ➔

Toapi

Overview

Toapi give you the ability to make every web site provides APIs.

v1.0.0 Documentation: http://www.toapi.org
Awesome: https://github.com/toapi/awesome-toapi
Organization: https://github.com/toapi

Features

Automatic converting HTML web site to API service.
Automatic caching every page of source site.
Automatic caching every request.
Support merging multiple web sites into one API service.

Get Started

Installation

$ pip install toapi

Usage

create app.py and copy the code:

from flask import request
from htmlparsing import Attr, Text
from toapi import Api, Item

api = Api()


@api.site('https://news.ycombinator.com')
@api.list('.athing')
@api.route('/posts?page={page}', '/news?p={page}')
@api.route('/posts', '/news?p=1')
class Post(Item):
    url = Attr('.storylink', 'href')
    title = Text('.storylink')


@api.site('https://news.ycombinator.com')
@api.route('/posts?page={page}', '/news?p={page}')
@api.route('/posts', '/news?p=1')
class Page(Item):
    next_page = Attr('.morelink', 'href')

    def clean_next_page(self, value):
        return api.convert_string('/' + value, '/news?p={page}', request.host_url.strip('/') + '/posts?page={page}')


api.run(debug=True, host='0.0.0.0', port=5000)

run python app.py

then open your browser and visit http://127.0.0.1:5000/posts?page=1

you will get the result like:

{
  "Page": {
    "next_page": "http://127.0.0.1:5000/posts?page=2"
  }, 
  "Post": [
    {
      "title": "Mathematicians Crack the Cursed Curve", 
      "url": "https://www.quantamagazine.org/mathematicians-crack-the-cursed-curve-20171207/"
    }, 
    {
      "title": "Stuffing a Tesla Drivetrain into a 1981 Honda Accord", 
      "url": "https://jalopnik.com/this-glorious-madman-stuffed-a-p85-tesla-drivetrain-int-1823461909"
    }
  ]
}

Contributing

Write code and test code and pull request.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

gaojiuli / Toapi

Programming Languages

Labels

Projects that are alternatives of or similar to Toapi

Toapi

Overview

Features

Get Started

Installation

Usage

Contributing