All Projects → fcannizzaro → ksoup

fcannizzaro / ksoup

Licence: Apache-2.0 license
Kotlin Wrapper for Jsoup

Programming Languages

kotlin
9241 projects

Projects that are alternatives of or similar to ksoup

Jsoup Annotations
Jsoup Annotations POJO
Stars: ✭ 242 (+310.17%)
Mutual labels:  scraping, jsoup
LInkedIn-Reverese-Lookup
🔎Search LinkedIn profile by email address📧
Stars: ✭ 20 (-66.1%)
Mutual labels:  scraping
scrapy-fieldstats
A Scrapy extension to log items coverage when the spider shuts down
Stars: ✭ 17 (-71.19%)
Mutual labels:  scraping
covid19br-pub
Projeto de monitoramento de publicações oficiais relacionadas a COVID-19 no Brasil.
Stars: ✭ 12 (-79.66%)
Mutual labels:  scraping
ioweb
Web Scraping Framework
Stars: ✭ 31 (-47.46%)
Mutual labels:  scraping
PrawWallpaperDownloader
Download images from reddit
Stars: ✭ 18 (-69.49%)
Mutual labels:  scraping
RARBG-scraper
With Selenium headless browsing and CAPTCHA solving
Stars: ✭ 38 (-35.59%)
Mutual labels:  scraping
html-table-extractor
extract data from html table
Stars: ✭ 74 (+25.42%)
Mutual labels:  scraping
selectorlib
A library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them
Stars: ✭ 53 (-10.17%)
Mutual labels:  scraping
node-red-contrib-nbrowser
Provides a virtual web browser (a.k.a. "headless browser") appearing as a node.
Stars: ✭ 31 (-47.46%)
Mutual labels:  scraping
table2pojo
Generate POJOs for database table/columns
Stars: ✭ 16 (-72.88%)
Mutual labels:  pojo
asyncio-hn
Python (asyncio) wrapper for hackernews api
Stars: ✭ 27 (-54.24%)
Mutual labels:  scraping
reason-rust-scraper
🦀 Scraping & crawling websites using Rust, and ReasonML
Stars: ✭ 21 (-64.41%)
Mutual labels:  scraping
shorter.recipes
A website dedicated to making recipes from any website easy to read.
Stars: ✭ 27 (-54.24%)
Mutual labels:  scraping
yttrex
youtube & tiktok analysis + youchoose recommendation custmizer. backend, extensions, and tooling
Stars: ✭ 31 (-47.46%)
Mutual labels:  scraping
diffbot-php-client
[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Stars: ✭ 53 (-10.17%)
Mutual labels:  scraping
oversmash
Overwatch API library for player details and career stats
Stars: ✭ 42 (-28.81%)
Mutual labels:  scraping
docker-selenium-lambda
The simplest demo of chrome automation by python and selenium in AWS Lambda
Stars: ✭ 172 (+191.53%)
Mutual labels:  scraping
scrapman
Retrieve real (with Javascript executed) HTML code from an URL, ultra fast and supports multiple parallel loading of webs
Stars: ✭ 21 (-64.41%)
Mutual labels:  scraping
puppeteer-botcheck
🕵‍♂ Bot detection tests for Puppeteer. Hide and seek!
Stars: ✭ 42 (-28.81%)
Mutual labels:  scraping

ksoup

Kotlin Wrapper for Jsoup using Pojo and Delegated Properties

Build Status

Gradle Dependency

Step 1.

Add the JitPack repository to your build file

allprojects {
  repositories {
    ...
    maven { url 'https://jitpack.io' }
  }
}

Step 2.

Add the dependency

dependencies {
  compile 'com.github.fcannizzaro:ksoup:1.0.4'
}

Usage

val html = Jsoup.connect("https://www.npmjs.com/").get()
val ksoup = Ksoup(html)

/*
*  Scrape!
*/

val npm : NPM = ksoup.from(html, NPM())

// or

val packages: List<Package> = ksoup.fromList(html, Package())

See Sample Code

Model

IKsoup([query: String])

afterBind

method called after object binding is completed.

class Package : IKsoup(".item") {

  // properties

  override fun afterBind() {
    // called after object binding is completed.
  }

}

Delegated Methods

trim param is always true

bindText(query: String, [trim: Boolean]) : String?

val title by bindText("head > title")

bindImage(query: String, [trim: Boolean]) : String?

val image by bindImage(".icon")

bindLink(query: String, [trim: Boolean]) : String?

val link by bindLink("a")

bindAttr(query: String, attr: String | KAttr, [trim: Boolean]) : String?

val checked by bindAttr("form > checkbox", KAttr.checked)

bindList(clazz: IKsoup) : List

val items : List<Package> by bindList(Package())

bindClass(clazz: IKsoup, parent: IKsoup) : T

class Package : IKsoup(".item") {

  class Info : IKsoup(".subitem") {
    // Info properties
  }

  // Package properties

  val info: Info by bindClass(Info(), this)

}

Author

Francesco Saverio Cannizzaro (fcannizzaro)

License

Copyright 2017 Francesco Saverio Cannizzaro

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

   http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].