All Projects → mfornos → Clojure Soup

mfornos / Clojure Soup

Clojurized access for Jsoup.

Programming Languages

clojure
4091 projects

Projects that are alternatives of or similar to Clojure Soup

Skrape.it
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.
Stars: ✭ 231 (+507.89%)
Mutual labels:  jsoup, html-parser
Nsoup
NSoup is a .NET port of the jsoup (http://jsoup.org) HTML parser and sanitizer originally written in Java
Stars: ✭ 145 (+281.58%)
Mutual labels:  jsoup, html-parser
Htmlparser2
The fast & forgiving HTML and XML parser
Stars: ✭ 3,299 (+8581.58%)
Mutual labels:  html-parser
Modest
Modest is a fast HTML renderer implemented as a pure C99 library with no outside dependencies.
Stars: ✭ 572 (+1405.26%)
Mutual labels:  html-parser
Spider Flow
新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。
Stars: ✭ 365 (+860.53%)
Mutual labels:  jsoup
Crawlerforreader
Android 本地网络小说爬虫,基于jsoup及xpath
Stars: ✭ 312 (+721.05%)
Mutual labels:  jsoup
Justext
Heuristic based boilerplate removal tool
Stars: ✭ 418 (+1000%)
Mutual labels:  html-parser
scraper
Scraper example built on Scala, Akka and Jsoup
Stars: ✭ 15 (-60.53%)
Mutual labels:  jsoup
Fuzi
A fast & lightweight XML & HTML parser in Swift with XPath & CSS support
Stars: ✭ 894 (+2252.63%)
Mutual labels:  html-parser
Flesh
Android上福利满满的app,宅男神器
Stars: ✭ 363 (+855.26%)
Mutual labels:  jsoup
Changedetection
Automatically track websites changes on Android in background.
Stars: ✭ 563 (+1381.58%)
Mutual labels:  jsoup
Jodd
Jodd! Lightweight. Java. Zero dependencies. Use what you like.
Stars: ✭ 3,616 (+9415.79%)
Mutual labels:  html-parser
Jsoupxpath
纯Java实现的支持W3C Xpath 1.0标准语法的HTML解析器。A html parser with xpath base on Jsoup and Antlr4. Maybe it is the best in java,ha ha.Just try it.
Stars: ✭ 331 (+771.05%)
Mutual labels:  html-parser
Rxretrojsoup
A simple API-like from html website (scrapper) for Android, RxJava2 ready !
Stars: ✭ 492 (+1194.74%)
Mutual labels:  jsoup
Hquery.php
An extremely fast web scraper that parses megabytes of invalid HTML in a blink of an eye. PHP5.3+, no dependencies.
Stars: ✭ 295 (+676.32%)
Mutual labels:  html-parser
Apifier
Apifier is a very simple HTML parser written in Python based on CSS selectors
Stars: ✭ 5 (-86.84%)
Mutual labels:  html-parser
modest ex
Elixir library to do pipeable transformations on html strings (with CSS selectors)
Stars: ✭ 31 (-18.42%)
Mutual labels:  html-parser
Geeknews
📚A pure reading App based on Material Design + MVP + RxJava2 + Retrofit + Dagger2 + Realm + Glide
Stars: ✭ 3,496 (+9100%)
Mutual labels:  jsoup
Lcg
吾爱破解第三方安卓应用
Stars: ✭ 371 (+876.32%)
Mutual labels:  jsoup
Htmlagilitypack.netcore
An agile HTML parser that builds a read/write DOM and supports plain XPATH or XSLT. Deprecated as there's new maintainer for original HAP project.
Stars: ✭ 31 (-18.42%)
Mutual labels:  html-parser

Clojure Soup

Clojurized access for Jsoup.

clojars version

Examples

Get some links of a web page:

(use 'jsoup.soup)

($ (get! "http://google.com" :user-agent "CoCo/1.0") ;; get request with options
   td "a[href]" ;; Jsoup selectors
   (attr "abs:href")) ;; attribute selector

Get all Emoji names concatenated by single bars from 'emoji-cheat-sheet.com':

($ (get! "http://www.emoji-cheat-sheet.com/") 
   "li div:has(span.emoji)" (text) 
   (map #(clojure.string/replace % ":" "")) 
   (clojure.string/join "|")) 

Post with basic authentication:

($ (post! "http://127.0.0.1"  
        :user-agent "CoCo/1.0" 
        :follow-redirects true
        :auth (basic-auth "night" "password")
        :cookies {:user "night" :other "value"}
        :data {:param "one" :another "2"}) ;; post options & data
 td a) ;; Jsoup selectors

Parse a local file:

($ (slurp! "test-content.html" :encoding "UTF-8" :base-uri "http://base") "a[href]")

Build Status

EOF

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].