mfornos / Clojure Soup
Clojurized access for Jsoup.
Stars: ✭ 38
Programming Languages
clojure
Projects that are alternatives of or similar to Clojure Soup
Skrape.it
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.
Stars: ✭ 231 (+507.89%)
Mutual labels: jsoup, html-parser
Nsoup
NSoup is a .NET port of the jsoup (http://jsoup.org) HTML parser and sanitizer originally written in Java
Stars: ✭ 145 (+281.58%)
Mutual labels: jsoup, html-parser
Htmlparser2
The fast & forgiving HTML and XML parser
Stars: ✭ 3,299 (+8581.58%)
Mutual labels: html-parser
Modest
Modest is a fast HTML renderer implemented as a pure C99 library with no outside dependencies.
Stars: ✭ 572 (+1405.26%)
Mutual labels: html-parser
Fuzi
A fast & lightweight XML & HTML parser in Swift with XPath & CSS support
Stars: ✭ 894 (+2252.63%)
Mutual labels: html-parser
Changedetection
Automatically track website changes on Android in the background.
Stars: ✭ 563 (+1381.58%)
Mutual labels: jsoup
Jodd
Jodd! Lightweight. Java. Zero dependencies. Use what you like.
Stars: ✭ 3,616 (+9415.79%)
Mutual labels: html-parser
Jsoupxpath
An HTML parser implemented in pure Java that supports the W3C XPath 1.0 standard syntax, based on Jsoup and Antlr4. Maybe it is the best in Java, ha ha. Just try it.
Stars: ✭ 331 (+771.05%)
Mutual labels: html-parser
Rxretrojsoup
A simple API-like layer over HTML websites (scraper) for Android, RxJava2 ready!
Stars: ✭ 492 (+1194.74%)
Mutual labels: jsoup
Hquery.php
An extremely fast web scraper that parses megabytes of invalid HTML in a blink of an eye. PHP5.3+, no dependencies.
Stars: ✭ 295 (+676.32%)
Mutual labels: html-parser
Apifier
Apifier is a very simple HTML parser written in Python based on CSS selectors
Stars: ✭ 5 (-86.84%)
Mutual labels: html-parser
modest ex
Elixir library to do pipeable transformations on html strings (with CSS selectors)
Stars: ✭ 31 (-18.42%)
Mutual labels: html-parser
Geeknews
📚A pure reading App based on Material Design + MVP + RxJava2 + Retrofit + Dagger2 + Realm + Glide
Stars: ✭ 3,496 (+9100%)
Mutual labels: jsoup
Htmlagilitypack.netcore
An agile HTML parser that builds a read/write DOM and supports plain XPath or XSLT. Deprecated, as there is a new maintainer for the original HAP project.
Stars: ✭ 31 (-18.42%)
Mutual labels: html-parser
Clojure Soup
Clojurized access for Jsoup.
Examples
Get some links of a web page:
(use 'jsoup.soup)

($ (get! "http://google.com" :user-agent "CoCo/1.0") ;; get request with options
   td "a[href]"                                      ;; Jsoup selectors
   (attr "abs:href"))                                ;; attribute selector
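For orientation, the same link extraction can be sketched with plain Jsoup interop. This is not how Clojure Soup is implemented, only a rough equivalent. It assumes Jsoup 1.11.1 or later on the classpath (for `Elements.eachAttr`); the `page-links` helper, the `sample-html` string and the example.com base URI are made up for illustration. It parses an inline string instead of fetching a page, so it runs offline.

```clojure
(import 'org.jsoup.Jsoup)

;; Hypothetical sample document: one relative link, one absolute link,
;; and one anchor without an href (which the selector skips).
(def sample-html
  (str "<table><tr><td>"
       "<a href=\"/about\">About</a>"
       "<a href=\"http://example.org/x\">X</a>"
       "<a name=\"n\">no href</a>"
       "</td></tr></table>"))

(defn page-links
  "Returns the absolute URLs of all a[href] elements inside td cells.
  base-uri is used to resolve relative links via the abs: prefix."
  [html base-uri]
  (let [doc (Jsoup/parse ^String html ^String base-uri)]
    (vec (.eachAttr (.select doc "td a[href]") "abs:href"))))

(page-links sample-html "http://example.com/")
```

The `abs:href` attribute key is a Jsoup convention: it resolves the `href` value against the document's base URI, which is why the examples here pass one whenever the content does not come from a URL.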
Get all emoji names from 'emoji-cheat-sheet.com', joined by vertical bars:
($ (get! "http://www.emoji-cheat-sheet.com/")
   "li div:has(span.emoji)" (text)
   (map #(clojure.string/replace % ":" ""))
   (clojure.string/join "|"))
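The post-processing steps in that example are ordinary sequence functions. A minimal sketch of just that part, assuming the selected texts arrive as a sequence of strings shaped like `":smile:"` (the `names` vector below is invented sample data, not output from the site):

```clojure
(require '[clojure.string :as str])

;; Hypothetical texts, standing in for what (text) would return.
(def names [":smile:" ":heart:" ":+1:"])

(def joined
  (->> names
       (map #(str/replace % ":" "")) ; strip the surrounding colons
       (str/join "|")))              ; join with a single vertical bar

(println joined) ; prints smile|heart|+1
```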
Post with basic authentication:
($ (post! "http://127.0.0.1"
          :user-agent "CoCo/1.0"
          :follow-redirects true
          :auth (basic-auth "night" "password")
          :cookies {:user "night" :other "value"}
          :data {:param "one" :another "2"}) ;; post options & data
   td a)                                     ;; Jsoup selectors
Parse a local file:
($ (slurp! "test-content.html" :encoding "UTF-8" :base-uri "http://base") "a[href]")
EOF