gudaoxuri / keyword-extract

Licence: Apache-2.0 license
A simple, efficient tool for extracting keywords from URLs

URL Keyword Extraction

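A single-file URL keyword extraction tool with no third-party dependencies. It supports online rule updates and URLs that use non-standard protocols.

Usage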

<dependency>
  <groupId>com.ecfront</groupId>
  <artifactId>keyword-extract</artifactId>
  <version>1.6</version>
</dependency>

// Extract keywords
KeyWordExtract.Result result = KeyWordExtract.extract(url);

// Use online rules
KeyWordExtract.loadOnlineRules("https://raw.githubusercontent.com/gudaoxuri/keyword-extract/master/src/main/resources/kwe-rules.txt");

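Rule configuration

The local rules file is bundled in the jar by default. To change the rules, create a kwe-rules.txt file in the classpath root; this file overrides the default rules.

Using online rules overrides the custom rules.

# One rule per line; fields are separated by |
# Rules are either general rules or custom rules; the latter are processed with JS code

# General rules
<name>|<host>|<keyword location, query: in the query string, path: in the URL path>|<for query, the key that holds the keyword; for path, the /-separated offset>|<decoding method, currently only decodeURI or empty>|<encoding>

e.g.: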

百度|www.baidu.com|query|wd|decodeURI|UTF-8
搜狗微信|weixin.sogou.com|query|query|encodeURI|UTF-8
苏宁|search.suning.com|path|0|decodeURI|UTF-8
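
To make the field layout concrete, here is a minimal Java sketch of how a general rule line could be applied to a URL. It is not the library's implementation: the class name `GeneralRuleSketch`, the method `apply`, and the choice that offset 0 means the first path segment after the leading `/` are all assumptions made for illustration. The rule names are written in ASCII here; the name field is free-form.

```java
import java.net.URI;
import java.net.URLDecoder;
import java.nio.charset.Charset;

final class GeneralRuleSketch {

    /** Applies one general rule line to a URL; returns null when the rule does not match. */
    static String apply(String ruleLine, String url) {
        // Fields: name|host|location|key-or-offset|decoder|encoding
        String[] f = ruleLine.split("\\|", -1);
        URI uri = URI.create(url);
        if (f.length != 6 || !f[1].equals(uri.getHost())) {
            return null;
        }
        String raw = null;
        if ("query".equals(f[2]) && uri.getRawQuery() != null) {
            // Look for the configured key among the query parameters.
            for (String pair : uri.getRawQuery().split("&")) {
                int eq = pair.indexOf('=');
                if (eq > 0 && pair.substring(0, eq).equals(f[3])) {
                    raw = pair.substring(eq + 1);
                }
            }
        } else if ("path".equals(f[2])) {
            // Assumption: offset 0 is the first segment after the leading "/".
            String[] segments = uri.getRawPath().replaceFirst("^/", "").split("/");
            int offset = Integer.parseInt(f[3]);
            if (offset < segments.length) {
                raw = segments[offset];
            }
        }
        if (raw == null) {
            return null;
        }
        // Assumption: "decodeURI" is treated as percent-decoding in the rule's encoding.
        // URLDecoder also turns "+" into a space, which JavaScript's decodeURI does not.
        return "decodeURI".equals(f[4]) ? URLDecoder.decode(raw, Charset.forName(f[5])) : raw;
    }

    public static void main(String[] args) {
        System.out.println(apply("Baidu|www.baidu.com|query|wd|decodeURI|UTF-8",
                "https://www.baidu.com/s?wd=hello%20world")); // hello world
        System.out.println(apply("Suning|search.suning.com|path|0|decodeURI|UTF-8",
                "https://search.suning.com/phone/")); // phone
    }
}
```

A rule whose host does not equal the URL's host yields null in this sketch, so a caller could try the rules in order and stop at the first non-null result.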

# Custom rules
<name>|<host>|<JS code; the input parameter is uri, the return value is result>

e.g.:

微博|s.weibo.com|var uri = decodeURI(decodeURI(uri)); var kv = uri.split("/")[2]; result = kv.split("&Refer=")[0];
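
To show what the JS in the Weibo rule computes, the sketch below transliterates it into plain Java. It assumes that the `uri` value handed to the script is a path-like string such as `/weibo/<keyword>&Refer=...`, which the rule's use of `split("/")[2]` suggests but the text above does not state. The class name `WeiboRuleSketch` and the sample input are hypothetical.

```java
import java.net.URLDecoder;
import java.nio.charset.StandardCharsets;

final class WeiboRuleSketch {

    /** Java rendering of the rule's JS body; returns null if the input has too few segments. */
    static String extract(String uri) {
        // decodeURI(decodeURI(uri)): undo two rounds of percent-encoding.
        // URLDecoder is a stand-in here: unlike decodeURI it also decodes
        // reserved characters and turns "+" into a space.
        String once = URLDecoder.decode(uri, StandardCharsets.UTF_8);
        String twice = URLDecoder.decode(once, StandardCharsets.UTF_8);
        // uri.split("/")[2]: the third "/"-separated piece.
        String[] pieces = twice.split("/");
        if (pieces.length < 3) {
            return null;
        }
        // kv.split("&Refer=")[0]: drop the trailing Refer parameter.
        return pieces[2].split("&Refer=")[0];
    }

    public static void main(String[] args) {
        // Hypothetical input with a double-encoded space (%2520).
        System.out.println(extract("/weibo/hello%2520world&Refer=STopic_box")); // hello world
    }
}
```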

# Custom protocol support

app://app1/somepath?q=URL关键词提取
custom://custom1/somepath?q=URL关键词提取
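
URLs like these cannot be parsed with `java.net.URL`, which throws `MalformedURLException` for a protocol without a registered handler, but `java.net.URI` accepts any scheme. The sketch below shows that kind of parsing; whether the library itself relies on `java.net.URI` is an assumption, and the class name `CustomSchemeSketch` is hypothetical.

```java
import java.net.URI;

final class CustomSchemeSketch {

    /** Splits a URL with an arbitrary scheme into scheme, host, path and raw query. */
    static String[] parts(String url) {
        URI uri = URI.create(url);
        return new String[] {uri.getScheme(), uri.getHost(), uri.getPath(), uri.getRawQuery()};
    }

    public static void main(String[] args) {
        for (String part : parts("app://app1/somepath?q=keyword")) {
            System.out.println(part);
        }
    }
}
```

With the components separated this way, the host (`app1` or `custom1` above) can be matched against the host field of a rule just as for an http URL.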