All Projects → YvesCheung → EmojiReader

YvesCheung / EmojiReader

Licence: Apache-2.0 license
A simple tool to recognize Emoji in string. (JavaScript & Java)

Programming Languages

kotlin
9241 projects
javascript
184084 projects - #8 most used programming language
HTML
75241 projects
CSS
56736 projects

Projects that are alternatives of or similar to EmojiReader

Emojica
A Swift framework for using custom emoji in strings.
Stars: ✭ 93 (+52.46%)
Mutual labels:  emoji, emojis
emojis
An emoji management bot for Discord.
Stars: ✭ 18 (-70.49%)
Mutual labels:  emoji, emojis
Whatsbook
Create books from WhatsApp group chats with Python and LaTeX
Stars: ✭ 147 (+140.98%)
Mutual labels:  emoji, emojis
Awesome Emoji Picker
Add-on/WebExtension that provides a modern emoji picker that you can use to find and copy/insert emoji into the active web page.
Stars: ✭ 54 (-11.48%)
Mutual labels:  emoji, emojis
Emoji-Log-VSCode
Emoji-Log VSCode Extension — An Emoji Git commit log messages spec standard. [ 📦👌🐛📖🚀🤖 ‼️]
Stars: ✭ 44 (-27.87%)
Mutual labels:  emoji, emojis
React Native Animated Emoji
Animated Floating Reactions like Facebook 👍
Stars: ✭ 82 (+34.43%)
Mutual labels:  emoji, emojis
AllGithubEmojis
A list of all supported github emojis updated weekly. https://jzeferino.github.io/AllGithubEmojis/
Stars: ✭ 82 (+34.43%)
Mutual labels:  emoji, emojis
slack-emoji-for-techies
100s of Slack emoji, many tech-related
Stars: ✭ 123 (+101.64%)
Mutual labels:  emoji, emojis
spacymoji
💙 Emoji handling and meta data for spaCy with custom extension attributes
Stars: ✭ 174 (+185.25%)
Mutual labels:  emoji, emojis
EmoticonsBoard
Function keyboard and emotions. Android表情键盘,可动态更新表情。
Stars: ✭ 31 (-49.18%)
Mutual labels:  emoji, emoticon
Styleguide Git Commit Message
/sBin/StyleGuide/Git/CommitMessage
Stars: ✭ 934 (+1431.15%)
Mutual labels:  emoji, emojis
ChineseBQB-client
🤣 开源表情包小程序
Stars: ✭ 81 (+32.79%)
Mutual labels:  emoji, emoticon
Oji
(◕‿◕) Text Emoticons Maker
Stars: ✭ 668 (+995.08%)
Mutual labels:  emoji, emojis
Emojipacks
CLI to bulk upload emojis to your Slack
Stars: ✭ 1,275 (+1990.16%)
Mutual labels:  emoji, emojis
Supernova Emoji
library to implement and render emojis For Android
Stars: ✭ 334 (+447.54%)
Mutual labels:  emoji, emojis
Spacymoji
💙 Emoji handling and meta data for spaCy with custom extension attributes
Stars: ✭ 151 (+147.54%)
Mutual labels:  emoji, emojis
emoji-extractor-plus
Extract emojis from Apple font in PNG format
Stars: ✭ 42 (-31.15%)
Mutual labels:  emoji, emojis
latexemoji
Latex package to include emoji in Latex document
Stars: ✭ 17 (-72.13%)
Mutual labels:  emoji, emojis
emoticon
List of emoticons
Stars: ✭ 41 (-32.79%)
Mutual labels:  emoji, emoticon
Emojions
Embeddable Emoji Bar
Stars: ✭ 15 (-75.41%)
Mutual labels:  emoji, emojis

EmojiReader

一个能在字符串中识别出 Emoji 的简单工具 (支持JavaScript/Java)


npm version

点此预览效果: https://emoji-reader.vercel.app/

English Readme

特性

  • 支持 Unicode12 规范,点此查看
  • 基于 EBNF 状态机的 Emoji 判断,比正则表达式更易维护
  • 准确判断含有 Emoji 的字符串长度
  • 准确切割字符串不会断开 Emoji

长度判断

Emoji String.length EmojiReader.getTextLength
1 1
🙂 2 1
👱‍♂ 5 1
🏳️‍🌈 6 1
👨‍👩‍👦‍👦 11 1

在字符串中,一个 Emoji 由一个或多个 Unicode 码点(CodePoint)组成,一个码点可能由多个字符组成(取决于码点是否大于 0x010000),因此一个 Emoji 可能由数个字符组成。

很多业务都需要有字数的判断,比如用户昵称不能过长,发言内容有字数限制等等。如果不对 Emoji 进行特殊处理,往往会出现不符合用户预期的情况。

使用 EmojiReader.getTextLength 可以获取到文本的可视符号的长度,一个 Emoji 的长度为1。

//Java
String strWithEmoji = "我是一个😃";
int error = strWithEmoji.length(); //6
int correct = EmojiReader.getTextLength(strWithEmoji); //5
//JavaScript
const strWithEmoji = '我是一个😃';
const error = strWithEmoji.length; //6
const correct = require('emoji-reader').getTextLength(strWithEmoji); //5

表情切割

当显示文本过长时,通常我们会省略末尾的文本,并加上省略号。

但如果字符串中含有 Emoji ,切割字符串就很可能把 Emoji 切段,变成乱码。比如下面这个字符串:

"我是🙂😐😎💏"

经过 String.subString(0, 5) 处理后:

"我是🙂?"

因为多个 Unicode 码点共同组合才能完成一个 Emoji 的展示,通过切割后剩下的 Unicode 码点会表现出无法正常显示的乱码。

使用 EmojiReader.subSequence 可以按照一个 Emoji 长度为1来进行符合视觉预期的裁剪。

//JavaScript
import EmojiReader from 'emoji-reader'
//Java
import com.yy.mobile.emoji.EmojiReader

EmojiReader.subSequence("我是🙂😐😎💏", 0, 5) == "我是🙂😐😎"

安装 (Javascript)

npm install --save emoji-reader

安装 (Java/Android)

  1. 根目录的 build.gradle 添加:

    allprojects {
        repositories {
            ...
            maven { url 'https://jitpack.io' }
        }
    }
  2. 使用的模块的 build.gradle 中添加:

    dependencies {
        api 'com.github.YvesCheung.EmojiReader:lib-jvm:x.y.z'
    }

    其中x.y.z 版本替换为

原理

Unicode 规范文档中给出了 Emoji 的语法,是一个EBNF范式的表达:

possible_emoji :=
    flag_sequence
    | zwj_element (\x{200D} zwj_element)+
     
flag_sequence :=
    \p{RI} \p{RI}
     
zwj_element :=
    \p{Emoji} emoji_modification?

emoji_modification :=
    \p{EMod}
    | \x{FE0F} \x{20E3}?
    | tag_modifier

tag_modifier :=
    [\x{E0020}-\x{E007E}]+ \x{E007F}

这里简单地解释一下:

Emoji只有三种形式

第一种是国旗类的,由两个国家区域符组成

  • 两个区域符号组成国旗的样例

第二种是由表情专属的码点加修饰符组成(修饰符可选)

  • 单个码点组成的样例

  • 码点加上修饰符的样例(此例中修饰符为 \uFE0F \20E3)

第三种是由多个第二种表情通过连接符组成

  • 多个(码点 修饰符)相连的样例(连接符为 \u200D)

  • 经典的全家福

通过全家福可以发现,\u1F469\u1F466 都是独立的 Emoji 码点,可以表现出一个人像,当他们通过 \u200D 连接符组合后,就可以表现出一个多人像的新 Emoji

一个工程师 \u1F477 和一个女性别 \2640 \FE0F 组合起来,就可以表现出一个女工程师的新 Emoji

可选的修饰符 \uFE0F \u20E3 等等跟在独立的 Emoji 码点后面,可以起修改表现颜色/表现性别等作用。

通过修饰符和连接符就能把 Emoji 码点组合出千变万化的表情。

许可证

Copyright 2019 YvesCheung

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].