Merged
54 changes: 51 additions & 3 deletions docs/emoji/index.md
@@ -4,12 +4,22 @@

1. Get the UCD tools updated (see [Building Unicode Tools](../index.md))
2. \[If New version\]
1. Set **the version as per instructions in** Emoji.java.
1. Set **the version as per instructions in** the header of Emoji.java.
2. Create a new folder **{unicode-draft}**/emoji/**X**.0/
3. Copy in the ReadMe.txt from the last version, but add the word "draft"
in front of "emoji".
4. And add the images to the images repository.
5. Make sure that docRegistry.txt is up to date.
1. Look at the last line in docRegistry.txt, e.g. L2/20-153 ...
2. Go to https://www.unicode.org/L2/L-curdoc.htm
3. Find the year, and copy the first column after L2/20-153 down to the last real line.
3. Paste at the end of docRegistry.txt. Make sure there are 4 tab delimited columns.
4. Repeat for any later years.
5. NOTE: sometimes there are glitches in the columns, typically when a cell contains multiple lines. You'll see that when a line doesn't start with L2/... In that case, fix the lines (typically by joining each with the previous line).
6. *Note that the data files with the new emoji will not be generated until candidateData.txt has Status=Draft Candidate.*
3. Run **{unicodetools/org/unicode/text}**/tools/GenerateEmoji.java
1. For VM arguments see the section below.
2. Sometimes the candidateData.txt file will be malformed, and you'll see some errors. Fix them.
4. That will generate updated files in one of two places, according to the
setting Emoji.IS_BETA.
1. **CURRENT VERSION**
@@ -20,8 +30,46 @@
2. **{unicode-draft}**/emoji/**charts-NEXT.0**/\*
5. Special small versions of the charts are in corresponding directories like:
1. **{unicode-draft}**/emoji/🏴charts-11.0/emoji-list.html
6. Sanity-check them, diff against old files, and check in.
1. Don't use Eclipse diff for the big chart files: they are too big for it.
6. Sanity-check data files, diffing against old files.
1. Old data files will be in unicodetools/unicodetools/data/emoji/**CURRENT**.0/
2. New data files in **{unicode-draft}**
3. If all looks good, copy the new data files into unicodetools/unicodetools/data/emoji/**NEXT**.0/
7. If you are not doing this the first time:
1. Sanity check the new charts (Don't use Eclipse diff for the big chart files: they are too big for it.)
2. Copy them into https://github.com/unicode-org/emoji/tree/main/docs/emoji
1. /**NEXT**.0/
2. /future/
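
The docRegistry.txt cleanup in step 2.5 (join lines that don't start with `L2/`, then confirm four tab-delimited columns) could be sketched as below. This is a minimal sketch, not part of the Unicode tools: the function names are hypothetical, and it assumes it is run only on the block pasted from L-curdoc.htm, not on any header lines the file may have.

```python
def join_continuation_lines(lines):
    """Join any line that does not start with 'L2/' onto the previous line.

    Assumption: a row that was split across lines (multi-line cell) can be
    repaired by appending the continuation to the row before it.
    """
    fixed = []
    for line in lines:
        line = line.rstrip("\n")
        if not line.strip():
            continue  # ignore blank lines
        if line.startswith("L2/") or not fixed:
            fixed.append(line)
        else:
            # Keep tabs intact; only tidy the spaces around the join point.
            fixed[-1] = fixed[-1].rstrip(" ") + " " + line.lstrip(" ")
    return fixed


def rows_with_wrong_column_count(rows, expected=4):
    """Return (row_number, row) pairs that do not have `expected` tab-delimited columns."""
    return [
        (number, row)
        for number, row in enumerate(rows, start=1)
        if len(row.split("\t")) != expected
    ]


if __name__ == "__main__":
    # Invented sample rows; the real column contents may differ.
    pasted = [
        "L2/20-154\tTitle part one\n",
        "part two\tSource\t2020-05-01\n",
        "L2/20-155\tAnother title\tSource\t2020-05-02\n",
    ]
    repaired = join_continuation_lines(pasted)
    for number, row in rows_with_wrong_column_count(repaired):
        print(f"row {number} needs a manual fix: {row!r}")
```

Rows that still fail the column check after joining are the ones to fix by hand.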


\[If New version\]
Some things will not be fixed yet, so you have to take a second pass.

1. Update emojiOrdering.txt
Diff unicodetools/src/main/resources/org/unicode/tools/emoji/emojiOrdering.txt with the new
unicodetools/data/emoji/**NEXT**/internal/emojiOrdering.txt
Copy in the new characters from internal/emojiOrdering.txt to emoji/emojiOrdering.txt.
**Don't** remove the sets of ZWJ sequences like 👱‍♀ 👱‍♀️ 👱‍♂ 👱‍♂️ 👱🏻‍♀ 👱🏻‍♀️ 👱🏻‍♂ 👱🏻‍♂️ 👱🏼‍♀ 👱🏼‍♀️ 👱🏼‍♂ 👱🏼‍♂️ 👱🏽‍♀ 👱🏽‍♀️ ; those are still needed.

2. Run GenerateEmoji.java again.
If you missed one of the new characters you will probably get an error in building the EmojiOrder.
This will update the generated files to have **NEXT** instead of E0.0. Example:
```
1F6DC ; Emoji # E0.0 [1] (🛜) wireless
```
becomes
```
1F6DC ; Emoji # E15.0 [1] (🛜) wireless
```
It will also add the new characters to emoji-test.txt (they will be missing in the first pass.)

3. Copy over the data files again
From: **{unicode-draft}**/Public/emoji/**NEXT.0**/\*
To: unicodetools/unicodetools/data/emoji/**NEXT**.0/

4. Go to step 7 above to copy the charts.
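
One way to confirm that the second pass took effect is to scan a generated data file for lines whose comment still carries the `E0.0` placeholder. The sketch below is an illustration only: the function name is hypothetical, it relies on the `# E<version> [count] (...)` comment layout shown in the example above, and a hit is a prompt to look at the line, since some lines may legitimately keep `E0.0`.

```python
import re

# Matches the version field in the trailing comment, e.g. "# E0.0 [1] (...)".
PLACEHOLDER = re.compile(r"#\s*E0\.0\b")


def lines_with_placeholder_version(text):
    """Return (line_number, line) pairs whose trailing comment still says E0.0."""
    hits = []
    for number, line in enumerate(text.splitlines(), start=1):
        if line.lstrip().startswith("#"):
            continue  # header or comment-only line, not a data line
        if PLACEHOLDER.search(line):
            hits.append((number, line))
    return hits


if __name__ == "__main__":
    sample = (
        "# Emoji data (sample)\n"
        "1F6DC ; Emoji # E0.0 [1] (🛜) wireless\n"
        "1F6DC ; Emoji # E15.0 [1] (🛜) wireless\n"
    )
    for number, line in lines_with_placeholder_version(sample):
        print(f"line {number} still has the placeholder: {line}")
```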

### CLDR
CLDR uses some files once the correct emoji-test.txt file is built. For details, see https://cldr.unicode.org/development/generate-emoji-paths

### GenerateEmoji: VM Arguments

1,326 changes: 1,326 additions & 0 deletions unicodetools/data/emoji/15.0/emoji-data.txt

Large diffs are not rendered by default.

1,489 changes: 1,489 additions & 0 deletions unicodetools/data/emoji/15.0/emoji-sequences.txt

Large diffs are not rendered by default.

5,022 changes: 5,022 additions & 0 deletions unicodetools/data/emoji/15.0/emoji-test.txt

Large diffs are not rendered by default.

723 changes: 723 additions & 0 deletions unicodetools/data/emoji/15.0/emoji-variation-sequences.txt

Large diffs are not rendered by default.

1,411 changes: 1,411 additions & 0 deletions unicodetools/data/emoji/15.0/emoji-zwj-sequences.txt

Large diffs are not rendered by default.

90 changes: 90 additions & 0 deletions unicodetools/data/emoji/15.0/internal/emoji-internal.txt
@@ -0,0 +1,90 @@
# emoji-internal.txt
# Date: 2022-05-02, 03:44:53 GMT
# © 2022 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use, see https://www.unicode.org/terms_of_use.html
#
# Emoji Data Internal for UTS #51
# Version: 15.0
#
# For documentation and usage, see https://www.unicode.org/reports/tr51
#
# Format:
# <codepoint(s)> ; <property> # <comments>
# Note: there is no guarantee as to the structure of whitespace or comments
#
# Characters and sequences are listed in code point order. Users should be shown a more natural order.
# See the CLDR collation order for Emoji.


# ================================================

# All omitted code points have Emoji_Gender_Base=No
# @missing: 0000..10FFFF ; Emoji_Gender_Base ; No

26F9 ; Emoji_Gender_Base # E0.7 [1] (⛹️) person bouncing ball
1F3C3..1F3C4 ; Emoji_Gender_Base # E0.6 [2] (🏃..🏄) person running..person surfing
1F3CA ; Emoji_Gender_Base # E0.6 [1] (🏊) person swimming
1F3CB..1F3CC ; Emoji_Gender_Base # E0.7 [2] (🏋️..🏌️) person lifting weights..person golfing
1F46E..1F471 ; Emoji_Gender_Base # E0.6 [4] (👮..👱) police officer..person: blond hair
1F473 ; Emoji_Gender_Base # E0.6 [1] (👳) person wearing turban
1F477 ; Emoji_Gender_Base # E0.6 [1] (👷) construction worker
1F481..1F482 ; Emoji_Gender_Base # E0.6 [2] (💁..💂) person tipping hand..guard
1F486..1F487 ; Emoji_Gender_Base # E0.6 [2] (💆..💇) person getting massage..person getting haircut
1F575 ; Emoji_Gender_Base # E0.7 [1] (🕵️) detective
1F645..1F647 ; Emoji_Gender_Base # E0.6 [3] (🙅..🙇) person gesturing NO..person bowing
1F64B ; Emoji_Gender_Base # E0.6 [1] (🙋) person raising hand
1F64D..1F64E ; Emoji_Gender_Base # E0.6 [2] (🙍..🙎) person frowning..person pouting
1F6A3 ; Emoji_Gender_Base # E1.0 [1] (🚣) person rowing boat
1F6B4..1F6B5 ; Emoji_Gender_Base # E1.0 [2] (🚴..🚵) person biking..person mountain biking
1F6B6 ; Emoji_Gender_Base # E0.6 [1] (🚶) person walking
1F926 ; Emoji_Gender_Base # E3.0 [1] (🤦) person facepalming
1F935 ; Emoji_Gender_Base # E3.0 [1] (🤵) person in tuxedo
1F937..1F939 ; Emoji_Gender_Base # E3.0 [3] (🤷..🤹) person shrugging..person juggling
1F93C..1F93E ; Emoji_Gender_Base # E3.0 [3] (🤼..🤾) people wrestling..person playing handball
1F9B8..1F9B9 ; Emoji_Gender_Base # E11.0 [2] (🦸..🦹) superhero..supervillain
1F9CD..1F9CF ; Emoji_Gender_Base # E12.0 [3] (🧍..🧏) person standing..deaf person
1F9D4 ; Emoji_Gender_Base # E5.0 [1] (🧔) person: beard
1F9D6..1F9DF ; Emoji_Gender_Base # E5.0 [10] (🧖..🧟) person in steamy room..zombie

# Total elements: 51

# ================================================

# All omitted code points have Emoji_Explicit_Gender=No
# @missing: 0000..10FFFF ; Emoji_Explicit_Gender ; No

1F385 ; Emoji_Explicit_Gender # E0.6 [1] (🎅) Santa Claus
1F466..1F469 ; Emoji_Explicit_Gender # E0.6 [4] (👦..👩) boy..woman
1F46B ; Emoji_Explicit_Gender # E0.6 [1] (👫) woman and man holding hands
1F46C..1F46D ; Emoji_Explicit_Gender # E1.0 [2] (👬..👭) men holding hands..women holding hands
1F470 ; Emoji_Explicit_Gender # E0.6 [1] (👰) person with veil
1F472 ; Emoji_Explicit_Gender # E0.6 [1] (👲) person with skullcap
1F474..1F475 ; Emoji_Explicit_Gender # E0.6 [2] (👴..👵) old man..old woman
1F478 ; Emoji_Explicit_Gender # E0.6 [1] (👸) princess
1F483 ; Emoji_Explicit_Gender # E0.6 [1] (💃) woman dancing
1F574 ; Emoji_Explicit_Gender # E0.7 [1] (🕴️) person in suit levitating
1F57A ; Emoji_Explicit_Gender # E3.0 [1] (🕺) man dancing
1F930 ; Emoji_Explicit_Gender # E3.0 [1] (🤰) pregnant woman
1F931 ; Emoji_Explicit_Gender # E5.0 [1] (🤱) breast-feeding
1F934..1F936 ; Emoji_Explicit_Gender # E3.0 [3] (🤴..🤶) prince..Mrs. Claus
1F9D5 ; Emoji_Explicit_Gender # E5.0 [1] (🧕) woman with headscarf

# Total elements: 22

# ================================================

# All omitted code points have Multi_Person_Groupings=No
# @missing: 0000..10FFFF ; Multi_Person_Groupings ; No

1F46A..1F46B ; Multi_Person_Groupings # E0.6 [2] (👪..👫) family..woman and man holding hands
1F46C..1F46D ; Multi_Person_Groupings # E1.0 [2] (👬..👭) men holding hands..women holding hands
1F46F ; Multi_Person_Groupings # E0.6 [1] (👯) people with bunny ears
1F48F ; Multi_Person_Groupings # E0.6 [1] (💏) kiss
1F491 ; Multi_Person_Groupings # E0.6 [1] (💑) couple with heart
1F91D ; Multi_Person_Groupings # E3.0 [1] (🤝) handshake
1F93C ; Multi_Person_Groupings # E3.0 [1] (🤼) people wrestling

# Total elements: 9

#EOF
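
As a sanity check when diffing a file like this, the `# Total elements` lines can be recomputed from the data lines. The following is a minimal sketch with a hypothetical function name; it assumes only the `<codepoint(s)> ; <property> # <comments>` format stated in the file header, with `..` marking an inclusive code point range, and counts any non-range entry as one element.

```python
from collections import Counter


def count_elements_by_property(text):
    """Count elements per property, expanding 'XXXX..YYYY' ranges."""
    counts = Counter()
    for raw in text.splitlines():
        data = raw.split("#", 1)[0].strip()  # drop the trailing comment
        if not data:
            continue  # blank, comment-only, or @missing line
        fields = [field.strip() for field in data.split(";")]
        if len(fields) < 2:
            continue  # not a "<codepoint(s)> ; <property>" line
        code_points, prop = fields[0], fields[1]
        if ".." in code_points:
            start, end = code_points.split("..")
            counts[prop] += int(end, 16) - int(start, 16) + 1
        else:
            counts[prop] += 1
    return dict(counts)


if __name__ == "__main__":
    sample = (
        "# @missing: 0000..10FFFF ; Emoji_Gender_Base ; No\n"
        "26F9 ; Emoji_Gender_Base # E0.7 [1] (⛹️) person bouncing ball\n"
        "1F3C3..1F3C4 ; Emoji_Gender_Base # E0.6 [2] (🏃..🏄) person running..person surfing\n"
        "1F46A..1F46B ; Multi_Person_Groupings # E0.6 [2] (👪..👫) family..woman and man holding hands\n"
    )
    print(count_elements_by_property(sample))
```

Run over the whole file, the per-property counts can then be compared with the `# Total elements` lines (51, 22, and 9 above).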