ladybird/Meta/generate-emoji-txt.sh
Ryan Liptak 221d9089e9 Meta: Generate emoji.txt at build time from Unicode's emoji-test.txt
Instead of manually updating emoji.txt whenever new emoji are added,
we use Unicode's emoji-test.txt to generate emoji.txt on each build,
including only the emojis that Serenity supports at that time.

By using emoji-test.txt, we can also include all forms of each emoji
(fully-qualified, minimally-qualified, and unqualified) which can be
helpful when double-checking how certain forms are handled.
2022-08-22 08:42:54 +01:00

62 lines
1.9 KiB
Bash
Executable file

#!/usr/bin/env bash
set -e
if [ $# -ne 3 ]; then
echo "Usage: $0 <input emoji-test.txt file> <emoji image directory> <output path>"
exit 1
fi
INPUT_FILE="$1"
EMOJI_DIR="$2"
OUTPUT_PATH="$3"
# empty the generated file first
:>| "$OUTPUT_PATH"
first_heading=true
while IFS= read -r line
do
if [[ $line == \#\ subgroup:\ * || $line == \#\ group:\ * ]]; then
if [ $first_heading = false ]; then
echo "" >> "$OUTPUT_PATH"
fi
echo "$line" >> "$OUTPUT_PATH"
first_heading=false
elif [[ ${#line} -ne 0 && $line != \#* ]]; then
codepoints_string=${line%%;*}
IFS=" " read -r -a codepoints <<< "$codepoints_string"
for i in "${!codepoints[@]}"; do
# strip leading zeros
codepoints[$i]="${codepoints[$i]#"${codepoints[$i]%%[!0]*}"}"
# add U+ prefix
codepoints[$i]="U+${codepoints[$i]}"
done
# when doing a lookup we want to remove all U+FE0F (emoji presentation specifier) codepoints
lookup_filename_parts=()
for codepoint in "${codepoints[@]}"; do
if [[ $codepoint != "U+FE0F" ]]; then
lookup_filename_parts+=("$codepoint")
fi
done
IFS=_
lookup_filename="${lookup_filename_parts[*]}.png"
if [ -f "$EMOJI_DIR/$lookup_filename" ]; then
emoji_and_name=${line#*# }
emoji=${emoji_and_name%% E*}
name_with_version=${emoji_and_name#* }
name=${name_with_version#* }
qualification=${line#*; }
qualification=${qualification%%#*}
# remove trailing whitespace characters
qualification="${qualification%"${qualification##*[![:space:]]}"}"
IFS=" "
echo "$emoji - ${codepoints[*]} ${name^^} ($qualification)" >> "$OUTPUT_PATH"
fi
fi
done < "$INPUT_FILE"