The Tag plugin doesn't work when a tag contains certain Japanese characters, e.g. '一' (U+4E00).
This is because the UTF-8 byte sequence of '一' (\xE4\xB8\x80) gets corrupted by the following code in syntax_plugin_tag_tag::handle (tag.php):
$tags = trim($tags, "\xe2\x80\x8b"); // strip word/wordpad breaklines(U+200b)
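trim() treats its second argument as a set of individual bytes rather than as one character, so it strips the trailing \x80 from \xE4\xB8\x80 (the UTF-8 byte sequence of '一'), and the result is the invalid sequence \xE4\xB8.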
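The following is a minimal sketch that reproduces the corruption and shows one possible way to avoid it. The helper name strip_zero_width_space is hypothetical and not part of the plugin; it assumes the intent is only to remove U+200B from both ends of the string, and it uses preg_replace with the /u modifier so that U+200B is matched as a whole character instead of byte by byte.

```php
<?php
// Hypothetical helper (not part of the Tag plugin): removes zero-width
// spaces (U+200B) from both ends of a string as whole characters.
function strip_zero_width_space(string $tags): string
{
    // With the /u modifier, PCRE treats pattern and subject as UTF-8,
    // so \x{200B} only matches the complete sequence E2 80 8B.
    $result = preg_replace('/^\x{200B}+|\x{200B}+$/u', '', $tags);

    // preg_replace() returns null on error (e.g. invalid UTF-8 input);
    // in that case, return the input unchanged.
    return $result ?? $tags;
}

$tag = "\xE4\xB8\x80"; // '一' (U+4E00)

// Byte-wise trim: \x80 is in the mask, so the last byte of '一' is removed.
$broken = trim($tag, "\xe2\x80\x8b");
echo bin2hex($broken), "\n"; // e4b8 (invalid UTF-8)

// Character-wise strip: the surrounding U+200B is removed, '一' stays intact.
$fixed = strip_zero_width_space("\u{200B}" . $tag . "\u{200B}");
echo bin2hex($fixed), "\n"; // e4b880
```

Any character whose UTF-8 encoding begins or ends with one of the bytes \xE2, \x80 or \x8B is affected by the byte-wise trim in the same way, so the problem is not limited to '一'.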