It is represented in the place of a character that is identified to be invalid or incorrect, or in cases when it can't be represented on a device used. 替代字符 替代字符 (英語: substitute character,␚)是一个 控制字符,它被用于替代识别为无效、错误或不能在指定设备上表示的字符。 它也被一些编程语言用于 转义序列。 在 ascii 和. The substitute character (sub) was intended to request a translation of the next character from a printable character to another value, usually by setting bit 5 to zero.
When applied to chinese word segmentation. Specifically, we first encode the input text by converting each chinese character into a short sequence based on its glyph or pronunciation, and then construct the vocabulary. In computer data, a substitute character (␚) is a control character that is used to pad transmitted data in order to send it in blocks of fixed size, or to stand in place of a character that is.
Click to show more than 250. 替代字元 (英語: substitute character,␚)是一個 控制字元,它被用於替代辨識為無效、錯誤或不能在指定裝置上表示的字元。 它也被一些程式語言用於 跳脫序列。 在 ascii 和 unicode.