Converts a string into canonical form, standardizing
such issues as whether a character with an accent
is represented as a base character and combining
accent or as a single precomposed character. The
string has to be valid UTF-8, otherwise null is
returned. You should generally call glib.global.utf8Normalize
before comparing two Unicode strings.
The normalization mode glib.types.NormalizeMode.Default only
standardizes differences that do not affect the
text content, such as the above-mentioned accent
representation. glib.types.NormalizeMode.All also standardizes
the "compatibility" characters in Unicode, such
as SUPERSCRIPT THREE to the standard forms
(in this case DIGIT THREE). Formatting information
may be lost but for most text operations such
characters should be considered the same.
Converts a string into canonical form, standardizing such issues as whether a character with an accent is represented as a base character and combining accent or as a single precomposed character. The string has to be valid UTF-8, otherwise null is returned. You should generally call glib.global.utf8Normalize before comparing two Unicode strings.
The normalization mode glib.types.NormalizeMode.Default only standardizes differences that do not affect the text content, such as the above-mentioned accent representation. glib.types.NormalizeMode.All also standardizes the "compatibility" characters in Unicode, such as SUPERSCRIPT THREE to the standard forms (in this case DIGIT THREE). Formatting information may be lost but for most text operations such characters should be considered the same.
glib.types.NormalizeMode.DefaultCompose and glib.types.NormalizeMode.AllCompose are like glib.types.NormalizeMode.Default and glib.types.NormalizeMode.All, but returned a result with composed forms rather than a maximally decomposed form. This is often useful if you intend to convert the string to a legacy encoding or pass it to a system with less capable Unicode handling.