Print a warning to STDERR when a user-provided charset contains grapheme clusters, alerting the user that each cluster will be split into its constituent codepoints, which are then treated as separate characters. Do not error out, as this may be intentional.
The handling of grapheme clusters should also be documented more thoroughly; it's currently only documented in a README FAQ entry.
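
A rough sketch of what the warning could look like, written in Python purely for illustration. The function names `cluster_joiners` and `warn_on_grapheme_clusters` are hypothetical and not part of the project. The check is a stdlib-only heuristic (combining marks and the zero-width joiner), not full grapheme segmentation, so it would miss some clusters such as regional-indicator flag pairs.

```python
import sys
import unicodedata

ZWJ = "\u200d"  # zero-width joiner, used to build emoji sequences


def cluster_joiners(charset: str) -> list[str]:
    """Return codepoints that normally attach to a neighbouring codepoint.

    Heuristic only: flags combining marks (general category M*) and ZWJ.
    """
    return [
        ch for ch in charset
        if ch == ZWJ or unicodedata.category(ch).startswith("M")
    ]


def warn_on_grapheme_clusters(charset: str, stream=None) -> bool:
    """Warn (never raise) if the charset looks like it contains clusters."""
    stream = sys.stderr if stream is None else stream
    joiners = cluster_joiners(charset)
    if not joiners:
        return False
    found = ", ".join(f"U+{ord(ch):04X}" for ch in joiners)
    print(
        "warning: charset appears to contain grapheme clusters "
        f"({found}); each codepoint will be treated as a separate character",
        file=stream,
    )
    return True
```

Returning a boolean instead of raising keeps the behaviour non-fatal, matching the request not to error out when the clusters are intentional.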