Instead of splitting words by spaces, we should use the Unicode segmentation algorithm as implemented by unicode-segmentation, unicode-linebreak or textwrap.
Related to the migration to textwrap, see #30.