• Djehngo@lemmy.world
      link
      fedilink
      arrow-up
      6
      ·
      1 year ago

      Makes sense, the code-points split is stable; meaning it’s fine to put in the standard library, the grapheme split changes every year so the volatility is probably better off in a crate.

      • Knusper@feddit.de
        link
        fedilink
        arrow-up
        5
        ·
        1 year ago

        Yeah, although having now seen two commenters with relatively high confidence claiming that counting codepoints ought be enough…

        …and me almost having been the third such commenter, had I not decided to read the article first…

        …I’m starting to feel more and more like the stdlib should force you through all kinds of hoops to get anything resembling a size of a string, so that you gladly search for a library.

        Like, I’ve worked with decoding strings quite a bit in the past, I felt like I had an above average understanding of Unicode as a result. And I was still only vaguely aware of graphemes.