purescript/purescript-strings

Add an `isWellFormed` function?

chtenb opened this issue · 2 comments

Could we provide a function that checks whether a string is well-formed with respect to its encoding? In the JS backend this would check for lone surrogates; in other backends it may be a different check. Once a string has passed the check, callers no longer have to handle invalidly encoded strings.
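For illustration, here is a minimal sketch of what the JS-backend check could look like, written in TypeScript. The function name and the manual scan are assumptions made for this example, not an existing purescript-strings API. Recent JavaScript engines also ship a built-in `String.prototype.isWellFormed` (ES2024), which a real FFI implementation might prefer where available.

```typescript
// Hypothetical helper (not part of purescript-strings): returns true when
// `s` contains no lone UTF-16 surrogates.
function isWellFormed(s: string): boolean {
  for (let i = 0; i < s.length; i++) {
    const unit = s.charCodeAt(i);
    if (unit >= 0xd800 && unit <= 0xdbff) {
      // High surrogate: it must be immediately followed by a low surrogate.
      const next = i + 1 < s.length ? s.charCodeAt(i + 1) : 0;
      if (next < 0xdc00 || next > 0xdfff) return false;
      i++; // skip the low half of the valid pair
    } else if (unit >= 0xdc00 && unit <= 0xdfff) {
      // Low surrogate that was not preceded by a high surrogate.
      return false;
    }
  }
  return true;
}
```

On the PureScript side this would presumably be exposed through a foreign import with type `String -> Boolean`.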

Maybe a better API would return either the already-well-formed string or a new string derived by an implementation-defined transformation process (such as using replacement characters) on the input to ensure it is well-formed.

toWellFormed :: String -> Either String String
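A rough TypeScript sketch of how this could behave on the JS backend follows. Everything here is an assumption for illustration: the name `toWellFormedResult`, the tagged-union result standing in for `Either String String`, and the choice of U+FFFD as the replacement. The signature above does not say which side of the `Either` carries the already-well-formed string, so the sketch uses descriptive tags instead of `Left`/`Right`.

```typescript
// Hypothetical stand-in for `Either String String`; the tag says whether the
// input was returned untouched or had to be repaired.
type WellFormedResult =
  | { tag: "alreadyWellFormed"; value: string }
  | { tag: "repaired"; value: string };

// Replaces every lone UTF-16 surrogate with U+FFFD (an assumed policy).
function toWellFormedResult(s: string): WellFormedResult {
  let out = "";
  let changed = false;
  for (let i = 0; i < s.length; i++) {
    const unit = s.charCodeAt(i);
    const isHigh = unit >= 0xd800 && unit <= 0xdbff;
    const isLow = unit >= 0xdc00 && unit <= 0xdfff;
    if (isHigh && i + 1 < s.length) {
      const next = s.charCodeAt(i + 1);
      if (next >= 0xdc00 && next <= 0xdfff) {
        // Valid surrogate pair: copy both code units through unchanged.
        out += s.charAt(i) + s.charAt(i + 1);
        i++;
        continue;
      }
    }
    if (isHigh || isLow) {
      // Lone surrogate: substitute the replacement character.
      out += "\ufffd";
      changed = true;
    } else {
      out += s.charAt(i);
    }
  }
  return changed
    ? { tag: "repaired", value: out }
    : { tag: "alreadyWellFormed", value: s };
}
```

A caller can branch on `tag` to learn whether the input was modified, which is the information the `Either` in the proposed signature would carry.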