<xs:element name="foo" type="xs:hexBinary" dfdl:lengthKind="delimited" dfdl:separator="å¹´" dfdl:encoding="iso-8859-1" />
The separator now contains the 3 bytes of the UTF-8 character, but as individual characters in iso-8859-1 where byte values and unicode codepoints are the same.
It doesn't work because char class entities like WSP+ remain problematic. As a UTF-8 WSP+ allows repeats of any of the byte sequences corresponding to these unicode characters:
U+0009-U+000D (Control characters)
U+0020 SPACE
U+0085 NEL
U+00A0 NBSP
U+1680 OGHAM SPACE MARK
U+180E MONGOLIAN VOWEL SEPARATOR
U+2000-U+200A (different sorts of spaces)
U+2028 LSP
U+2029 PSP
U+202F NARROW NBSP
U+205F MEDIUM MATHEMATICAL SPACE
U+3000 IDEOGRAPHIC SPACE
I can't express with separator, a repeating disjunction of the byte sequences corresponding to the above.
Now, I think all this complexity adds no value for anyone.
To avoid all this, I would propose these restrictions on delimited binary data