1) For xs:string, if dfdl:lengthKind is 'implicit' then xs:maxLength
is used to extract N units from the data. If dfdl:lengthUnits is
'bytes' then N bytes are extracted. If validation is switched on,
xs:maxLength is also used to validate that no more than N characters
appear in the infoset. This seems problematic where the dfdl:encoding
is non-SBCS (not a single-byte character set), because N bytes and
N characters are then not the same amount of data.
2) For xs:string, if dfdl:lengthKind implies a variable length on
output and dfdl:textPadKind is not 'none', then xs:minLength is used
to ensure that at least N units are output. If dfdl:lengthUnits is
'bytes' then at least N bytes are written to the data. If validation
is switched on, xs:minLength is also used to validate that at least
N characters appear in the infoset. Again this seems problematic where
the dfdl:encoding is non-SBCS, because a value padded to N bytes can
contain fewer than N characters.
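
To make the mismatch in 1) and 2) concrete, here is a minimal Python
sketch. It is not DFDL processor code, only an illustration under
stated assumptions: N = 5, UTF-8 as the non-SBCS encoding, and a space
as the pad character. The helper names extract_n_bytes and
pad_to_n_bytes are hypothetical.

```python
N = 5               # assumed facet value, standing in for xs:maxLength / xs:minLength
ENCODING = "utf-8"  # assumed non-SBCS encoding, chosen for illustration


def extract_n_bytes(data: bytes, n: int) -> str:
    """Parse side (case 1): take n bytes from the data and decode them."""
    return data[:n].decode(ENCODING, errors="replace")


def pad_to_n_bytes(value: str, n: int, pad: str = " ") -> bytes:
    """Unparse side (case 2): right-pad the encoded value to at least n bytes."""
    out = value.encode(ENCODING)
    pad_bytes = pad.encode(ENCODING)
    while len(out) < n:
        out += pad_bytes
    return out


# Case 1: "h\u00e9llo" is 5 characters but 6 bytes in UTF-8,
# so extracting 5 bytes yields only 4 characters.
parsed = extract_n_bytes("h\u00e9llo".encode(ENCODING), N)
print(len(parsed), repr(parsed))

# Case 2: "\u00e9" is 1 character but 2 bytes in UTF-8,
# so padding to 5 bytes yields only 4 characters.
written = pad_to_n_bytes("\u00e9", N)
print(len(written), len(written.decode(ENCODING)))
```

In both cases the byte count is 5 while the character count is 4, so
the same facet value gives different answers depending on whether it
is applied in bytes or checked in characters.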
Should we disallow the combinations
that actually cause a problem?
Regards
Steve Hanson
Architect, Data Format Description Language (DFDL)
Co-Chair, OGF DFDL Working Group
IBM SWG, Hursley, UK
smh@uk.ibm.com
tel:+44-1962-815848