Spec bug: WSP* should not be allowed for textStandardZeroRep

We don't allow %ES; for textStandardZeroRep. But we do allow character class entities like %NL; %WSP; %WSP+; and... problematic: %WSP*; I claim if you don't allow %ES; you should not allow %WSP*; because it matches zero-or-more, allowing an empty-string to match in principle. Leaving off WSP* only eliminates the zero-characters option, as WSP+ is available. There are probably other places in the spec that have this same issue. Basically, anyplace that %ES is disallowed but the other character class entities are allowed. -- Mike Beckerle | OGF DFDL Workgroup Co-Chair | Tresys Technology | www.tresys.com

Mike %WSP*; is useful tacked onto the end of string literal properties that don't allow %ES; so I think the correct restriction is that if a property does not allow %ES; it should not allow %WSP*; on its own. Regards Steve Hanson Architect, IBM Data Format Description Language (DFDL) Co-Chair, OGF DFDL Working Group IBM SWG, Hursley, UK smh@uk.ibm.com tel:+44-1962-815848 From: Mike Beckerle <mbeckerle.dfdl@gmail.com> To: dfdl-wg@ogf.org, Date: 10/08/2013 17:02 Subject: [DFDL-WG] Spec bug: WSP* should not be allowed for textStandardZeroRep Sent by: dfdl-wg-bounces@ogf.org We don't allow %ES; for textStandardZeroRep. But we do allow character class entities like %NL; %WSP; %WSP+; and... problematic: %WSP*; I claim if you don't allow %ES; you should not allow %WSP*; because it matches zero-or-more, allowing an empty-string to match in principle. Leaving off WSP* only eliminates the zero-characters option, as WSP+ is available. There are probably other places in the spec that have this same issue. Basically, anyplace that %ES is disallowed but the other character class entities are allowed. -- Mike Beckerle | OGF DFDL Workgroup Co-Chair | Tresys Technology | www.tresys.com -- dfdl-wg mailing list dfdl-wg@ogf.org https://www.ogf.org/mailman/listinfo/dfdl-wg Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU

I can find only one place in the spec where this applies which is textStandardZeroRep. I am updating errata 2.42 with this clarification also. On Mon, Aug 12, 2013 at 11:03 AM, Steve Hanson <smh@uk.ibm.com> wrote:
Mike
%WSP*; is useful tacked onto the end of string literal properties that don't allow %ES; so I think the correct restriction is that if a property does not allow %ES; it should not allow %WSP*; *on its own*.
Regards
Steve Hanson Architect, IBM Data Format Description Language (DFDL) Co-Chair, *OGF DFDL Working Group* <http://www.ogf.org/dfdl/> IBM SWG, Hursley, UK* **smh@uk.ibm.com* <smh@uk.ibm.com> tel:+44-1962-815848
From: Mike Beckerle <mbeckerle.dfdl@gmail.com> To: dfdl-wg@ogf.org, Date: 10/08/2013 17:02 Subject: [DFDL-WG] Spec bug: WSP* should not be allowed for textStandardZeroRep Sent by: dfdl-wg-bounces@ogf.org ------------------------------
We don't allow %ES; for textStandardZeroRep. But we do allow character class entities like %NL; %WSP; %WSP+; and... problematic: %WSP*;
I claim if you don't allow %ES; you should not allow %WSP*; because it matches zero-or-more, allowing an empty-string to match in principle.
Leaving off WSP* only eliminates the zero-characters option, as WSP+ is available.
There are probably other places in the spec that have this same issue. Basically, anyplace that %ES is disallowed but the other character class entities are allowed.
-- Mike Beckerle | OGF DFDL Workgroup Co-Chair | Tresys Technology | * www.tresys.com* <http://www.tresys.com/> -- dfdl-wg mailing list dfdl-wg@ogf.org https://www.ogf.org/mailman/listinfo/dfdl-wg
Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU
-- Mike Beckerle | OGF DFDL Workgroup Co-Chair | Tresys Technology | www.tresys.com

It also applies to initiator, terminator, separator. The spec descriptions of these properties do not say that %ES; is not allowed, but they should do - it is not legal for any of these properties to return empty string. IBM DFDL checks for this. We are inconsistent with how entities like %ES; are referred to in the spec. Some references use ES and some use %ES;. Regards Steve Hanson Architect, IBM Data Format Description Language (DFDL) Co-Chair, OGF DFDL Working Group IBM SWG, Hursley, UK smh@uk.ibm.com tel:+44-1962-815848 From: Mike Beckerle <mbeckerle.dfdl@gmail.com> To: Steve Hanson/UK/IBM@IBMGB, Cc: "dfdl-wg@ogf.org" <dfdl-wg@ogf.org>, "dfdl-wg-bounces@ogf.org" <dfdl-wg-bounces@ogf.org> Date: 16/08/2013 19:24 Subject: Re: [DFDL-WG] Spec bug: WSP* should not be allowed for textStandardZeroRep I can find only one place in the spec where this applies which is textStandardZeroRep. I am updating errata 2.42 with this clarification also. On Mon, Aug 12, 2013 at 11:03 AM, Steve Hanson <smh@uk.ibm.com> wrote: Mike %WSP*; is useful tacked onto the end of string literal properties that don't allow %ES; so I think the correct restriction is that if a property does not allow %ES; it should not allow %WSP*; on its own. Regards Steve Hanson Architect, IBM Data Format Description Language (DFDL) Co-Chair, OGF DFDL Working Group IBM SWG, Hursley, UK smh@uk.ibm.com tel:+44-1962-815848 From: Mike Beckerle <mbeckerle.dfdl@gmail.com> To: dfdl-wg@ogf.org, Date: 10/08/2013 17:02 Subject: [DFDL-WG] Spec bug: WSP* should not be allowed for textStandardZeroRep Sent by: dfdl-wg-bounces@ogf.org We don't allow %ES; for textStandardZeroRep. But we do allow character class entities like %NL; %WSP; %WSP+; and... problematic: %WSP*; I claim if you don't allow %ES; you should not allow %WSP*; because it matches zero-or-more, allowing an empty-string to match in principle. Leaving off WSP* only eliminates the zero-characters option, as WSP+ is available. There are probably other places in the spec that have this same issue. Basically, anyplace that %ES is disallowed but the other character class entities are allowed. -- Mike Beckerle | OGF DFDL Workgroup Co-Chair | Tresys Technology | www.tresys.com -- dfdl-wg mailing list dfdl-wg@ogf.org https://www.ogf.org/mailman/listinfo/dfdl-wg Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU -- Mike Beckerle | OGF DFDL Workgroup Co-Chair | Tresys Technology | www.tresys.com Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU
participants (2)
-
Mike Beckerle
-
Steve Hanson