James,

Is the problem that the dfdl:terminator '%CR;%LF;' for the end of the header record is firing prematurely when it encounters the CRLF in the data? If so, then I'm not sure that DFDL can ignore the extra %CR;%LF; without using an escape scheme, but there isn't an escape scheme to use.

I would model the data as unbounded repeating records, and use a discriminator to distinguish the repeats from the next header.
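
As an illustration only, the logic behind that modelling can be sketched in Python rather than DFDL. This is not a DFDL schema: the names `is_continuation` and `group_header_records` are hypothetical, and the fold positions in the sample are assumed. Each CRLF-terminated line is one record, and a test on the first character plays the role of the discriminator, deciding whether a record is a repeat of the current header or the start of the next one.

```python
CRLF = "\r\n"


def is_continuation(line: str) -> bool:
    """Discriminator-style test: a folded line starts with a space or tab."""
    return line[:1] in (" ", "\t")


def group_header_records(raw: str) -> list[list[str]]:
    """Group CRLF-terminated lines into one list of records per header."""
    headers: list[list[str]] = []
    for line in raw.split(CRLF):
        if line == "":
            # Empty piece after the final CRLF, or the blank line that
            # ends the header block: stop reading records.
            break
        if is_continuation(line) and headers:
            # A repeat: belongs to the header currently being read.
            headers[-1].append(line)
        else:
            # Not a repeat: this record starts the next header.
            headers.append([line])
    return headers


# Sample data with assumed fold positions (not taken from a real message).
sample = (
    "Received: from smtpksrv1.mitre.org" + CRLF
    + " by localhost (Postfix);" + CRLF
    + " Tue, 4 Jun 2013 14:03:24 -0400 (EDT)" + CRLF
    + "Subject: example" + CRLF
)
records = group_header_records(sample)
```

Here every record keeps the plain CRLF terminator, so the terminator never needs to be escaped; only the grouping decision depends on what follows it.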
Regards
Steve Hanson
Architect, IBM Data Format Description Language (DFDL)
Co-Chair, OGF DFDL Working Group
IBM SWG, Hursley, UK
smh@uk.ibm.com
tel:+44-1962-815848
From: "Garriss Jr., James P." <jgarriss@mitre.org>
To: "dfdl-wg@ogf.org" <dfdl-wg@ogf.org>
Date: 04/06/2013 19:56
Subject: [DFDL-WG] Ignore extraneous CRLF w/ space?
Sent by: dfdl-wg-bounces@ogf.org
Long IMF headers, such as Received, can
be wrapped onto the next line by using a CRLF and then a space. This
example has 3 such wrappings:
Received: from smtpksrv1.mitre.org (localhost.localdomain
[127.0.0.1])
by localhost (Postfix) via Exchange
Front-End Server webmail.afmc.af.mil
([131.28.34.85]) with SMTP id 0A8791F116E
for <jgarriss@mitre.org>; Tue,
4 Jun 2013 14:03:24 -0400 (EDT)
How do I get DFDL to ignore these wrappings?
For most of the header, it’s not an issue, because I can use a lengthPattern
to look ahead to the ; before the date starts. But once the date starts,
I have no way of knowing when it ends, so I need to ignore any CRLF that is
followed by a space.
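
To make the folding rule concrete, here is a minimal sketch in Python, outside DFDL, of what "ignoring" the wrappings amounts to. It assumes a fold is a CRLF immediately followed by a space or tab; the name `unfold` and the fold positions in the sample are hypothetical.

```python
import re

# A fold is a CRLF immediately followed by a space or tab. The lookahead
# leaves the whitespace in place, so only the CRLF itself is removed.
FOLD = re.compile(r"\r\n(?=[ \t])")


def unfold(header_block: str) -> str:
    """Remove folding CRLFs so each header occupies one logical line."""
    return FOLD.sub("", header_block)


# Sample data with assumed fold positions (not taken from a real message).
folded = (
    "Received: from smtpksrv1.mitre.org\r\n"
    " by localhost (Postfix);\r\n"
    " Tue, 4 Jun 2013 14:03:24 -0400 (EDT)\r\n"
    "Subject: example\r\n"
)
unfolded = unfold(folded)
```

After unfolding, the only CRLFs left are the ones that really end a header, which is the behaviour the date field needs.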
TIA
--
dfdl-wg mailing list
dfdl-wg@ogf.org
https://www.ogf.org/mailman/listinfo/dfdl-wg