This is a static archive of the previous Open Grid Forum Redmine content management system saved from host redmine.ogf.org file /dmsf_files/8019?download=12540 at Fri, 04 Nov 2022 15:28:01 GMT

Basic WG charter info

Data Format Description Language (DFDL) Working Group

Global Grid Forum, Data Area

Chairs:

Martin Westhead, westhead@avaya.com

Mike Beckerle, beckerle@us.ibm.com

Jim Myers, jimmyers@ncsa.uiuc.edu

Email list:

dfdl-discuss@nesc.ac.uk

Web page:

https://forge.gridforum.org/projects/dfdl-wg/

Charter:

Focus/Purpose

XML provides an essential mechanism for transferring data between services in an application- and platform-neutral format. However, it is not well suited to large data sets with repetitive structures, such as large arrays or tables. Furthermore, many legacy systems and valuable data sets exist that do not use XML. The aim of this working group is to define an XML-based language, the Data Format Description Language (DFDL), for describing the structure of binary and character-encoded (ASCII/Unicode) files and data streams so that their format, structure, and metadata can be exposed. This effort specifically does not aim to create a generic data representation language. Rather, DFDL endeavors to describe existing formats in an actionable manner that makes the data, in its current format, accessible through generic mechanisms.

 

The DFDL description would sit in a (logically) separate file from the data itself. It would impose a hierarchical structure on the underlying bits and label them semantically. It would capture:

Further, if the data file contains highly repetitive structures, such as large arrays or tables, such a description can be very concise.
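The idea of a separate description that structures and labels raw bits can be illustrated with a minimal Python sketch. The dictionary layout, the field names (`count`, `samples`), and the `parse` helper are hypothetical and chosen only for illustration; this is not DFDL syntax, which the working group has yet to define. The sketch only shows how a short description, held apart from the data, can label an arbitrarily long repeated structure.

```python
import struct

# Hypothetical description, kept separate from the data it describes.
# This dict layout is an illustration only; it is NOT DFDL syntax.
DESCRIPTION = {
    "byte_order": "<",  # little-endian, no padding
    "fields": [
        {"name": "count", "type": "I"},                       # unsigned 32-bit int
        {"name": "samples", "type": "d", "repeat": "count"},  # array of 64-bit floats
    ],
}


def parse(data: bytes, description: dict) -> dict:
    """Label the raw bytes in `data` according to `description`."""
    order = description["byte_order"]
    offset = 0
    record = {}
    for field in description["fields"]:
        repeat = field.get("repeat")
        # A repeat count may be a literal or the name of an earlier field.
        n = record[repeat] if isinstance(repeat, str) else (repeat or 1)
        fmt = f"{order}{n}{field['type']}"
        values = struct.unpack_from(fmt, data, offset)
        offset += struct.calcsize(fmt)
        # Repeated fields become lists; single fields become scalars.
        record[field["name"]] = list(values) if repeat is not None else values[0]
    return record


# Example data: a count of 3 followed by three doubles.
raw = struct.pack("<I3d", 3, 1.0, 2.5, -4.0)
print(parse(raw, DESCRIPTION))
```

The description stays two entries long whether the array holds three values or three million, which is the conciseness referred to above.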

 

The potential benefits of having such a standard include:

Goals/Milestones

The goals of the group are as follows:

  1. To develop a proposal for a standard Data Format Description Language (DFDL), consisting of a general structure description language and an extensible set of property libraries, for which we will provide a base.
  2. To work with other groups within the GGF to ensure that the DFDL proposal conforms to other emerging Grid standards.
  3. To foster the development of reference implementations of libraries and tools that use the DFDL proposal.

The group aims to be very focused and to leverage existing implementation work (see references) in the development of reference implementations. We propose to produce the following documents:

  1. Core DFDL standard – this will describe the syntax and semantics of the core language and the extension mechanisms.
  2. Basic properties library
  3. Extended properties library

 

(Note: we anticipate that documents 2 and 3 may be split into multiple subdocuments covering different areas of the standard.)

 

Milestones:

References:
