Federal CIO Council XML Working Group
Draft Meeting Minutes, March 14, 2001
American Institute of Architects Board Room
Announcements
XSD is a namespace. Schema allow individual nodes in an XML document to be associated with type declarations in a schema. This allows namespace integration -- to ensure there will be no namespace collisions. DTDs don’t have namespace functionality and can only be associated with one schema.
XML Schema format is much more verbose than that of DTDs. DTD parsing can be difficult because of the complexity of DTD syntax whereas syntactic rules and full support of data types are embedded in the parser for Schema. Multiple levels of validation occur when using Schema. The Oracle XML Parser and Xerces by Apache are the two parsers that are currently the most compliant with W3C XML Schema. Schema saves money on development costs because the developer can offload simply logic onto Schema rather than other areas of coding- leaving simple validations up to Schema and allowing developers to concentrate on business processes. DTDs won’t go away overnight. They’re the only W3C approved standard.
The primary Schema components are: XML namespaces, element declarations, attribute declarations, simple type definitions, and complex type definitions. Type definitions are reusable structures and are used to create aggregate structures.
Secondary Schema components are: model group definitions, attribute group definitions, identity-constraint definitions, notation declarations, wildcards, and annotations.
Group definitions are for reuse. Include or Ignore sections are not available in Schema as they were in DTDs.
Namespaces were built to make sure elements and attributes are unique across different XML vocabularies. Schema has a two part naming convention: first the local name and then the Universal Resource Identifier (URI) of the namespace. Because of the second part, people tend to want to type the URI into their browser and resolve it. The only function of the URI is to ensure uniqueness of element names and attributes; it may not be capable of being resolved. Namespaces are needed for semantic level integration across languages and organizations. They help to avoid potential mismatches (e.g. different uses of the element name part number). The following tools support the XML Schema Candidate Recommendation from October 24, 2000 (with some limitation).
Presentation: "Applied XML Schema Design"
Mr. Alexander Falk, CEO and President of Altova - the XML Spy Company, made the presentation and gave the live demo. XML Spy is an XML document and schema editor with XML Schema capabilities. It provides a graphical representation of XML Schema components, its elements, and their content models. XML Schema can define the specific number of repetitions unlike in DTDs. XML Spy graphics are based on the Universal Modeling Language (UML). Attributes and attribute groups have no predefined order; they are in a list rather than in a tree. Complex types and substitution groups are a result of Schema’s influences by Object Oriented Programming paradigms.
There are several options for Schema development. A Schema can be developed:
This tool supports previewing, editing Extensible Stylesheet Language (XSL) or XSL Transformations (XSLT), connecting to repositories, converting XML, and is a good prototyping tool. Free evaluations are available at xmlspy.com. Data can be viewed in a nested format. XML Schema is a collection of components and a sequence of elements. Spy shows the data type and allows users to manipulate the data in a Graphical User Interface (GUI) format or plain text view. Complex data types can be used in multiple places and can all be manipulated in a GUI format and then viewed in plain text. Annotations, which give extra information to software or people, are also supported in GUI and plain text. Spy creates repository ready documentation.
Ms. Yee asked if elements could be easily changed (e.g. could US address easily be changed to international). Mr. Falk said yes and illustrated an example from the W3C Primer that is included in XML Spy.
Mr. Richard Campbell asked, "Can you combine multiply Schema into one?" Mr. Falk responded yes, use import and include namespace to so this. Mr. Falk added that this is also shown in the Primer.
Mr. Walt Houser asked if XML Spy plans to implement deltav (Internet Engineering Task Force (IETF) specifications) or the W3C digital signature recommendation. Mr. Falk responded that WebDAV has already been implemented and versioning will be available in the next version of Spy. Spy can associate and add files based on signatures and the W3C digital signature will be supported when it is recommended.
Mr. Ambur was concerned that Mr. Evdemon had said XML Schema should not be used in production systems and asked what the work group should be advising as far as Schema is concerned. Mr. Falk agreed with Mr. Evdemon but added that many people are already using it. The benefits are available and organizations need to move ahead. The W3C Proposed Recommendation should be pretty stable and big companies are supporting it. Many tools already support it. One goal is to develop vocabulary and he suggested that you should start doing that with XML Schema now. Mr. Evdemon says XML Solutions recommends their internal consultants to use Schema if the process is just getting started.
Mr. Mark Crawford commented that standards bodies are currently doing work based on the W3C XML Schema Proposed Recommendation.
Mr. Houser asked, "How do you handle error trapping? How about array (x) data types in the Schema?" Mr. Falk responded that when it comes to DTD validation the document is either valid or not. The Schema process is different. The XML document instance is first validated to see if it is well-formed and then the Schema validation occurs. In Schema validation, compliant post schema validation set has an augmented view of the validation set with more information; it says whether each component is okay.
Mr. Houser asked how does Spy trap errors when using XSL? Mr. Brian Cole responded Schema can detect if an error tag is valid if it is PCDATA. Mr. Evdemon added different parsers show different errors (e.g. Xerces verses Oracle). Post validation schema can vary quit a bit whereas core syntax group validates information into a set that reflects the low level primitive type that you see in XML. The main thing is how to include support for parameter entities. Post Schema validation would return errors as if it were part of the Schema. Many people are looking to W3C for a standard. Therefore, error support and handling varies between vendors.
Mr. Houser asked, "Is there a specification?" Mr. Evdemon replied that there is SAX (not supported by W3C) and DOM APIs, but again there are differences between vendors.
Break
Mr. Ambur indicated that unless additional corrections are needed the February 14, 2001, minutes will be adopted. If anyone has updates, please communicate them to Laura Green at lgreen@lmi.org.
Presentation: "Formatting XML Records for Capture and Management in DoD Certified Electronic Records Management Systems"
This presentation is available on-line through xml.gov in PowerPoint or HTML. Mr. Jim Capparelli of BroadVision gave the presentation. Mr. Capparelli introduced the following company’s responsible for the demonstration and their components of the presentation:
Mr. Capparelli showed the workflow for a request to update OMB Circular A-130 using MSWord as the XML authoring tool (utilizes i4i products). Using MS Word to create the XML is easy for users because of its common look and feel. Mr. Capparelli opened the A-130 XML document and demonstrated how the user can turn tags on and off in Word. He made changes to the XML instance and then saved it. FileNET’s Panagon Content Services and Panagon IDM Desktop provide the foundation for this demo. The Panagon repository:
Mr. Capparelli resumed the presentation and said once the document is part of the official record it can be distributed to the masses. He said it doesn’t use cookies, but there wasn’t time to go into the details about that. You login as a particular user (i.e. xml.gov user) to see the document in Hyper Text Markup Language (HTML). If you login as a different user and a different view is available. This was a demonstration of the versatility of XML’s feature of separating format from content. Mr. Capparelli showed another example of this by demonstrating how a user could see the OMB Circular on a cell phone, Palm, etc.
Mr. Ambur said everything that exists is a record and that agencies need to get over the notion that some records are not "official" records. Any information that exists is subject to release under a FOIA request or subpoena. Publishing and providing access to information are value-additive extensions of the basic requirements for the management of records throughout their full life cycles. Accessibility, search, and retrieval can be tremendously improved -- not only on the Internet but also on user's own hard drives -- by the application of a logical taxonomy represented in XML metatags.
Mr. Royal had a technical question on how elements change dynamically. Are they created in a database? Mr. Capparelli answered they are not part of the original document; it creates a new instance with the changes.
Mr. Royal asked does it take advantage of metadata. Mr. Houser asked what is the protocol on objects for access control. Mr. Capparelli said Oracle 8i is the underlying database. If LDAP is required, that’s a customization; add users to database using login and password.
Mr. Houser asked what about public key. Mr. Capparelli said it’s not out of the box, that would require customization.
Mr. Houser asked if the level of security was only passwords and ids. Mr. Capparelli said read write manage know access; if a user is writing another application they can use an API (DOM) that usually happens behind the firewall. Mr. Houser responded most attackers are behind the firewall; behind a firewall is a little less secure. Mr. Marty Heinrich from Impact Innovation said third party signature supports PKI (Public Key Infrastructure) integration. Mr. Houser asked, "Have you made it work?" Mr. Heinrich suggested exchanging cards. Mr. Houser said I don’t see an improvement over MS Security control. Mr. Capparelli said XML repository side is records management rather than an official document. Ms. Hilda Johnson said the US Patent and Trademark Office has bundled i4i's product with a PKI solution from Entrust. Digital signatures are required for patent applications. Without them, the applications are not accepted..
Mr. Bill Morgan asked does it have an audit trail. Mr. Capparelli answered audit logs are available through the Oracle database. Fleet, Walmart, and other customers have third party security that the outside world is hitting.
Mr. Rollin Ridgeway from NASA asked if the presentation and contact information was available on xml.gov. Mr. Capparelli responded that all contact information was available on xml.gov except BroadVision’s.
Mr. Houser asked if this had been presented to NARA. Mr. Dan Jansen responded he represented NARA.
Mr. Royal stated that PKI may not be right and there’s a need to look more at digital signatures, especially if you want to sign at a section level.
Mr. John Milligan invited everyone to a presentation on National Archives April 5 by BroadVision on developing electronic archives. All are invited. It’s only $10 for a full morning and early afternoon that expounds on today’s presentation.
Mr. Ambur announced that the Defense Logistics Information Service (DLIS) has one slot in next month’s meeting and the other slot is open. He asked for suggestions on filling that spot and suggested possibly an update on ebXML. Mr. Crawford suggested an update on the EWG /X12 Core components initiative that will continue 19-23 March 2001 in Tyson’s Corner at the EDIFACT Work Group (EWG) meeting. Mr. Houser suggested an update on the 18-23 March 2001 IETF meeting in Minneapolis. Mr. Crawford also suggested Mark Nobles of LMI who is a member of the W3C Security working group could present an update on W3C Security efforts.
Ms. Amie Milan of KPMG noted that the XBRL consortium has responded to the issues raised in DFAS proof of concept. [Editor's note: The consortium's response will be posted at http://xml.gov/documents/completed/xbrl_consortium_responses.htm.]
Mr. Michael Palmer announced a meeting at KPMG Consulting in Tyson’s Corner, 12 floor, on April 4, from 1:00 -4:00 p.m. It is open to anyone with an interest in XBRL.
XML Working Group Attendance List
March 14, 2001
Last name |
First name |
Organization |
Baldwin |
Nathan |
DISA |
Benedict |
Robert |
NASA |
Bennett |
Daniel |
CitizenContact.com |
Billups |
Prince |
DISA |
Burnstein |
Michael |
Sybase |
Campbell |
Richard |
FDIC |
Capparelli |
Jim |
BroadVision |
Cole |
Brian |
NSF |
Crawford |
Mark |
LMI |
Dingman |
Gail |
USPS |
Drest |
Dan |
BroadVision |
Fitzpatrick |
Bob |
Filenet |
Glace |
Jessica |
LMI |
Gluck |
Daniel |
OPM HR Data Network |
Heinrich |
Marty |
Impact Innovations |
Hoglund |
Bruce |
DLA (Consultant) JECPO |
Horton |
Sherry |
DLA |
Houser |
Walt |
VA |
Hunt |
Jim |
GSA |
Jansen |
Dan |
NARA |
Johnson |
Hilda |
i4i |
Knight |
Conrad |
USPS |
Laing |
Grant |
Intellor |
Leverson |
Steve |
US Courts |
Milan |
Amie |
KPMG |
Milligan |
John |
CSC |
Morgan |
Jane |
GSA |
Niemann |
Brand |
EPA |
Palmer |
Michael |
KPMG/XBRL |
Poot |
Lex |
DTS |
Ridgeway |
Roland |
NASA Headquarters |
Royal |
Marion |
GSA |
Sakran |
Nasra |
Filenet |
Saldo |
Paul |
Novient |
Sinisgalli |
Mike |
XML Solutions |
Todd |
Mike |
OSD |
Tulley |
John |
i4i |
Veith |
Lisa |
Impact Innovations G |
Vineski |
Steve |
EPA |
Walker |
John |
KPMG Consulting |
Walker |
Patrick |
Impact Innovations G |
Warrington |
Earl |
GSA |
Yee |
Theresa |
LMI |