Sustainability of Digital Formats
|
|
Introduction | Sustainability Factors | Content Categories | Format Descriptions | Contact |
Full name | HDF4, Hierarchical Data Format, Version 4 and earlier |
---|---|
Description | At its lowest level, HDF is a physical file format for storing scientific data. The data structure types that HDF supports are Scientific Data Sets, Raster Images (General, 8-bit, 24-bit APIs), color palettes, text entries, and Vdatas and Vgroups.
|
Production phase | Generally used for middle- and final-state archiving. |
Relationship to other formats | |
Has subtype | Includes version 4.x and previous releases not documented separately here. |
Affinity to | HDF5, Hierarchical Data Format, Version 5 |
LC experience or existing holdings | None |
---|---|
LC preference | None |
Disclosure |
The HDF software was developed and supported by NCSA and is freely available. In July 2005, NCSA announced that the "Hierarchical Data Format group is spinning off from the National Center for Supercomputing Applications (NCSA) as a non-profit corporation supporting open source software and non-proprietary data formats."
Source code for the HDF libraries is available in Fortran and C. Some tools are available as Java source. |
---|---|
Documentation | http://www.hdfgroup.org/products/hdf4/ |
Adoption | These freely available tools are used by an estimated 2 million users in fields from environmental science to the aerospace industry and by entities including the U.S. Department of Energy, NASA, and Boeing. It is used world-wide in many fields, including Environmental Science, Neutron Scattering, Non-Destructive Testing, and Aerospace, to name a few. Scientific projects that use HDF include NASA's HDF-EOS project, and the DOE's Advanced Simulation and Computing Program. |
Licensing and patents | None. |
Transparency | TBD. |
Self-documentation | An HDF structure is self-describing, allowing an application to interpret the structure and contents of a file without any outside information. Supports user-defined attributes and annotations. |
External dependencies | None. |
Technical protection considerations | None. |
Dataset | |
---|---|
Normal rendering | Normal rendering for datasets not established yet. |
Tag | Value | Note |
---|---|---|
Filename extension | hdf |
From The File Extension Source. |
Internet Media Type | application/x-hdf |
From The File Extension Source. |
Magic numbers | Hex: 0E 03 13 01 |
From The File Extension Source. |
General |
There are two HDF formats, HDF (4.x and previous releases) and HDF5. These formats are completely different and NOT compatible. As of September 2005, there are no plans to drop support of HDF, but features will not be added. New projects are encouraged to use HDF5. Some of the HDF (4) limitations are: A single file cannot store more than 20,000 complex objects, and a single file cannot be larger than 2 gigabytes; the data models are less consistent than they should be. There are more object types than necessary, and datatypes are too restricted; the library source code is old and overly complex, does not support parallel I/O effectively, and is difficult to use in threaded applications. |
---|---|
History |
The HDF Group will be spinning off from the National Center of Supercomputing Applications (NCSA) as a non-profit corporation. The new corporation, "The HDF Group" (THG), will continue to support open source software and non-proprietary data formats. This move is expected to take up to six months to complete. A new web site has been set up for THG at: http://www.hdfgroup.org/ |
|