UNITED STATES DEPARTMENT OF AGRICULTURE

 

 

 

 

 

 

 

 

 

FOOD AND DRUG ADMINISTRATION

 

FOOD ADVISORY COMMITTEE

 

 

 

MEETING ON INFANT FORMULAS

 

 

 

 

 

 

 

 

 

 

 

 

 

Monday, November 18, 2002

 

 

8:20 a.m.

 

 

 

 

 

 

 

 

U.S. Department of Agriculture

Animal and Plant Health Inspection Service Building

4700 River Road

Riverdale, Maryland


Temporary Voting Members Present

 

James Anderson, Ph.D.

Robert D. Baker, M.D., Ph.D.

Margaret E. Briley, Ph.D., R.D., L.D.

Scott Denne, M.D.

Cutberto Garza, M.D., Ph.D., Chairman

James E. Heubi, M.D.

Laurie J. Moyer-Mileur, Ph.D., R.D., C.D.

Virginia A. Stallings, M.D.

Patti Thureen, M.D.

 

Participating Food Advisory Committee

Members Present

 

Annette Dickinson, Ph.D.

Goulda Angella Downer, Ph.D.

Lawrence N. Kuzminski, Ph.D.

Madeleine J. Sigman-Grant, Ph.D.

 

Acting Industry Representative

 

Roger A. Clemens, Dr.P.H. CNS FACN


FOOD ADVISORY COMMITTEE ON INFANT FORMULAS

 

AGENDA

 

Welcome, Introduction and Charges

  Christine J. Taylor, FDA    4

 

Administrative Issues -- Cathy DeRoever,

  Mary Ann Killian and Jeanne Latham, FDA    7

 

Regulatory Background

  Christine J. Taylor, FDA    23

 

Remarks by Chairperson, Cutberto Garza    32

 

Presentations by Invited Speakers

  Physical Growth Measurements

    W. Cameron Chumlea    43

 

  Body Composition, Kenneth J. Ellis    72

 

  World Health Organization Growth

    Reference, Edward A. Frongillo    103

 

  National Center for Health Statistics/

    Centers for Disease Control and

    Prevention Growth Charts, Lawrence M.

    Grummer-Strawn    127

 

  Iowa and Iowa/Fels Growth Data

    Samuel J. Fomon    152

 

  Growth Data for Preterm Infants

    Jon Tyson    169

 

  Analytical Issues, Edward A. Frongillo    202

 

Questions to Previous Speakers

 

  Clinical Studies -- Product Composition

  Considerations, Duane Benton    228

 

  Clinical Studies -- Clinical

    Consideration, Dennis M. Bier    259

 

Questions to Speakers    283

Preliminary Discussion of Issues by Committee    313

Concluding Remarks    383


P R O C E E D I N G S

    DR. TAYLOR:  I'm Christine Taylor, and I'm director of Office of Nutritional Products Labeling and Dietary Supplements at FDA's Center for Food Safety.

    We will be this morning going through a series of conversations, and what you've got right now is our overview.  So given the fact that we started just a few minutes late, we'll go ahead and try to shortcut this overview.

    In the next few minutes, we'll do a brief overview for this Food Advisory Committee meeting.  We'll review a couple of the administrative issues, which will be focused on ethics and conflicts of interest.  We'll spend some time on regulatory context for this meeting, and then we will begin the meeting per se with Dr. Bert Garza serving as chair.

    During the meeting, there will be a series of presentations and white papers, which we have provided as background information for the committee.  There will be some discussion, and then beginning tomorrow there will be public comments, more discussion, and response.

    We're going to spend just a very few minutes this morning going over the role and expertise of the task force members, and I think the key point to be recognized is that there are several different kinds of members sitting with us today.

    FDA considers a number of factors in selecting individuals to serve on the Food Advisory Committee, including their scientific expertise, as well as issues related to conflict of interest.

    We also have sitting with us a consumer representative.  This person is a voting member of the Committee and represents the consumer perspective on issues and actions that come before the Committee.

    We also have an industry representative sitting with the committee.  This is a nonvoting member, and they're responsible for representing all members of the industry, and not any particular association, company or product.

    Basically, the kinds of members we have on this committee are temporary voting members, as well as some members of our larger Food Advisory Committee.  As I mentioned, we also have consumer reps and industry reps.

    What I'd like to do, at this point, starting with Dr. Baker, if you will, Dr. Baker, just so we can get used to using the microphones, if you would let us know your name, where you are, and if you're too modest, I have a listing of your expertise.

    DR. BAKER:  Robert Baker from Buffalo, New York.  I'm a pediatric gastroenterologist, and I have a Ph.D. in biochemistry and in nutrition.

    DR. STALLINGS:  I'm Virginia Stallings, from Children's Hospital in Philadelphia.  I'm the head of the Nutrition Section there, and I do work in healthy children and children with chronic disease related to nutrition.

    DR. HEUBI:  I'm Jim Heubi.  I'm a pediatric gastrologist, as well.  I'm the program director for the GCRC, the General Clinical Research Center at the Children's Hospital in Cincinnati, and I have a longstanding interest in nutrition relating to infant nutrition bone disease, cholesterol, metabolism, you name it, there's a variety of things.

    DR. ANDERSON:  I'm Jim Anderson.  I'm at the University of Nebraska Medical Center in Omaha, Nebraska.  I'm chairman of the Department of Preventive and Societal Medicine, and I'm a biostatistician by training.

    DR. DOWNER:  I'm Goulda Downer, a doctorate in Human Nutrition, with a residency in pediatrics at Georgetown.  Currently, I'm a clinical nutritionist with my own practice, and I'm also on faculty at George Washington University.

    MS. SIGMAN-GRANT:  I'm Madeleine Sigman-Grant.  I'm a maternal and child nutrition specialist at the University of Nevada Cooperative Extension.

    DR. MOYER-MILEUR:  I'm Laurie Moyer-Mileur, from the University of Utah.  I'm a registered dietician with a doctorate in exercise physiology, and I have over 20 years of neonatal nutrition experience.

    DR. GARZA:  I'm Bert Garza.  I'm a professor of nutrition at Cornell University.  I'm both an M.D. and have a Ph.D. in nutritional biochemistry and metabolism, and my primary interests have been in maternal-child health, with interests in growth, and protein and energy metabolism.

    DR. KUZMINSKI:  I'm Larry Kuzminski.  I'm from Duxbury, Massachusetts.  I'm retired from the food processing industry, having R&D responsibilities and operations responsibilities with the Kellogg Company and with Ocean Spray Cranberries.

    DR. DENNE:  I'm Scott Denne.  I'm from Indiana University.  I'm a pediatric neonatologist.  I have a longstanding interest in neonatal nutrition, specifically, and protein and energy metabolism.

    DR. THUREEN:  I'm Patti Thureen, a neonatologist from the University of Colorado in Denver, and my particular interest is in protein and energy metabolism in the extremely low-birth-weight neonate.

    DR. BRILEY:  I'm Margaret Briley from the University of Texas at Austin, and my expertise has been in nutrition of children and child care.

    DR. TAYLOR:  If we could just stop right there.  Margaret is our consumer rep, and on our right we have Dr. Roger Clemens, who is substituting for Annette Dickinson, who is our industry rep.

    I'll go to the next slide and just give a minute or two about the staff you have sitting at the table with you.  As I've mentioned, I'm with the Office of Nutritional Products Labeling and Dietary Supplements.

    We also have Dr. Susan Walker, who is our associate director for Clinical Affairs, as well as Dr. Beth Yetley, who's the lead scientist for nutrition.

    Jeanne Latham, who is sitting next to Dr. Bert Garza, is our executive secretary, and we're being joined today by Ms. Mary Ann Killian, who is program integrity adviser at the Office of Human Resources at FDA.

    Let me just spend a very quick minute, and then we will return with a regulatory context.  I think in terms of mechanics, we need to understand kind of where we are in the process.  Currently, we are operating as an ad hoc task force to the Food Advisory Committee.  In the very near future, we will constitute an Infant Formula Subcommittee of the Food Advisory, but currently we are still in the ad hoc mode.

    The current focus of the Infant Formula Advisory Meetings is to obtain scientific input for evaluating whether new infant formula supports normal physical growth of infants.  This comes under Section 412 of the Food, Drug, and Cosmetic Act, which in a few moments we'll come back to in more detail.  We're looking basically for scientific input, which eventually will inform the Agency relative to regulatory efforts.

    What we're undergoing currently is a series of meetings, and I'm sure most of you remember that last April we held our first meeting on this issue of normal physical growth.  It included a somewhat general discussion, an effort to understand the regulatory context, as well as a few specific questions about extrapolation and attrition in the study.

    This is the second of this series, and the general scientific topics for today fall into three categories: Growth Measures and Methodologies, the Role of Such Measures and Methodologies in  Demonstrating Normal Physical Growth, and then, finally, Principles and Criteria to Determine the Need for a Clinical Study to Provide the Agency an Assurance of Normal Physical Growth.

    It's always helpful to be clear about what's not on the table.  There are so many issues in the area of infant formula, normal physical growth, other issues related to the Agency's regulatory purview that sometimes it's important to realize there are things that are of great interest, but are not on the table for discussion.

    This lists a few, probably the ones that our discussions will most likely tend to gear toward.  The design and conduct of studies is not on the agenda today, other endpoints of clinical studies is not on the agenda.  What constitutes major and minor changes is not on the agenda.  That's, of course, for those of you that are intimately involved in the regulatory, you understand that that has regulatory meaning.  The nutritional impact or efficacy of formulas, the safety of individual ingredients and specific regulatory decisions are not topics for today.

    We have provided specific background for the committee in the form of white papers.  We have a total of nine white paper which, as Dr. Garza will explain in a few moments, we'll go through this morning.  Each of the papers will be introduced by an expert, and then of course discussed by the committee as appropriate.  Those related to the assessment of normal physical growth are listed here, and then for our second topic, changes warranting a clinical study, we have two white papers.  Those should be in your notebooks and available for further discussion.

    Now, the Agency's role is to give you specific charges that are to be accomplished by the end of the meeting on Tuesday, and those charges are in your notebook in the form of seven questions, and I won't go through them now.  I think Dr. Garza will take the time to do that with you later on, but they fall into basically four categories: Metrics for evaluation of growth, which is Questions 1, 2, 3A and 3B; questions about comparators, Questions 4 and 5; controlled feeding parameters, Question 6; and then changes in composition, Question 7.

    Just for the group of us here, the summary of the charges fall into two categories: The criteria for adequate evaluation of normal physical growth during the first six months, and here are several substantive ones.  Again, they are specifically it in your questions; and then, secondly, the type of changes in infant formula that should warrant a clinical study.  Again, those are the remaining questions in your notebook.

    Just in terms of the mechanics, this morning, next, we'll cover the administrative issues, Jeanne Latham, as assisted by Mary Ann Killian, will go through that with you.

    I'll return, and with the help of Dr. Walker and Dr. Yetley, give you some regulatory context and then the actual task force meeting will begin.

    What we'll do is hold questions until after the administrative component, and then again after the regulatory context, and then we should be on our way.

    So, Jeanne, I'll turn the meeting over to you, and dutifully return for the next part.   Thank you.

    MS. LATHAM:  Good morning.  I'm Jeanne Latham and, first of all, in terms of administrative issues, we wanted to have Cathy DeRoever's statement read into the record, and Dr. Garza will take care of that.

    Thank you.

    DR. GARZA:  Catherine DeRoever, the executive secretary of the Food Advisory Committee, was asked to take a few minutes to refresh everyone's memory about a few of the rules of the road, in terms of Advisory Committee operations, so I'm going to be reading her statement.

    It is my understanding that all committee members have been provided with a copy of a Committee Member Guide to FDA Advisory Committees and a video.  The video's title is "A Panel Member's Responsibility."  I believe there are copies of the Member Guide available at the registration desk for anyone who may be interested.  The Committee Member Guide is in need of updating but, by and large, it provides a good operational overview.

    FDA relies on its Advisory Committees to provide the best-possible scientific advice available to assist us in making complex decisions.  Our goal is to do this in as open and transparent a manner as possible.  Part of that openness carries with it a request that the members try to avoid even the appearance that issues are being decided or conclusions are being reached outside the actual meeting.

    We understand that issues raised during the meeting may well lead to conversations over breaks or during the meal.  In fact, we hope the discussions are thought-provoking.  We have had instances where the members have come back from a break and said, "You know, we were talking over break, and we would like to request that FDA provide us some additional information so we can better understand thus and such."  This is perfectly acceptable.

    What we don't want is to have a situation where after the break the members come back and say, "We were talking over break, and we decided that the answer to Question 1 is..."  From our perspective, that would be particularly troublesome because neither the Agency, nor the public, would have had the benefit of listening to the entire discussion, the questions raised, the responses, et cetera.

    In fact, FDA has recently adopted a policy that only matters that can be decided by a show of hands are procedure matters, for example, break times.  I'm not sure I understand that.

    [Laughter.]

    DR. GARZA:  All other votes and comments must be placed on the record, attributed to the member making the statement.  The policy goes even further.  If a member has to leave the meeting early, that member waives the right to vote.  You may wonder why would the person lose their right to vote, but the answer is fairly simple.  FDA believes all parts of the meeting and the discussions are important.  Consequently, voting on issues without having the benefit of all of the discussion would be premature.

    The issue of openness is larger than what transpires during the course of the meeting.  I would like to call your attention to the section in the Members' Guide, titled, "Member Interaction Before, During and After a Meeting."  In essence, this section underscores the fact that all communication with the members should be routed through the Committee's executive secretary.  No one, not even FDA staff, with the exception of the executive secretary, should be contacting the members about upcoming meetings, topics, et cetera.

    This same guidance applies to consultations between members prior to a meeting.  If a member receives an inappropriate contact, the members should feel free to notify the executive secretary and/or refer the person making the contact to the executive secretary.  Our goal in having all contacts routed through the executive secretary is to minimize any situation that could be misinterpreted.

    Appearance issues are always difficult because, as is true of many things, appearances can be deceiving.  We ask that our members, guest speakers, and everyone attending the meeting be mindful of how an interaction between a member and anyone, for that matter, might be perceived.

    Please let me be clear it is not my intention to question anyone's motives or integrity, but I am very sensitive to the issue because I have, and imagine so have you, seen highly respected individuals become the object of negative attention based on a misperception, and I certainly wouldn't want anyone in this room to become such a target.

    I am confident that everyone here is sensitive to these issues and can appreciate that my comments are intended as a gentle reminder.

    Thank you.

    Any questions?  Which I will refer to Ms. Latham.

    [Laughter.]

    DR. GARZA:  From any of the committee members?  Is all of that clear?

    [No response.]

    DR. GARZA:  Thank you.

    MS. LATHAM:  Good morning.  I am Jeanne Latham, the executive secretary for the FDA's Food Advisory Committee on Infant Formula.  I want to welcome everyone, and I'd like to read the conflict of interest statement for the record.

    The following announcement addresses the issue of conflict of interest with respect to this meeting and is made a part of the record to preclude even the appearance of such at this meeting.

    By the authority granted under the Food Advisory Committee Charter of July 2002, the following individuals have been appointed as temporary voting members by Joseph A. Levitt, director, Center for Food Safety and Applied Nutrition:

    James Anderson, Ph.D; Margaret Briley, Ph.D.; Robert Baker, M.D., Ph.D.; Scott Denne, M.D.; Cutberto Garza, M.D., Ph.D.; James Heubi, M.D.; Laurie Moyer-Mileur, Ph.D.; Virginia Stallings, M.D.; Patti Thureen M.D.

    The issues to be discussed at this meeting are issues of broad applicability.  Unlike issues in which a particular sponsor's product is discussed, the matters at issue do not have a unique impact on any particular product or manufacturer, but rather may have widespread implications with respect to all infant formulas and their manufacturers.

    To determine if any conflicts of interest exist, the committee participants have been screened for interest in companies that make infant formula.  As a result of this review, in accordance with 18 United States Code, Section 208(b)(3), Dr. Cutberto Garza has been granted a particular matter of general applicability waiver that permits him to participate fully in the matters at issue.  A copy of the waiver statement may be obtained by submitting a written request to the Agency's Freedom of Information Office, Room 12A30 of the Parklawn Building.

    With respect to FDA's invited guest speakers, there are reported interests that we believe should be made public to allow the participants to objectively evaluate their comments.

    Dr. W. Cameron Chumlea has a grant from Nestle to serve as a coordinating center for a nutritional study of Chinese elderly.

    Dr. Samuel Fomon previously consulted with firms that make infant formula and is likely to do so in the future.

    Dr. Duane Benton owns stock in Abbott Laboratories, and he receives retirement benefits from Abbott.

    Dr. Dennis Bier's employer, the ARS Children's Nutritional Research Center, recently received the Bristol-Myers Squibb-Mead Johnson nutritional 2002 unrestricted nutritional research grant.  As Center director, Dr. Bier is named as the principal investigator, although no funds come to him personally or for his personal research.

    We would also like to note for the record that Dr. Roger Clemens is participating in this meeting as the acting industry representative and a nonvoting member of the Committee.

    In the event that the discussions involve any other issues not already on the agenda, for which FDA participants have a financial interest, the participant's involvement and their exclusion will be noted for the record.

    With respect to all other participants, we ask, in the interest of fairness, that they address any current or previous financial involvement with any firm that makes infant formula.

    Thank you.

    With that, I will turn the program back over to Dr. Taylor.

    DR. TAYLOR:  Thank you very much, Jeanne.

    Our goal for the next 15 or 20 minutes is to set the regulatory context for the discussions we're having today.  For those of you that remember the spring meeting, we did spend some time on that, and hopefully most of this is a review, and all we have to add is an additional focus relative to the topic for today.  As I've mentioned earlier, this is an ad hoc task force of the Food Advisory Committee, addressing infant formula issues.

    Obviously, we have statutory authority relative to infant formula, and its long history goes back to 1980, at which time Congress passed special legislation that amended the Food, Drug, and Cosmetic Act.  We, in the Agency, try to avoid throwing numbers, and clauses and phrases around, but it's almost impossible not to, and the key phrase is that it provided Section 412 to the Food, Drug and Cosmetic Act.

    In 1986, Congress had an interest in adding to this, providing more statutory authority, and so there were some additional amendments in 1986.

    I think what we have to keep in mind is that the infant formula legislation happened for a very specific reason.  Infant formula is unique from other foods.  It is the sole source of nutrition for a vulnerable population.  In Congress's mind, it therefore warranted a special set of provisions for regulation.

    It's clear that the intent and outcome of this action was the following statement from Congress.  It should not only be safe, which I would point out is handled in separate sets of provisions than what we're addressing today and contain all of the necessary nutrients, which again is the separate set of provisions which are not on the table for today, but also should provide those nutrients in a bio-available form to ensure that the infant formula were to support optimal infant growth and health.  That's what we're about today in some respects.

    This chart is a little complicated at first, but I think it sets the context for what we call to be regulatory boxes, and really what's on the table today is this particular component, but all of this is the regulation of infant formula.  The safety of the individual ingredients, the classic safety considerations are handled under a separate set of provisions, the so-called Section 409.  So ingredients for intended use, that's where most of your classic safety reviews come in.

    Section 412, as provided for by Congress in 1980, is really a statutory check on a particular formulated product.  In providing those assurances, companies consider the required nutrients that have to be in the formula, the good manufacturing practices or GNPs and quality controls, and then quality factors.

    Today, we are focusing on quality factors.  These assurances are provided prior to marketing, and once marketing occurs, in the world of infant formula, the claims then come in, as far as efficacy, truthful, and not misleading, again, a separate set of provisions.  I'll come back to this in a moment, but the key component is that we're here taking a look at that.

    Now, as mentioned just a second ago, in order to provide these assurances, manufacturers submit a notification to FDA 90 days prior to their intention to market that particular infant formula.  Again, as I mentioned, it's specific to a finished product.  The Agency reviews it, again, as I just mentioned, for those three components, and it's here, hopefully, highlighted in red that our questions today will focus.

    The definition of quality factors is not precise.  They certainly do offer the opportunity of expanding, as needed.  There is language from a 1980 discussion in the House Committee, and their references to quality factors focus on things such as pertain to the bioavailability of a nutrient and the maintenance of levels or potency.  They discuss at great length the growth of infants during the first few months of life, and they discuss the concept of healthy growth, the idea being that once you've formulated a product, it needs to support healthy growth.  So, in its simplest form, quality factors are a check on the concern that once you get the entire product put together it works appropriately.

    Now the types of quality factors could be many.  At this point, we basically have two.  In the realm of nutrient-specific, we have provisions for protein efficiency ratios, protein per se, but over time, others could be put in place.  In the world of the formulation itself, the totality of the formulation the quality factor we address is normal physical growth, and, again, others could be put in place over time as needed.

    So, for today, normal physical growth as  quality factor is what's on the table.

    The scientific questions that will come through as you read the charges are basically twofold.  How do you measure and affirm normal physical growth and how and when should assurances of normal physical growth be appropriately provided?

    Going back to that again, this particular slide, quality factors, normal physical growth, assurances for a specific product, along with other components of these assurances.

    Now, just so that we're sure how it works from a regulatory perspective, we've put in this slide, but I think it's redundant to what we've said before.  In order to provide assurances, vis-a-vis Section 412, the manufacturer submits a notification 90 days prior to marketing.  FDA reviews the notification package, taking into all of the components, nutrients, GNPs, quality control and quality factors, and if assurances are adequately provided, FDA does not object to the marketing of the formula.

    If, in the Agency's opinion, assurances are not adequately provided, FDA does let the company know that it objects to the marketing of this particular formulation.  It's important to note, from a regulatory perspective, that this is not a premarket approval process, so manufacturers do have the right to go to market over FDA's objections.

    The scientific input we get from you folks today and tomorrow will certainly guide our thinking about the evaluation of normal physical growth when infants are fed a new formula.  We'd like to point out that it certainly helps us, but it's also helpful to stakeholders in that what is expected becomes clear to them.  It's not as much of a black box if it's quite clear how FDA's scientific considerations are handled.

    It's also going to guide our thinking about when clinical studies should accompany formulation of processing changes in infant formulas, and again it's helpful to us, but it's also helpful to our stakeholders.

    The outcome of today's discussions can be used to inform our ongoing reviews, but we do need to talk a little bit about current rulemaking, in that discussions today have the opportunity or the possibility of impacting on current rulemaking activities.  The current state of our rulemaking, as probably many of you know, is that in 1996, we proposed a rule to implement parts of Section 412, and in that was included the implementation of quality factors.

    That final rule has not been issued, so we are still in the process of what's known as rulemaking.  If input from this Committee is relevant, and it may or may not be, but if input from this Committee is relevant, there would need to be an opportunity to comment on that, and we would, of course, reopen the comment period on this rule for that purpose.  So we retain the option of reopening the comment period.

    So, again, just to review, you've seen this before, today's discussions, vis-a-vis the charges, our growth measures and methodologies, the role of such measures relative to normal physical growth, and the general principles and criteria to determine the need for a clinical study to provide assurances of normal physical growth.

    I think we've gone over topics not under discussion, so I'll mention these only in passing, and then again remind you that the specific charges from the Agency to the Committee are the seven questions in your notebook, and I'm sure Dr. Garza will go over those with you in some detail.

    I do want to introduce Dr. Susan Walker, who's at the table here, our associate director for Clinical Affairs, and Dr. Beth Yetley, who's our lead scientist, and the agreement we have is if you have questions, I will go join them, and we will answer them as a troika.

    Thank you very much.

    MS. LATHAM:  Are there questions?

    DR. GARZA:  Are there any questions to Dr. Taylor?

    [No response.]

    DR. GARZA:  Thank you very much.  That is clear, judging from the lack of questions from the Committee.  I do want to take this opportunity to welcome the Committee members, and guests, and staff that have joined us, and to thank Dr. Taylor because we've made up the lost time.  I was concerned that we would be running late, but we're doing all right in terms of time.

    We have a very full agenda and would like to begin by asking the Committee members if, in fact, they have any questions about the agenda.

    [No response.]

    DR. GARZA:  Very quickly, just to review the procedure, we're going to be launching into one of the major segments of the Committee meeting in just a few minutes, and that is nine presentations, based on the nine background papers which were sent to each of us several weeks ago.  Only Committee members have the privilege of asking questions to any of these presenters.  We will try to hold presentations to about 15 minutes.  Someone will be helpful in alerting the speakers when I think there is about three minutes left in their presentations to help them allocate their time appropriately, and then we will have about 10 minutes of questions from Committee members to each of their presenters.

    You also will have the opportunity to ask questions of those that make comments in the public comment period tomorrow, and we will have blocks of time then to come to some consensus on the seven questions that you have been sent as well.

    It's going to be very important that we address each of those questions carefully, and therefore I'd like to make sure that each of you has a chance to review those questions because I will be proposing time limits to assure ourselves sufficient time to deal with each of them in a way that doesn't shortchange any of them, and so we'll be trying to deal with that time allocation later this afternoon.

    All experts, I am told, will be able to stay throughout today and tomorrow for those questions and answers to that if, in fact, in those blocks of time when we're dealing with any of those seven questions, any of the committee members would like to address any questions to any of the presenters, and then obviously that's going to be possible as well.

    You have seven questions.  They have been divided for us in four sections.  One of those sections is on metrics for the evaluation of normal physical growth, a second section deals with comparators for the evaluation of normal physical growth, and a third is on controlled feeding comparators, and the fourth is on changes in infant formula composition.

    Rather than reading each of the seven questions, I'd like to take just a few minutes to ask committee members if you have any questions about the issues that we've been asked to consider under each of these sections so that, in fact, we can be clear what we're being asked to do, and you can have those clearly in mind during the presentations.

    So let's begin with the first in terms of metrics for the evaluation of normal physical growth.  There are two questions under that section.  Do any of the committee members have any questions about points that you're being asked to address?  I'll give you a few minutes to review those, and we can ask the troika to clarify those for us.  I think that's the way you were described.  That was not my word.

    Having had the pleasure to work with them, you'll get informative responses to your questions, I'm sure.

    MS. LATHAM:  At the end of last week--this is Jeanne Latham, the exec sec--we e-mailed to everyone the updated questions, and I just wanted to make sure that you all have those, and if you don't, we will get them to you.  You've got them. . Anybody that doesn't have them?

    [No response.]

    DR. GARZA:  I think they were in the packets today, again, in case you didn't bring them with you.

    Would any member of the staff want to address any questions in this section?

    [No response.]

    DR. GARZA:  I take it, then, that they're clear--Dr. Thureen?

    DR. THUREEN:  Yes, I have one question.  These are metrics for the evaluation of growth between birth and six months of age.  We will be dealing with both term and preterm infants, I presume.  Should they be handled separately?  Because the preterm infants we are discussing I believe growth after post-conceptional age birth to six months, so should they be handled separately or should we just do a general assessment of these evaluations, presumably for term infants, with maybe later adjustments for preterm infants?

    DR. GARZA:  No, I would assume that we will take those separately, but let me ask the staff if they would object, if there's any reason why we shouldn't take them separately?

    Committee members?  So we'll probably do A and B.  Thank you.  That's a good clarification, with preterm being all preterms, low birth weight, very low birth weight, and extreme low birth weight.

    Any other questions, then, on this first section?

    [No response.]

    DR. GARZA:  Then, on the second, on comparators for the evaluation of normal physical growth, let me give you a few minutes to review those two questions and see if there are any issues that need to be clarified.

    [Pause.]

    DR. GARZA:  Any questions on either of those?

    We have one question on the control  feeding comparators.  Let's take a look at that and see if that's clear.

    DR. ANDERSON:  This is Jim Anderson.

    I wonder if I could get a clarification of the difference between the current infant formula plus new ingredient that's listed on the first bullet and the infant formula plus new ingredient with the asterisks on the last of the bullets.

    DR. GARZA:  On the last bullet, right?  So it's listed below are examples of controlled feeding clinical comparators, and I believe the question is can you clarify the distinction between the first and the last bullets.

    DR. TAYLOR:  We're working on it.

    DR. GARZA:  I gather what it meant was that it was a study in which the new ingredient would be used with some infant formula, but that the intention was to market the new ingredient as a component of some other infant formula.

    DR. GARZA:  That was mine or a generic infant formula, where that new ingredient might be added to any formula, so that it would be a generic comparison was the way I read that.  Am I not clear of that?  If my interpretation is correct, so it's a generic--

    DR. WALKER:  Your interpretation is correct.

    DR. GARZA:  So it's a generic endorsement of the ingredient.

    DR. WALKER:  Right.

    DR. GARZA:  Does that clarify it?

    Dr. Walker, would you--

    DR. WALKER:  The instance in which the generic ingredient is added to the new infant formula and the way that you discussed there, Dr. Garza, is a correct interpretation.

    DR. GARZA:  So it's more an endorsement of the ingredient itself, with a test formulation, but as opposed to a specific formula that had been marketed in the past, where a new ingredient would be added.  I understood it to mean a more generalized evaluation, rather than a specific one.

    DR. WALKER:  I think after we have some of the discussions, some specifics of these will be made much more clear.  I think the speakers will address a lot of these issues in detail, and then we can have more questions.

    DR. GARZA:  And I've been reminded that we must each identify ourselves before we ask questions.  I will try to remember, but I'm probably the guiltiest of all.  I would hope that they would recognize my voice before this meeting is over.

    The statement I was supposed to read said, "I am Cathy DeRoever."  I thought I better not say that.

    [Laughter.]

    DR. GARZA:  It would confuse people, and obviously embarrass Dr. DeRoever as well.

    Are there any other questions on this third section?

    [No response.]

    DR. GARZA:  If not, then, the fourth one is the last.  It is also one question with two parts, A and B.  The table that is attached, obviously, is quite informative, so I would ask you to take a few minutes to look at Question 7, along with Table 1, see if there are any questions.

    [Pause.]

    DR. GARZA:  Are there any questions related to this section?

    [No response.]

    DR. GARZA:  If not, what I propose is the following; that you think about 30 minutes for each of the first six questions, approximately 120 minutes for the seventh question.  If we don't need the entire two hours for the last question, we can always come back and address issues that perhaps we might have felt were not completely resolved.

    Obviously, if it's clear that we need more time with any specific question, as the discussions evolve, then we can always go back and try to reallocate them, but we will come to some agreement on how much time to spend with each before we start the discussion this afternoon.  But as you've had a chance to review them, see if, in fact, you feel comfortable beginning with that type of allocation for the various questions that we're being asked to review.

    I don't think that all first six would necessarily take 30.  Some may take a bit longer, and some will take less, but I want to make sure that we don't shortchange, as I said, any specific question, and so we can get agreement on that as a group, going through the discussion and developing some information that would be useful to the FDA I think is more likely.

    Are there any other questions regarding procedures or the charge to the group that either staff or I have failed to clarify for you?

    [Pause.]

    DR. GARZA:  If not, then why don't we begin with the presentations.  I know that Dr. Chumlea was on the bus, so I assume he's here.  We're starting a bit early, but I think that's fine.  I will just introduce each speaker as they come forward.

    Dr. Cameron Chumlea is a Fels Professor at Wright State University School of Medicine in Ohio.  And for those of you in the field of anthropometry, I don't think Cameron needs any introduction.  For those that may not be familiar with his work, we don't have time.

    [Laughter.]

    DR. GARZA:  It has been quite extensive, and he's certainly recognized throughout the world for his work in this area.

    Thank you very much for the white paper and for joining us this morning, Cameron.

    DR. CHUMLEA:  Thank you very much for the introduction.  It's a pleasure to be here this morning, and I hope I can provide the committee with some information that's appropriate.

    First, I'd like to just simply recognize my co-author, Dr. Shumei Sun, who I know is familiar to many of you and point out that she has just recently become Wright State's Brage Golding Distinguished Professor of Research.

    So, first of all, I'd just like to point out that growth is relative, as you can see from the slide here, it says, "I keep track of my son's growth, which is going up the vertical scale, and my husband's growth.  Frank is age 30, 33, 35, 40."  So growth goes in various directions for all of us here.

    The second thing here is that we're dealing with infants.  Of course, there's our perspective of what infants are, but there's also a public perspective, and I saw this checking out the groceries and decided it really deserved a slide because clearly this, to some degree, is maybe the public's viewpoint, which you can clearly see.  It's amazing what you can do with PowerPoint these days and some slides that are available.

    So what I'm going to do this morning is just basically cover some brief information that's probably familiar to everybody, so that we can just kind of all come up to speed.  Clearly, infancy is a period of rapid growth, and to some extent, this is probably the most difficult group of individuals to measure.  The only other group that's equally difficult is at the opposite end of the age range, but this is generally a very difficult group, but it's also easy in the sense that there's also very few actual measurements that can be collected from them that are really going to be useful.  Weight, recumbent weight and head circumference are the three that are the most important and the ones that should be taken.

    Just to review, weight clearly measures the growth of all body tissues, recumbent length describes the amount of linear growth because we're dealing with both increase in mass and increase in size, and then head circumference reflects brain growth because this is the period of time, the first few years of life, when the brain actually does the majority of its growth.

    This is a period of time when body dimensions increase at a greater rate than in any other period in life.  Weight increases between birth and six months about 115 percent, length increases about 34 percent, head circumference increases 22 percent on average.  The rate of growth in weight ranges from about 1.1/1.2 kilos for boys or girls at one month of age, but then we have to remember that we're on a growth curve here, and then it, of course, starts to slow down, and by six months it's running around a half a kilo a month for boys and girls at six months of age.

    The rate of growth in length is about 3.5 to 4 centimeters per month for boys and girls, and it slows to about 1.5 to 2 centimeters a month for girls and boys at six months.  Just as a reference, the adolescent growth spurt between, say, you know, 12 to 16 years of age, the maximum amount there is only somewhere in the neighborhood of 5 to 8 centimeters a year.  So here we're looking at 3 to 4 centimeters a month.  So they can just put that in comparison because everybody kind of focuses and remembers how much growth their kids did when they were adolescents.  They frequently forget that that rate was a fraction of what they were really doing in the first few years of life.

    The assessment of status, once we've collected measurements, we really need the measurements to be accurate and reliable, and this is really a very critical point, particularly in this particular age range.

    The measurements are really not difficult to take, and there are a variety of mediums in which they're now described.  NCHS produced a video at the end of NHANES III that describes these and all of the measurements that were used in NHANES III on that videotape.  WHO has an in-house video that describes the measurements that are being used in their multi-center growth reference study.  I don't think that one has really actually been distributed yet, but it is available if you can talk to the right people.

    These are also all very similar techniques.  They are also probably being what's currently being done in the current NHANES, and they're all the ones that have come out of the Anthropometric Standardization Reference Manual from 1987, by Lohman.

    When we collect the measurements, this is a point where frequently things get skipped over, and the reason things get skipped over is because there's lack of time, money, personnel, et cetera, but it's the part that's really very important, in terms of collecting the information because, one, we're going to either plot those on a growth chart or refer to status, and the other aspect is what's very important here is we're going to calculate increments of rate of growth, and there you're compounding your measurement errors.

    It really takes two people to measure an infant appropriately.  I'm sorry, folks, but that's really the correct way to do it.  It can be done with one, but that's going to add to the errors that are going to be involved.  You need an examiner who's going to position and take the measurements, and you need a recorder who's going to be writing down the measurements because one person is holding the infant and trying to do the measurements, and they really don't have time to write anything down, and then they need to switch roles because we need to take double measurements here so that we can get as much information as possible.

    Also, what we'd like to do is have the technicians compare their values.  One, this just simply catches transposition errors that occur frequently because people write down numbers in different ways sometimes.  All of the studies that I've described to you have allowable differences between what the measurement values can be between the technicians, again, to control for errors and just to control for variabilities that can occur.

    Did I skip one?  How do I go back on this?

    [Pause.]

    DR. CHUMLEA:  Weight.  An infant can be weighed alone or they can be weighed while the mother is holding them.  It depends a little bit on the situation, but, frankly, I would prefer that the mother hold the infant.  You can weigh the mother, weigh the mother holding the infant, take the subtraction and you're going to get the weight.  The reason I like the idea is it keeps the baby calm, and it provides for a very stable piece of information.

    The infant can be weighed alone, and there are a variety of electronic scales, but pretty much once you take the baby out of the mother's arms and you place it on something else, it starts moving around.  Fortunately, the manufacturers of several of the scales now can compensate for this weight so the stability of the measurement is much better than it used to be.

    It's best that they be weighed nude.  Blankets, et cetera, are available.  However, if they are going to be in undergarments, I think NCHS subtracts about a tenth of a kilo from that for the readings, and spring-type scales and beam balance scales are simply not appropriate for use any more.  The electronic scales are much more available.

    There's a company called Seca that makes some very good scales.  We've used those.  WHO has a really nice platform scale that actually pares the mother's weight.  I don't know exactly what the manufacturer is, but it's a really excellent device.  I think it's been specially made for them, but there are a variety of scales that are available for use, but my preference is that the infant should be held with the mother or the caregiver, whoever is there, and then subtract the weight, if that's possible.

    This is the one nobody really likes.  It's recumbent length.  It takes two people to do it, again.  A variety of different pieces of equipment that are useful for doing this.

    It requires one person holding the infant's head.  It requires the mother, the caregiver standing there beside the infant reassuring them that nothing is going to happen, and it takes a third person then to position to footboard up against the soles of the foot of the infant.  You're holding the head so the child is looking straight up in a vertical Frankfort plane, and it takes then another person to hold the legs, both legs, for the infant, if it's very small, and get the length, and by the time they're six months, generally, the best you can sometimes do is grab one leg and try to hold the other one with your little finger and get it.  It's not easy to do, and it's one that can be particularly prone to error, and it's also important that the kid stays straight down the table.  I think I've covered what's there.

    This simply gives you a description.  I think this child is about two, the age there, but again positioning the head up against the headboard, keeping the legs straight and keeping the feet straight up in terms of taking the measurement, but clearly it takes two people to do.

    Head circumference, it should be measured with an inelastic tape of fiberglass, metal, something like that that's good.  It's really best that the infant is seated in the mother's lap.  I don't like it being done with the infant.  I've seen that done.  I like the kid up in the mother's lap, which requires then the person getting, who's taking the measurement, to get down beside the mother.  This allows the mother to cuddle the infant, keep him quiet, and you can slip the tape over their head and get the measurement before they pretty much know what happened to them.

    It's placed right across the front of the skull, and it can be quickly moved up and down the back of the skull.  The insertion tape is a nice piece of equipment that's useful for doing this until you find the greatest circumference, and then you pull the tape tight, and this is something we have to coach people in because it doesn't hurt the infant.  There's no pain involved with it.  They feel a little pressure, and they want to kind of shake it off, but it does need to be tight.

    And you can see here it's just anchored really right over the kid's eyebrows, worked up and down.  Most kids at this age aren't going to have quite this much hair, and so it's generally pretty easy to get this one from them, particularly, again, with the child being comforted by the caregiver.

    Now, what are some other measurements that could be taken?  Well, there's really a bunch of them, but I don't think those are really going to be really appropriate in the instance here.  Crown-rump length is sitting height.  Crown-rump length, I think that was pretty only used in children with special cases.  Chest circumference, limb lengths, one that's potentially possible is skinfold thicknesses.

    The problem with all of these measurements is they really kind of have a restricted utility in terms of describing normal or healthy growth.  They're frequently prone to high measurement errors, and there's really a limited amount of reference data available for all of these measurements pretty much within the age range that we're looking at, six months.

    If you're going to do something like a skinfold.  The skinfolds you're going to take are going to probably be triceps and subscapular.  Now, again, you're bringing two people into the program, if not three, and you've got to go to landmarks, and to do a triceps, you've got to find the midpoint of the arm, which means you've got to measure it, and make that determination, and you can see clearly it's requiring one person is holding the child and the other person is taking the measurement, and then you're going to have to go and take the measurement.

    The question comes up there in terms of equipment.  Skinfolds are dependent upon the type of caliper you're using.  There's two major brands, really.  There's a Lang and a Holtain, which you see here.  NCHS and WHO are both using the Holtain caliper.  I think the Lang is still used out there, to some degree, but I'm not that familiar with it any more.  There are differences there.

    Really, if you're wanting to do skinfolds, the question is what do you want to get out of that?  And probably what you wanted to get is total body fat, and there's probably, I think Dr. Ellis is probably going to talk about better ways of doing that today than taking skinfold measurements.  I don't really feel comfortable in doing it on anybody until they're about two years of age.  It's just difficult to do.

    Now, there's indices that can be used from the information that's collected.  BMI is the one that we all get informed, and just for your information, I'm 28, so you can kind of put that in reference.  I always think everybody should, when we have meetings like this, they should always walk around with their BMI on it, so we'll all be honest about this folks.

    [Laughter.]

    DR. CHUMLEA:  The problem with BMI is that, in infants, you've got 25-percent body length is composed of the head, so that throws off the proportionality aspect.  The relation of BMI with direct measures of body composition in infants hasn't really been established.  Weight for length is probably a better descriptive indices of relative leanness adiposity within children.

    Measurement error is very important and needs to be paid attention to.  The catch here is that the error which may be small is actually going to be very large because of the small size of the child that you're measuring.  So you really need to pay a tremendous amount of attention to error in measuring infants.  Of course, they can have a tremendous impact on the interpretation, if you're going to go growth increments.

    We need to get good-quality equipment.  Measurements should be taken.  If they're taken on a daily basis, the equipment needs to be calibrated.  That includes scales.  People forget that scales can go out of calibration, and then particularly the technicians need to be trained in a standardized way of taking the measurements.

    We need to collect inter- and intra-observer reliability.  Quality control is really important, particularly if there's going to be more than one center used to collect information because we need to control for inter-site differences.

    Measurement schemes.  You need baseline, an interim and a final.  I really like something that's going to be getting a measurement at 1, 2, 4 and about 6 months of age is my preference for collecting things, generally, starting after about 10 to 14 days.  With measurements at 1, 2, 4 and 6 months, you're going to get a good accountability of weight measurements over that period of time.  Clearly, if you can collect more measurements, the more measurements the better.  I'd be very happy to have those.

    I'm going to cover just very briefly growth increments, which are going to be calculated from the repeated measures of growth, and there are charts that these can be plotted on from birth to 12 and 3 to 6 months of age, which are examples here.

    These are from Fels data, and just contrary to popular opinion, the majority of Fels infants were breast fed for at least three months, exclusively, so that has been reported.  We tend to not get a good press on that.  I just want to kind of correct that.

    In terms of growth velocity data, there's the Fels data.  Also, I'd like to point out that WHO is collecting longitudinal data from its multi-center growth reference study, but this data and report from that study has not been available, and that'll probably be given later.

    So recommendations, from what I've just described to you, weights should be measured I think at 1, 2, 4 and 6 months.  I'd like to see recombinant length and head circumference at the beginning and end because it just gives you additional information on the quality of the size of the infant.  Close attention needs to be given to methodology and errors.  Two technicians are really important and reliability data needs to be collected, and use of existing increment charts until the WHO charts are available.

    So thank you very much.  I'd just like to say, personally, the last time I had to give a paper in front of Dr. Briley, she gave me a B--

    [Laughter.]

    DR. CHUMLEA:  --which is about 25 years ago.  So I hope I did at least that good this time.

    Thank you.

    DR. GARZA:  We'll take that up at the break, I guess.

    DR. CHUMLEA:  Okay.

    [Laughter.]

    DR. GARZA:  Thank you very much, Dr. Chumlea.

    Are there any questions?

    DR. SIGMAN-GRANT:  Sigman-Grant.  I have a question.  You recommend weight starting at one month, and you talked about the regain from the loss from birth weight.  So much is happening in that first month.  Why don't you measure it before--

    DR. CHUMLEA:  I said that, really, as early as, say, 10 to 14 days.  I would like to see it done that way, but within no later than one month of age.  So let me kind of restructure that between--

    DR. SIGMAN-GRANT:  Why not between--why not the first week, instead of 14 days?

    DR. CHUMLEA:  Well, there's a shift in weight, as far as I'm familiar with, that's supposed to occur after birth, and so I think, just my understanding is, that there's a period of time within the first week or State or local that the infant basically kind of stabilizes after the birth experience.  Now, Dr. Fomon, I'm sure, could give you more information on that, if I'm incorrect on that.

    The more measurements you can get out of this, the better.  I was being, trying to give you what I think is the very minimum that you have to collect there.  You can measure them every week.  That would be fine with me.

    DR. DOWNER:  Goulda Downer.  I understand, when you talked about examiner variability and possibly downright error, but can you talk a little bit more about why you don't think that subscapular skinfolds and triceps skinfold are important--

    DR. CHUMLEA:  Useful information?

    DR. DOWNER:  Yes, at this juncture, because I think it is.

    DR. CHUMLEA:  The reason I don't like them is this.  First of all, they're extremely difficult to collect in children at this age, so the amount of error that's in the measurement is extremely high.  The question I would have is that what information are you going to get out of this particular measurement that you're not going to be getting by bodyweight alone?  Because if weight is going up, the skinfolds are going to go up; if the weight is going down, the skinfolds are going to go down.

    So the question that you're really interested in is total body fat, and, yes, 90-some-odd percent of total body fat in a child is principally subcutaneous; that if you want to go total body fat, there are now better ways of doing that, such as DXA, that I think are going to give you the information that you really want.

    If you do go and collect the skinfolds, then you're faced with some reference values that are useful.  There's only two that are out there that are fairly good--what's available from NCHS and NHANES III, and then Dr. Fomon's data on skinfolds.  But outside of that, there's really little other reference data that's available.

    So I guess if I'm going to, what I want to know is total body fat, and if I want to measure total fat, I'll go measure total body fat with something that's going to give me I think better, and more accurate, and reliable information about the child than I'd get from the skinfold.

    DR. HEUBI:  Jim Heubi.  I don't want to misunderstand what you're saying, but you're not recommending that people weigh infants in garments and subtracting--

    DR. CHUMLEA:  I'm sorry, what?

    DR. HEUBI:  You're not recommending that people weigh infants in garments and then subtracting one-tenth of a kilogram for the--

    DR. CHUMLEA:  I would prefer they be weighed nude, yes.

    DR. STALLINGS:  Stallings.  To follow up a bit on the other question.  We're beginning to think I think about looking at infants who are growing too slowly as the historical way of the failure to thrive related to this, but the issues of growing too fast are also of concern.

    So, to go back to the question, could we get most of that information with weights and heights, and weights for heights, rather than looking for data related to adiposity, or if we were looking for excess growth, whatever that concept means, what would you recommend, derived values from the anthropometry or DXA?

    DR. CHUMLEA:  I think--we're still sticking between this birth and 6-month range, and I think if you're getting excess growth, you're going to, you may need to take more frequent measurements so that you can plot and get a better description of the curve as what's going on there.  And if you have more information, then you can discriminate between the children who have excess growth and those who don't.  So that's I think something that's important to consider.

    I think you will get everything you want from weight and length.  If you go to DXA with a child at this age, and Ken will address this more I think in his talk, you're going to get, you know, fat, lean and bone, and the fat is probably the most important aspect here that you'd be concerned about for excess growth, but it's going to be described in weight, also.

    So unless you're wanting to tease a tissue out and say, okay, we're really concerned about the increase in fat here, in addition to the increase in weight, then, yeah, then something like DXA I think would be important.

    DR. STALLINGS:  Follow-up.  The velocity, then, would be what we would be looking at, more than just attained weight?

    DR. CHUMLEA:  I think you have to do both of them.

    DR. STALLINGS:  We, historically, are always looking I think at the attained weight.

    DR. CHUMLEA:  Yes, and you'd have to include the velocity in there because the velocity would, these children should, I think, potentially have much higher velocities.

    DR. STALLINGS:  And that might be a way of discriminating between the concept of normal growth and excess growth?

    DR. CHUMLEA:  Yes, right.  You could have children, let's put it this way, who have, say, after, say, three or four months, when their velocity should be declining, these children might not be declining as what the average is, so they're still obtaining a rather high velocity of growth at that point.

    DR. THUREEN:  Thureen.  I would argue that DXA is not a very useful body composition measurement for most studies because it's not that readily available, especially as a field tool, and I think that a lot of people are now starting to do more caliper measurements for assessment of body fat, even in very tiny infants.  And certain people, like Suda Kashyap, have gotten very reliable measurements over time.

    Do you think that the data from the NHANES study on body composition, using anthropometric measurements of body fat, was not useful or do you think it is useful?  And if you want to do large population studies, do you think that there is a future for caliper measurements?

    DR. CHUMLEA:  The DXA thing you can talk to Ken about.  I'll let him address that when he gets up here.

    There's two issues here, whether we're talking about small studies or large studies.  If you're wanting to do large-scale studies, population studies, like NHANES has done with NCHS, then collecting caliper information is going to be what you can do because there can be limits to what you can collect for DXA, particularly because of the issue of radiation exposure, although it's very minimal, even within the current NHANES, where they have DXA machines in all of the trailers, I think the limited age there is Age 8, from what they're collecting, although they technically have the availability of doing it in those particular studies.  Other people clearly don't have access to such expensive pieces of equipment.

    If you don't have access to that, then I'm not opposed to collecting the skinfold data.  The issue comes up that it is extremely difficult to collect accurately and reliably, and so, I guess, I kind of am in favor sometimes of no data is better than bad data, and I know that, in collecting it, it is something that people have to pay very close attention to, the technicians have to be very careful, and this is frequently something that in the course of studies, we pay lip service to it, and there's good attention, but these things do tend to fall out.

    Now, in smaller scale studies, this is something that can be done, and the information can be collected.  Overall, I'm just not that happy with the information.  Yes, there are studies where it's been done very well, and so I'm not putting those studies down at all.  I'm just talking about, in general, my experience has been, in collecting from infants in this age range, that this is really hard to do, and when it's something that's hard to do, it doesn't sometimes always get done the best way.

    DR. THUREEN:  Thureen, one more question.

    In your opinion, if you're looking at a growth outcome study, do you think incremental data are the gold standard--growth data are the gold standard, attained growth are both critical to an outcome study?

    DR. CHUMLEA:  I'm assuming, when you say a "growth study," you're going to be collecting repeated measurements from the same children, so you're going to have both pieces of information available there.

    The status value simply describes where the children are in reference to whatever reference values you're using for peers at that age.  That simply tells you that they're at certain percentile levels, but at the same time children also grow at different rates, and so there's a distribution of the rates at which they grow.

    So children who may appear to be at one percentile level, their rates of growth can be a different percentile level, so it gives a much clearer picture upon what's available.  And since any study where you're going to collect repeated measurements, you're going to have all of that information available to you.  So I would take advantage of it.  Again, the errors are difficult to control and need to be paid attention to for collecting it.

    DR. GARZA:  Cameron, I have two questions.  Given the fact that we are going to be providing advice to the FDA on the approval of specific formulas, how many measures do you recommend be taken if, in fact, one has an interest in the pattern of growth?

    DR. CHUMLEA:  If I was going to design the study, and you're not going to restrict me to what I want, okay.

    DR. GARZA:  From your perspective, if you're going to be protecting the public health and infants' health, what should the American public ask?

    DR. CHUMLEA:  I'd want a birth weight.

    DR. GARZA:  Birth weight.

    DR. CHUMLEA:  Clearly.  I would like it at, say, two weeks, one month--I'd like it again at two, and then at four, five, and six maybe, something like that.  The more measurements I could get out of the thing the better.

    DR. GARZA:  But you think that with seven measurements, one would be able to assess both the pattern of growth, as well as velocity of growth, at those specific time periods.

    DR. CHUMLEA:  Yes.

    DR. GARZA:  And you mentioned there were various sources of error.  Is there any consensus that FDA could rely on that deals with the nature of the equipment, the type of calibration that should be insisted upon, the training that obtaining the measures should be able to obtain and document, and the--those would be the three: equipment, calibration and the training of the technicians.

    DR. CHUMLEA:  There's a little bit of information about equipment errors and a little bit of information about the inter- or intra-observer errors for collection of measurements in the Anthropometric Standardization Reference Manual that's collated in one location.

    There's other pieces of information that are clearly scattered around the literature that are available.  From NCHS, there's really a limited amount of information.  There was really limited error data that was collected in NHANES III.  What's available from NCHS is principally from the earlier NHANES studies, and NHES.  So there's not much there.

    The techniques are described in a variety of locations, but there's not really anything that I know of that's really written down that says, okay, you can refer to here, and this is what you should do, in terms of training, collecting the measurements, et cetera, in one central location, no.

    DR. GARZA:  Any other questions or comments?

    [No response.]

    DR. GARZA:  Thank you very much.

    DR. CHUMLEA:  Thank you very much.

    DR. GARZA:  We'll move on, then, to the next presentation.  It's a topic that has already come up, body composition assessment in early infancy.  Dr. Ken Ellis, from the USDA/ARS Children's Nutrition Research Center, with Baylor College of Medicine.

    Again, Dr. Ellis, thank you very much for joining us.

    DR. ELLIS:  Thank you.  What I'm going to present today is a probably a little different from what most people have had experience seeing.  Some of this--in fact, all of this is going to be body composition beyond simple weight.  If one was interested in what the composition of weight is--let's see.  Which button do I...this is the laser, okay?  And which one of these is--can you hear me now?

    [Pause.]

    DR. ELLIS:  This is supposed to move the slides, she said.  I'm so used to pointing to the screen these days, so it's my fault.

    As you already heard, most people--or most of the work, at least in infants, has been with weights and heights, all kind of weights and heights charts.  But body composition, at least the first models that were attempted, the basic classic 2-compartment model is to measure--is divided in two compartments, fat and non-fat.  The direct measurement of body fat is really very difficult to do.  It's not an easy process to do that.  And so for many years what we did was we said that if we could measure precisely some parameter of the body that would represent the fat-free mass, then subtraction of the fat-free mass from the total weight would give us a measure of the fat mass.

    Part of the problem with that is that all the years in the  (?)  scale for the fat-free mass translate directly to the fat mass, and you'll see what I mean by that in a few minutes.

    Three classic methods that have been used for 50 years, or maybe even longer:  underwater weighing in adults--you can't do this in babies and infants.  Practitioners as well as parents tend to object to holding babies underwater.  Hydration, this is probably the more common thing you'll find in the literature that is done.  It's the dilution technique.  You give a tracer, collect the blood sample or some fluid sample several hours after that, then do some manipulations on that,  (?)  space to get water, and make some assumption about how much water there is in the fat-free mass.  And as we all know, hydration content of the fat-free mass in children at very early ages changes dramatically.  So depending upon what you assume, you can then quickly be off in your estimate of the fat.

    Whole-body counting is another method used to measure potassium content, primarily the body cell mass.  And, again, how much that relates to the--how much of that is a constant fraction or not of fat-free mass at these ages has also been questioned.

    So, again, like I said, the difficult underwater weighings, difficult to do the infants.  For the water measurements, they must swallow all the tracer, collect some kind of fluid sample.  Plasma is your best choice.

    The problem with this is you can't repeat it.  If you want to do the trial a week or so later, you can't.  You need to leave a sufficient amount of time for the previous tracer to clear, or you start increasing the doses of the tracer to compensate for that.  And, again, in the past, the most accurate assays required one to have a mass spec or availability of a mass spec, which is not in everyone's laboratory or garage, as I usually say.

    Whole-body counters, the problem with those have been over the years really they haven't been designed for infants.  They're really designed for adults.  There are a few of us who have done this, but in general they simply don't exist.  And even if you do have a whole-body counter available to you, most of the time it's not in a clinical setting.  You're going somewhere else to get the measurement done.  Again, the reason that tends to exclude it, at least for infants and children.  But one nice feature is you can repeat this as often as you wish, and so you could do it on a daily basis if one chose to do that.  So if you have access to a counter, one can count these infants as frequently as one chooses.

    And because of the way in which the counting procedure works, there doesn't have to be any really significant constraints.  They can move around and be--it will not really affect the results that much.

    Now, I looked at the various things that we received in reference to this meeting, and there's the document in there from the American Academy of Pediatrics where there's one little paragraph on body composition.  This was in June of '88.  And it says, "Normal growth implies appropriate composition of the increment in body weight.  Sequential measurement of various aspects of body composition"--such as water, fat and bone--"have the potential for defining changes in body composition."  However, at that time the opinion of the Task Force was that such measurements have not yet reached the stage of precision, non-invasiveness enough, and they're not that very convenient, and which I've just showed you that sort of in the three previous procedures, which I would agree in 1988, I would agree with that statement.

    That gives us a quick summary of that.  It just says you want to be able to--that the weight is appropriate composition, you will be able to do longitudinal measurements.  Again, for clinical testing, the precision, noninvasiveness, and convenience are the issues that they put out.

    Again, the 2-compartment model, I quickly talked about the limitations.  The density is not constant.  The hydration is not a constant.  Extracellular and water ratio is not constant.  Bone accretion is not constant.  Basically babies aren't constant.  We know that, right?  Because we know that when the baby comes back a month later, it's not the same baby we saw a month before.  It's a whole different child totally, at least from a body composition point of view, unlike adults, which really change very slowly over time.

    Now, so what's happened in pediatric body composition research since 1988?  There are sort of three general areas where there have been advances made.  One is bioelectrical techniques.  In pediatrics, I want to take a broad sense of pediatrics.  That's anybody under the age of 18.  And there's been quite a bit of work used with this technique in older children.  When it has been tried or used in infants, it has not been very successful at all.  In fact, in general, most conclusions with this technique--I should say the first two techniques, the bioelectrical impedance and the bioelectrical spectroscopy, which is the same as this but at two frequencies, have been that the information gained from those two techniques really hasn't been much more than you already knew when you had just simply weights and heights with these individuals.

    TOBEC, on the other hand, has been more successful, but the problem with TOBEC is that these machines are very--they're not common, there are not many out there.  There's probably not more than a dozen--a half a dozen, in fact, for infants.  And so, again, this is a technique that holds promise or has held promise, but, again, it's not a technique that is widely available at all.

    Absorptiometric techniques, DXA.  Remember that Academy of Science report?  It was in 1988.  Well, in 1993, the world of X-ray absorptiometry changed because at that time it went to what's called DXA.  It went to X-ray sources, different detectors.  The whole technology advanced substantially such that one could now consider this technology for whole-body measurements in infants, and one can do a localized region, such as the spine, if that's specifically what one would want to do.

    Another area, body volume measurements.  Remember, underwater weighing doesn't work.  This is an air displacement plethysmograph technique now, which is just actually started this year, so maybe in another few years, when this committee reconvenes, we'll be able to talk more about this.

    I'm going to basically focus on the absorptiometry or DXA or DEXA methodology because that right now holds the best promise for this kind of broad application.

    So the basic model, again, in 1988--I mean, there are more models, but the very basic model, again, we're talking about fat and fat-free mass.  And today, when one talks about body composition, one is almost forced pretty much to really address this kind of a model over here, 4-compartment model:  fat, bone mineral, mineral composition, ash, the water, and protein content.  And, in fact, it is these compartments that we're interested in when we look at body composition at any age and look at change in body composition at any age.

    So that 4-compartment model is now shown on your left, and what does DXA provide us?  DXA provides us--the only method we have that provides us a 3-compartment model for a single measurement.  We have a measurement of fat, which is directly the fat.  We get a measurement of BMC, which is bone mineral content, which is for the mineral compartment.  Eighty percent of this is this; there's another 20 percent which is distributed in the non-mineral, non-osseous compartments.  Then a third large compartment called the lean tissue mass, which is the non-bone, non-fat compartments.

    So when we do a DXA measurement of anyone at any age, we get this basic model, which has been a huge advance.

    This is what an image looks like in a child.  Most of the time people show images of adults.  Adults are very nice.  They hold their arms right.  They put their legs down straight.  Children, amazingly, 14 years of age, haven't learned how to do that yet, have they?  So they tend to lay the way they want to.  We do make measurements in children at our place.  We've done probably, I don't know, 600, 700 DXAs, at least.  Maybe a thousand.  I really don't know the number these days.  Quite a few studies.

    Let me show you, again, if one is interested in just the spine, this is not an infant spine.  It happens to be an adult image there.  But one can localize and make a measurement just at the spine for bone.  But for body composition work, we do total body measurements, which is shown here.  I think you can get an idea of the skeleton seen.  I think you can see the soft tissue parts that are obviously not the bone, and we can then get that information.

    Now, as you heard earlier, the BMI in terms of the height of heads, if one chooses to, one can actually decapitate the image and just worry about this part if you're concerned about how does the head contribute to all this information.  So it is possible to do that kind of stuff.

    By the way, the time it takes us today--in 1988, if you attempted an infant, which you would never be successful at, it would be 25 minutes.  Today we do an infant in less than 3 minutes with the newer scanners.

    So DXA, what are the advantages and some of the disadvantages?  First, DXA has almost achieved a reference status within the body composition field.  It still has some improvements to be made, but, again, in terms of everything else, it is the better technique that we have.

    Advantages, it does give us good precision and accuracy.  It is the only technique for a single assay that gives us basically a 3-compartment model:  bone, fat, and lean.  As you can see from that image, we can get some regional information if we choose to do that.  It has a very low exposure risk.  There's a very minimal amount of that.  And there are more the reference populations out there for adults, for children, and they are being developed, and several for infants, if you know the references to look for.

    The disadvantages, very low exposure risk, the same thing.  One could argue whether it's an advantage or disadvantage.  One of the ways I talk about this--in fact, I just thought about it coming over here yesterday on the plane--was on the flight I was on, there was at least five children under the age of 2 on that flight with me, and the radiation dose they got on that flight exceeds what you get from the DXA.  In fact, it's two to three times higher.  So there's an idea what the risks are involved.

    Scanners are not optimized for infants.  If you get a scanner, it's adult size.  They have not--the industry has tended to resist this, primarily because of the market that they are focused at, which is osteoporosis in older women.

    It doesn't give us a 3-D image.  You saw that 2-dimensional image.  It gives us 2-D not 3-D imaging.  That's what I call boot-strapped 3-C model.  It's not a perfect 3-C model, but it's not bad.  And the one different problem has been that the results differ between manufacturers, so that if you do a study--a multi-site center study, you want to stay with the same instrument, same software.

    Okay.  Precisions and accuracy of different body composition measurements and the minimal detectable change in an infant.  For this I chose basically a full-term infant and made the assumption it's 15 percent body fat.

    If you look at these methods, the water, the dilution method, or the bioelectrical impedance method or TOBEC, this is the potassium one here.  DXA, the bottom three, the fat, fat-free, and the bone, precisions, these are optimistic.  Precisions tend always to be a little bit better than--whenever you do a precision measurement study, they always do much better than they do random.  I'll guarantee that.

    This shows you the precision measurements here.  This is generally the accuracy.  Precisions are, let's say, in the 1 to 5 percent range or 2 to 5 percent range.  Accuracies tend to be in a 3 to 5 percent range.

    If you take these, this information, and take this size of an infant and you translate those into one of the minimum detectable changes for that infant, the values are shown here, the last column on the right.  And the percents are those percents of what that person had in terms of the composition at that age.

    So one can measure water, changes at 5 will start to show up.  If you use (?)-ium, they can get worse, TOBEC or BIAs.  Fat-free mass, 125 gram changes, only about 5 percent of the total fat-free mass.  Forty grams of fat, if that starts to change, it's 8 percent of that to implement this weight and composition.  You can start to see changes relatively quick with the single input.

    This shows the relationship between precision of the methods and what kind of a change, minimal change is required for that to become statistically significant, at the 5 percent level and a power of 0.8.

    I call this the clinical application in individuals, what I consider to be a clinical application, what's changed in that individual.  And if you look, for example, this is the relationship that the minimal detectable change--this is approximately 3 times the precision.  And so that if our precisions for BMC are somewhere around 2.5 percent, 6 percent change would occur, lean tissues at about 10 percent, and fat-free mass at around--if the precision is 4.5 to 5 percent, we'd have to see changes in the range of about 14 percent to be significant for that individual.

    Now, this I show you because this is the difference between 1988 and today.  I only got 30 seconds left?  Oh, well.  Well, okay.  Very quickly, this shows you the methods.  This shows you what the precisions are for FFM, fat-free mass.  If you translate those into fat-free mass, this is the tail here.  The top three were in 1988.  That's why you couldn't do it.  DXA is hugely improved since then.  This shows you the kind of weight gains that would have to occur in infants at, I guess, again, the standard term infant here, very small weight change with DXA, large weight changes for--I'm out of time.  She says zero.  Anyway, the--I'll keep on going anyway.  And the number of weeks would have to be changed in that individual.

    I can't believe I've taken up all the time already.

    This just shows you the rates of change that occur with age.  You can find this from several different sources.

    This shows you the first six months where you have to figure out what the rates of change are in the composition, and here's a series of papers that are on infants.  There's one on TOBEC.  This one used a series of methods which we can do.  All the rest you'll notice, with the exception of one here and dilution, were done with DXA.  The weight ranges are shown here.  The number of infants are shown over here.  We are now doing a meta-analysis to bring this together into one common reference database.

    And I want to just quickly go through these, again, precisions, 2 percent, 3 percent, 6 percent reported here.  And if you did a calculation--this is an interesting study in twins.  They looked at the weight difference, which is about 14 percent, and they calculated again with these alpha-5, power of 0.8, 40 to 45 infants would be needed to detect a 15 percent difference in one or more of any of these three compartments.

    Another paper here, let's see, this particular paper does give percentile curves for each of these values as a function of weight.  And, again, what I'll point out here is that even though you may have a 3.5 kilogram weight infant, the fat range can range from 10 to 26 percent.  By the time they're 10.5 kilograms of weight, it can range from 22 to 23 percent.  So weight does not represent fat.

    This paper is another one that has percentile curves of each of these compartments versus weight.  Again, comparable to the other one, actually these are lower fats.  Interesting, this is an European study.  The previous one was a U.S. study.

    And I want to show you--this is the last one, slide here, and I'm a minus five I think now.  But here this shows the changes--total body DXA.  These were preterm infants.  Initial weights were about 17--under 1750.  This was fortified human milk formula, and this was a preterm infant formula.  And the baselines are measured at 3 weeks of age, repeated again at approximately 3 to 4 weeks later.  And this shows the statistical ability to measure changes in body weight, for example, and in the fortified human milk you can make--you can change--see the difference at this level.  And the preterm formula, it's about 19.9 grams per kilogram per day.  Differences were four, and this is statistically different.  You can see these--in other words, you can compare in groups of 20 versus 30, you can see differences in weight, you can see differences in lean mass and fat mass and bone mass by DXA at three weeks between these two groups of children.  So it is possible with DXA to measure not only weight but the composition of that weight change in relatively small sample sizes.

    And if you want to convert those to growth kind of numbers, this represents about 2.3 grams per kilogram per day in terms of growth, and that's about 12 percent of the mean weight gain in terms of composition.  This again is 2.1 grams of lean mass per kilogram per day, 15 percent, about 1.2 grams per kilogram for fat mass and 76 milligrams per kilogram per day for the BMC.

    So the point here is that we can measure--I think there was a question, could we measure changes of composition that would be comparable to 3 grams per day?  The answer is--from this study the answer would be yes, we could do that.  Again, relatively small sample sizes of 25 to 30 children.

    I'll end there.  Thank you.

    DR. GARZA:  Thank you very much.

    Any questions or comments?

    DR. MOYER-MILEUR:  I have a question.  When you do your measurements in your babies, are they sedated?  Because we find that we require sometimes more than two technicians to keep a baby quiet to minimize the movement artifact.

    And my other question is, with preterm babies, we found it somewhat difficult to do early measurements because of equipment artifact, that they have leads and monitors on that make it very difficult to get a true assessment using DXA.

    DR. ELLIS:  Yes, two things.  One, none of the infants that we measured and none of the infants that any of these studied were sedated.  These are--again, they're all healthy children.

    Our experience has been if you feed them right before you want to do the measurements, they tend to be rocked in the chair by the mother or someone, they go out, and then you can make the measurement pretty easily.

    I don't know which machine you were using.  Was it a 4500A or 2000 or--

    DR. MOYER-MILEUR:  We have a 4500A, and we also have--

    DR. ELLIS:  You should do it in three minutes or less.  You do have to work at it.  I mean, normally with older children who will cooperate, that could be anywhere from age 5 to 18, depending upon what you're looking for.  But those children can get on a bed and will cooperate and can be--and they'll do it.

    Here you can do the whole procedure in ten minutes or less.  Here sometimes you have to spend as much as an hour to get the one measurement done.  You have to work at it.  But none of these children were sedated.  We don't sedate any children in any of our studies.

    DR. MOYER-MILEUR:  Yes, and I just, you know, would caution DXA in the infants in that it requires people with specialized training so that you can't just--

    DR. ELLIS:  Yes, yes.

    DR. MOYER-MILEUR:  --go to a community hospital and get their--

    DR. ELLIS:  Yes, you can't--you cannot send these children to a radiology department even with the hospital because they simply are not experienced with measuring children.  They just don't like it when children show up.  They don't have--infants, they basically will send them back.  They will not--they will not take that hour, hour and a half to do it in.  It takes effort sometimes.  Sometimes they go right on the bed and out.  It's always the ones that show up at 4:30 that take the hour and a half to two hours, though.

    As far as the artifacts, you're right.  You have to be careful about artifacts.  You can delete those off the images, though, pretty well.  If you take leads out, for example, out to the side, you can delete those right off the images.  And so that's a minor effect if you deal with it right.

    DR. GARZA:  Dr. Denne?

    DR. DENNE:  I was wondering if there are any direct comparisons in infants between skinfold thicknesses and DXA for fat mass.

    DR. ELLIS:  There may be a few, but, again, the issues have been that skinfolds are probably more difficult to get than the DXA.  We have skinfolds in some of our kids, but we just don't rely upon them for anything.

    DR. DENNE:  It would be an interesting comparison to make.  You know, relative difficulty depends on what you're actually used to doing.

    The other question is:  How is DXA validated in infants?  I mean, most of this body composition, you know, was validated against the other techniques which all have their own sets of issues.

    DR. ELLIS:  Yes.  The validation of DXA are done two ways.  One is with animals, small animals have been done.  We have done 73 piglets under the weight of 10 kilograms.  Other people have done comparable size piglets.  And probably if you add everything up, it's probably about 200 pigs have been done over the years at different centers with different machines.

    The pig is not the best of models because, for example, its bone is more mineralized than infant's.  Weight-wise, composition-wise, soft tissue is not that bad.

    The other way we've done it is we actually built phantoms.  We've actually fabricated mock-ups of the human body with parts made from polyester resin, doped with calcium and phosphate--phosphorous compounds to simulate that.  But that's how it's done.

    I have also done cadaver work.  The problem--not the problem.  The situation is that, unlike Elsie Widdenson, today's environment would not allow one to chemically digest the infant body, so we have done that in about 30--more or less 30--these are all preterm infants, and we did that by a technique called neutron activation analysis where we do a nuclear, chemical--nuclear chemistry technique where you measure calcium, phosphorous, sodium, chlorine, phosphorous, manganese and magnesium and potassium.

    And so if I look at the BMC bone versus fat, and if I look at the other ones and make some model--I have to make some modeling assumptions now about how much sodium is in the water, extracellular water and so forth, but they come out pretty well, with the 5 percent kind of accuracies.

    DR. THUREEN:  In the past several years, it's been recommended that at different centers, even if you have the same type of machine, you should do your own phantoms.  Do you think there's enough phantom data out there now that that doesn't need to be done?  Or if you're going to do a multi-center study, do you think that needs to be done?

    DR. ELLIS:  Well, for the multi-center study, there should be at least a common phantom that is going around to all those sites.  One, to do the initial calibration to be certain everybody is within reason of the numbers, and then continue on throughout going for the study.  That's typically what we do in all studies at all ages, whether it's infants, children, or adults.  That's what we do these days.  For multi-site studies, there's a common set of phantoms that go around all the time.

    DR. GARZA:  Dr. Stallings?

    DR. STALLINGS:  I'm one of those other six people in the world that's got the TOBEC, so I agree those, you know, have tremendous advantages.  But I don't think that we would be able to use them.  So I think, you know, bringing us to issue with DXA and how we could use it is an important question.

    Laurie asked one of the big questions that I'm always asked, which is about sedation, and I would agree that, you know, natural sleep and that sort of thing and working in the research setting.

    The other question sometimes is:  How many images do you really have to take to get the one right?  I even noticed on your slide the hand is--

    DR. ELLIS:  The hand was a little off.

    DR. STALLINGS:  And I just spent last week working with DXA and trying to figure out which one had all the body parts there and minimal movement.

    But would you share with the group, you know, how frequently do you need to do two scans or you get halfway through a scan and then you do it again to get a good research quality measurement?

    DR. ELLIS:  I would say it's definitely less than 10 percent that we have to repeat the scans.  It's like you say, there's a technician there.  The image is being acquired while the scan is being done, and you can stop it immediately to start again, as you well know.

    We have some--we have looked at some scans where the infant has moved, but we finish the scan and then repeat the scan again and looked at those.  It has a lot to do with what kind of movement you have.  As you well know, you can--if the child's arm is here--or say here when it starts and here when it ends, you have a three-armed child in the image you end up with, because it was here the first time you scanned through and caught on the second, on the lower case.

    We have found that if we have motion in this direction, there tends to be a minimal effect because you're not changing anything.  You're just moving the slice over a little bit here.  But it's when there's movement like this, a flapping of the arms or kicking of the legs, if they're doing that, we don't scan them.  We stop.  But that's usually what happens, they wake up or something.  Less than 10 percent.

    DR. GARZA:  Any other questions?

    [No response.]

    DR. GARZA:  I have two, Ken.  How well described are the specs of equipment that one would need to be able to measure infants reliably?  Is there pretty much a consensus on the quality of the equipment, the DXA equipment that would be needed?

    DR. ELLIS:  You'd have to have something that's equivalent to what's called--there's basically two manufacturers in this country.  One is Lunar, the other one's Hologic.  You have to have at least the DPXL for the Lunar at least the 4500A or DelphiA for the Hologic.

    We always use Hologic's, and, in fact, we, you know, are constantly trying to improve those machines.  I'm not going to tell you they're perfect, but they're the best thing we have.  I think they could make them better.

    DR. GARZA:  You also indicated that individuals or personnel had to be specialized or had to be highly trained.  How much training do individuals need to be able to use this equipment reliably, or was the training in reference to just training and dealing with pediatric populations?

    DR. ELLIS:  It's more dealing with the pediatric populations.  It's more of that than it is simply for this, because once they understand--basically what you want to do is you want to minimize motion and have them in the right position and things like this.  But it's more dealing with the pediatric population, dealing with a child that may want to cry for 20 minutes or something, or a mother that could be apprehensive when she hears the child crying.  It's more that issue than it is anything else.

    DR. GARZA:  Dr. Stallings?

    DR. STALLINGS:  I just want to ask a little bit what Bert was doing.  If we were doing such a study, a multi-center study, what would your advice be about centralized reading of the scans, the technician at the instrument site?

    DR. ELLIS:  It is a good point.  These days, again, it is common practice now to send all the scans to a common central reading site because at least what happens there--well, I'm thinking more of the adults.  If there is any kind of bias--in adults you set regions of interest.  In the infants, it's a total body scan.  There's no region of interest set.  So it's less of an issue there.

    But, again, the judgment about good scans or bad scans would come from one source and not from different sources.  So it would be a reasonable thing to do.

    DR. GARZA:  Any other questions?

    [No response.]

    DR. GARZA:  Thank you very much, Ken.

    DR. ELLIS:  Sure.

    DR. GARZA:  Committee members and guest speakers are invited next door for coffee.  We're going to break right now, instead of at 10:35 as on your schedule, so that we don't break up the following three presentations.  But I will ask everyone to try to get back here at about 10:25, 15 minutes from now, so we can assure that we don't eat into any discussion time with either the speakers or anyone else.

    Everyone else that is not a speaker or on the committee is invited to the cafeteria.  These are federal rules.  I didn't make them.

    [Laughter.]

    [Recess.]

    DR. GARZA:  The committee is seated at the table.  If I can have our guests please take your seats, we're ready to start.

    Our next speaker is Dr. Frongillo.  Dr. Frongillo is an associate professor in the Division of Nutritional Sciences at Cornell University, and he's going to give us an overview of the World Health Organization Growth Reference Study that was referred to a bit earlier by Dr. Chumlea.

    DR. FRONGILLO:  Good morning.  I'm going to stand here in the middle and use this archaic technology.  It might help if we could dim the lights up at the front here a little bit.

    These are some growth data from a single child, and you can tell what country they're from if you look at the units of measurement in pounds.  And if we plot the data, it's actually more interesting to look at.  We see a trend like we expect.  But to try to really discern anything about what the pattern is, it's helpful to compare it to something.  And so this is a graph--this is the old U.S. reference, the 1978 reference, and this is the same child.  And you can see this graph on the left, the child started off at the bottom of the distribution and then seemed to climb into the chart a bit, and then at about four months or so started to really--three to four months in there, started to really--its trajectory is now falling well below the chart.  And eventually by--this is about two years where you see it comes back onto about the same percentile where it originally started.

    This discrepancy that we see is either telling us something about this particular child or it's telling us something about the reference.

    Well, it turns out it's telling us something about the reference because that's a breast-fed child.  And if we look at a data set, this is a comparison that the WHO infant growth--an analysis that was done in the early 1990s, and what it shows is that, if you look on the left here, this is for boys, this is weight in kilograms and age going up to 12 months.  And the dotted line to the 1978 U.S. reference which was adopted about that time, just after that by WHO, is the international reference.  And the breast-fed data set, these are infants who were exclusively breast-fed for four months and then continued breast feeding through the first year.

    And you can see again that about four months you start to see the solid curves deviating from the dotted lines, which is the same pattern that we just saw.  And so this is showing that in a sample of about 426, I think it was, infants epidemiologically what we just saw in that individual child.

    And a similar pattern was seen for girls.  We can amplify this in a way by--what I've done is just simply take the current reference, the current international reference, the 1978 reference, and that's what would be at zero.  So if these children were growing exactly like the reference, there'd be a horizontal line right at zero here.  But what we can see is that this breast-fed set seemed to grow, if anything, a little bit faster at first, and then by the end of the first year had increased their weight substantially less than the U.S. set.

    In fact, if we calculated the difference in rates for these two groups from zero to 12 months, it's about 2.7 grams per day.  So this is something to keep in mind for later when we're thinking about how big our meaningful difference is.

    The difference from about one month, which is at the top there, from the maximum to the minimum at 11 months is about five grams per day.

    So that was for weight.  If we calculate a z-score--and we'll have--I'll put this up here because we're going to hear about z-scores at various points.  A z-score is where we take a particular measurement for a child and compare it to a reference median, whatever the reference is, and then divide it by a reference standard deviation.  So the graph I just showed you just showed the numerator there, but for a z-score we also divide by the standard deviation.  And the reason we do that is because then it's easy to imagine that the growth of a, quote, normal population would fall between about minus two and plus two z-scores.  About 95 percent of the distribution would fall there.

    So if we look at z-scores for this breast-fed set, then what we saw was that, regardless of the index that was used, whether it was length-for-age, which are the triangles, which is the curve near the bottom, whether it was weight-for-age, which is the circles, or weight-for-length, which is the squares, we saw a very similar pattern with this breast-fed set in comparison to what was then the U.S. and international reference.

    So this discrepancy, along with other information that was obtained during the review that the Infant Subcommittee made during the early 1990s leading up to the WHO publication in 1995 of the uses and interpretation of anthropometry, the recommendation was made that consideration should be given to making a new international growth reference.

    The justification for having an international reference is, first of all, that it allows cross-national comparisons to be made that otherwise couldn't be made; and since there's been an international reference since the late 1970s, it's allowed us to do some things in a comparative way globally that we weren't able to do before.  For example, this is a graph from a WHO publication that was in the bulletin of the WHO showing that the trends that have occurred from 1980 to about now so that we could actually look at the progress in Africa on the left, the very rapid progress that occurred in Asia.  This is in percentage--the percent of the population that's stunted--and the rapid progress that was made in Latin America and the Caribbean region.

    This kind of comparison has been made possible because there is a common reference being used throughout the world.

    In addition, we know that it's very expensive to make local references, and also that in developing countries where there's still a very strong cyclical trend in growth, if a local reference were made, it would have to be revised very quickly because of changes that are occurring.

    The justification for having an international reference goes back to work that was done in the early 1970s.  This is a well-known graph from a well-known paper, in '74 I think it was, showing that these curves right here were all curves of high SES children in well-off countries, whereas the ones that were down here were children in developing countries that were not so well off.  And so the fact that these were all so close together meant that growth roughly from one place to another where children are growing in conditions that are favorable to growth tends to be roughly about the same.

    Some work that the WHO has done with a cross-national data set collected by the Human Reproductive Program recently shows that--this is for girls--across a number of different countries, these are children who were reasonably well off SES, not necessarily the highest SES, showed basically that, with the exception of this lower curve, which is in China, that these other curves all pretty much are very close together.  Again, giving more recent evidence of the idea that it was reasonable to make an international reference where data from multiple countries could be combined.

    So let me tell you, then, a little about the effort that's underway in the Multi-center Growth Reference Study.

    First of all, I wanted to point out that a reference--the idea of a reference is that it's a tool for providing a common basis for the purposes of comparison.  So we're interested in references because it allows us to compare as opposed to a standard which then involves a judgment.  So here we're talking about making a reference, and during the early 1990s, both the U.S., in preparation for the revision of the U.S. reference, and also WHO examined the current reference which was being used in both the U.S. and internationally.  And the sample that had been used for the early infancy especially was from Fels, which was one particular place in Ohio.  The measurements were taken every three months, and in the very early period we might wish for more than that.

    There were very few infants that were breast-fed for an extended period of time, and at the time that this reference was made, there simply wasn't the technology to do curve-fitting that we now have.

    So those reasons, plus the main factor that the breast-fed infants seemed to grow differently than infants who were not necessarily breast-fed according to feeding recommendations, drove the decision to make a new reference.

    At the time the WHO feeding recommendation was that infants should be breast-fed exclusively from birth up to about four to six months, and then after that they should continue to be breast-fed for up to two years or beyond.  You may know that WHO recently--I guess about a year and a half ago--revised this to be from birth to about six months for exclusive breast feeding.

    So the objective of the Multi-center Growth Reference Study that WHO is doing is to build a set of growth curves for all children under age 5 years to be adopted as a new international reference for assessing the growth and nutritional status at both the population and individual level.

    When this effort was started, it was clear that there were a couple of conceptual issues that needed to be thought through.  One was that some references have been constructed, especially, for example, in the U.S. or in England, to take two examples, have been constructed to be descriptive references, meaning that they were intended to describe the growth of the population at a particular time.

    This is different than what's going on in the Growth Reference Study, the WHO Multi-center Growth Reference Study, which we can think of as perhaps a prescriptive reference, meaning that it's meant to be a reference that depicts the growth of infants who were fed according to current recommendations for how children should be cared for during infancy.

    The other issue had to do with maximal growth versus optimal growth.  In the past, we've had a tendency to think that maximal growth and optimal growth are the same thing.  The graphs I just showed you indicate that when infants are breast-fed, at least during the first year, and perhaps into part of the second year, they are not the maximum size they would be if they were breast-fed, but we think that because they're fed following feeding recommendations that that corresponds to optimal growth.

    So the design of the study involves multiple geographically diverse sites.  There's a longitudinal component which goes from zero to 24 months.  And each site was asked to recruit about 300 infants per site in the hopes that at least 70 would be available for inclusion in the final reference.  We've actually done better than that because the compliance with the feeding recommendations by the mothers and infants has been much higher than the 25 percent that we feared might be there.  So we've actually ended up with quite a bit more than 70 percent.  And then very frequent measurements, I'll show in a minute, and then there's a cross-sectional component which overlaps the longitudinal component.  It starts at 18 months, goes up to 71 months, past 5 years, to make sure we have enough data on the right-hand side to be able to characterize growth well up to at least 5 years.  And sites were asked to recruit about 1,400 per site, which, again, would give a minimum at each age of about 70.

    In the longitudinal component, which is a very demanding part of this study, measures of weight, length, and head circumference are collected frequently during the time.  At birth, there's one visit, of course, and then in months 1 to 2 they're biweekly.  So there's four visits there.  In months 3 to 12, measurements are monthly, so there's 10 visits for that.  And then in roughly the second year, they're bimonthly, which is six visits during that time.   And then arm circumference and skinfold measurements are also taken in the same schedule, starting at 3 months, as in the other measurements.

    Now, the way the study was constructed, there were a set of criteria at the population level and then a set of criteria at the individual level.  So at the population level, the idea was to find populations of infants who did not have socioeconomic constraints on growth, where mobility would be low so that they could be followed, where at least 20 percent were willing and able to follow the WHO feedings recommendations, with support, so there had to be existence of or at least the ability to build breast-feeding support systems; and then there had to be local presence of collaborative institutions who were capable of carrying out this kind of exacting work.

    Then at the individual level, individual criteria were set that there was an absence of health, environmental, economic constraints on growth; the mother was willing to follow the WHO feeding recommendations; that the mother was a non-smoker; that gestational age would be at term, which we defined to be 37 to 42 weeks; and that the infants wouldn't have any severe illnesses that would be expected to affect growth.

    The protocol for the study site selection then applied to subpopulations the fact that socioeconomic status did not constrain growth, it was low altitude, low mobility, the minimum of the 20 percent, existence of breast-feeding support systems, the local institutions.  We looked at the rate of hospital deliveries because we had to know that there were enough infants being produced quickly enough that they could be enrolled in the study so we could get the study done sometime in our lifetime; that there would be sufficient numbers of eligible birth; and that it was feasible within those locations.

    In some places, for example, in a really huge city of 10 million, it's just not feasible to do a study like this.  The logistics are too difficult.

    The Steering Committee also considered some other factors in its thinking about in looking at mean birth weights, maternal heights, complementary feeding practices, health-related behaviors, and the existence of environmental hazards.  The Steering Committee looked at geographic distribution.  It's a global reference, so WHO tried very hard to have geographic representation throughout the world, and funding issues, because it's expensive to do this kind of study and we had to think about where the funding would come from and how that could be arranged.

    The protocol was developed by this set of characters here.  The main reason I put it up here is to show you that it's a multidisciplinary set of people representing a diversity of backgrounds and institutional relationships.  So this group put together the protocol roughly in the '95, '96 time frame.  And then the study is being run now with an advisory group:  Cameron Chumlea, Tim Cole, myself; Ray Martorell is the Chair of this group; John Van den Broeck, who recently moved to South Africa; senior scientists representing CDC and UNICEF, previously was Roger Shrimpton in UNICEF; and then WHO, the day-to-day work gets done at the sites and at WHO Secretariat.  Mercedes de Onez coordinates all of this with her staff there.  A very dedicated group of people.

    The sites that have been selected are the following:  in Pelotas, Brazil, in the south part of Brazil near the coast; Victoria is the PI with Cora Post; in Oslo, Norway; in the U.S. at Davis; in Muscat, Oman; in Accra, Ghana, the capital of Ghana; and in New Delhi, India.  These are the six sites.  Each of these sites represents very differing and very large challenges to carrying out this study.  And it couldn't be done without the commitment of the teams there who are doing the work on the ground.

    The Steering Committee is chaired by Cutberto Garza representing UNU.  Data management is done by local data entry and checking at the local, each site, and then the data are shipped to the WHO Human Reproductive Program.  They have extensive experience in handling large, multi-country data sets, and they've done a fantastic job in coordinating all of this and in helping to ensure data quality.

    The decisions and information about the study, there was a working group on the growth reference protocol.  We have Steering Committee and Advisory Committee meetings periodically, other meetings.  Particular tasks are handled by other meetings.  We do a lot through electronic mail.  Various site visits are made, were made before the study started in preparatory work and are made throughout the study, and rapid surveys were done at the beginning to get information that was needed to actually do the planning for the data collection in the particular sites.

    To give you an idea of what's involved, I just made a list here of sort of the documentation that's been produced, which will give you a feel for what was necessary to carry this off at the level of scientific quality that was being strived for.  The protocol was developed, a measurement of standardization protocol was developed.  A manual of operations, a generic manual of operations was produced, and each site had to adapt that for its particular location.

    There was a protocol for the 12-month visit.  A special effort was made throughout this study on the epidemiological quality of the study to follow all infants.  Even if mothers wanted to drop out or weren't complying, an effort was made to follow them as much as possible so we could keep measurements for every infant even if they weren't exactly following the feeding recommendations.

    In particular, we had an effort made that at 12 months we could go out and get at least some measurements on all those who were not willing to continue.

    There were guidelines for complementary feeding.  A protocol was developed for assessing diet, for the cross-sectional study, for data management, and then questionnaires were produced for both the longitudinal and the cross-sectional study.  And if any of you have been involved in questionnaire production, you can imagine what those meetings were like as people argued about the exact wording of every question and every answer.

    So that's an overview of the study.  Basically where we are now is that the last site--the sites have been selected over time.  Some were able to start earlier than others.  Some had technical challenges that took longer than others.  So in the next few months we'll be finished with data collection.  There's a meeting coming up to look at and try to decide on the final methodology for analysis.  Some preliminary work is underway.  So this will be proceeding, and we're talking about having a reference be available in the 2005 year.

    There's a lot of work that needs to be done in order to prepare for not just producing the reference but preparing for how it will be depicted and how it will be used and testing that will have to be done with the reference under the auspices of WHO.

    DR. GARZA:  We have about five minutes left for questions.  We might be able to go over.

    Dr. Stallings?

    DR. STALLINGS:  Well, one, to compliment the group.  This is an extraordinary effort and an extraordinary study.  But to cut to the chase, do you think that we will see this used in the U.S. as the growth standard for infancy through 5 years or even infancy through 3 years with what we think of now as our traditional infant chart?  So I'd be interested in your opinion and certainly in relation to what we're here for, which is to start to understand the best comparison group for children in the U.S. who are taking infant formula.

    DR. FRONGILLO:  Okay.  Well, I think I'll probably not try to answer your question so directly.  Let me just say that I think the advantages of this reference will be that it's longitudinal, that the longitudinal data in the first couple of years will have the ability to look at--to have a reference, a velocity reference, and so a judgment will have to be made whether that's better than, for example, the Iowa/Fels data that are available.

    The second thing is that we know that infants who are fed following the breast-feeding recommendations will show a different pattern of growth than formula-fed infants.  So to the extent to which it's seen as desirable to have a reference which fits that growth pattern for infants who are being breast-fed, then I think that would certainly be the advantage of the new reference.

    DR. GARZA:  It may be useful to describe the references that are going to be available.  Is it just weight and length, or are there others?

    DR. FRONGILLO:  Well, those reference data will be available for all of the measures that I showed, so we will have data on weight and length, head circumference, arm circumference, and skinfolds.

    DR. GARZA:  Any other questions?

    DR. THUREEN:  Why did you choose to include skinfold measurements, and what kind of information did you hope to get from that?

    DR. FRONGILLO:  Well, I think that there was a debate about, you know, the importance of this and when it should be started, and I think the lack of reference data on skinfolds was very compelling and it was thought that, given the potential usefulness of that information in the future, that while a study of this effort was--while this large effort was being made, it would be important to have that kind of information available.

    DR. GARZA:  Dr. Baker?

    DR. BAKER:  I have a question about the prescriptive nature of this.  If you're going to do a study like this using a prescription, it assumes the prescription is right.  It also assumes that it also would change, presumably, if the prescription changed.

    Now, the WHO has changed it since this study was done.  Does that make a difference?

    DR. FRONGILLO:  Well, certainly we thought about this a lot during the time in which the study was being planned.  I don't think anyone imagined that the basic recommendation about breast feeding and complementary feeding is going to change appreciably, and at least not for quite some time.

    Now, if it does, one of the things we wanted to do and part of the reason for the intensive follow-up even for infants who didn't exactly follow the current feeding recommendation was that it allows us to have the information available so that if 15 years from now we have new knowledge and decide that some slight revision of the feeding recommendation is made, anybody who's ever thought about changing the feeding recommendation will run after that possibility.

    But, anyway, if anybody gets brave enough to try to do that, we will have the information available from the cohorts in the six sites so that one could conceivably reconstruct the reference to conform with that.

    DR. GARZA:  Yes?

    DR. DOWNER:  Have you decided exactly which tool you will be using to do the skinfold measures?  And my second question is:  Because what we consider SES for different world populations differ so widely, how are you going to decide on what to use in this study?

    DR. FRONGILLO:  Okay.  The skinfolds, which tool, do you mean which skinfold caliper?  Basically I think--and Cameron can comment on this because he's the expert here.  But my experience is that it's not the caliper that makes much--any difference at all in the measurement.  It's the quality of the enumerator and their training in using the instrument.  So that's not going to make any difference.

    DR. DOWNER:  What instrument have you planned on using?

    DR. FRONGILLO:  The instrument we are using is--what is the instrument we're using?  Holtain, right.  Okay.

    And the second question?  I'm sorry.

    DR. DOWNER:  The SES.

    DR. FRONGILLO:  Oh, the SES.  In each site, a survey was done before the study began, the main study began, to actually look at the relationship between socioeconomic status and growth so that we could develop in each site exactly what the criteria needed to be from a socioeconomic standpoint in order to ensure that the population of infants selected was at high enough SES to not constrain growth.  So that was done separately in each of the sites, and some of us traveled around to different sites to help them actually carry that out.  And you're right, in each site different criteria were needed because the conditions were different.

    DR. GARZA:  Any other questions?

    [No response.]

    DR. GARZA:  All right.  Thank you very much.

    We'll move on then to Dr. Larry Grummer-Strawn, who is the branch chief of the Maternal and Child Nutrition Branch at CDC, and he will tell us about the NCHS/CDC's growth charts.

    DR. GRUMMER-STRAWN:  Good morning.  If we can figure out how to forward this?  Which one?  Just here?  Okay.  Thank you.

    This morning I want to give kind of an overview of the new growth charts and contrast them to the old NCHS growth charts.  I'm going to start off with kind of a historical perspective.  I'm sure many of you know the history but to kind of just set a context for all of us, do some comparisons of differences and similarities between the old and the new, and then go into some of the differences a little bit more explicitly, and then finally end with some analytic issues that the charts pose for us.

    The original NCHS charts were released in 1977.  Those charts were only percentile curves.  They were published by Hamill, et al., really became the standard of reference for all U.S. infants.

    Subsequent to that, there was a normalization of those curves at CDC.  Those were actually published in 1987, but were actually available for use long before that.  So people who were interested in normalized curves had access to them earlier.  And, finally, the WHO adopted those curves as being the international reference, really referring to the normalized curves.  The adoption by WHO actually came prior to the publication of the normalized curves.

    The reason that I point this out is that those curves never became one and the same.  The percentile curves never matched with the normalized curves, and so someone who was using clinical charts that actually saw the graphs in front of them was not necessarily using the same cut-off points as someone who was using computer software or might be analyzing data sets.  They were very similar to one another.  They were analyzed off of the same data, but were slightly different from each other.  And then, finally, in May of 2000, CDC released a revision to these charts.

    Now, at the time that the original charts were created, NCHS was a separate agency, and so they were referred to as the NCHS Growth Reference.  When people talked about the normalized curves and put them in an international context, they might have referred to the NCHS/CDC/WHO because of the separate role each of those agencies played.

    Subsequent to 1977, NCHS was actually incorporated into CDC, and so the new charts are referred to as CDC charts.  That does not mean that NCHS was not an active player.  They actually were the progenitors that moved the new charts forward.  But it was in a larger context of CDC, and other parts of CDC were also involved.

    So what are some of the similarities?  First of all, both sets of charts are looking at the same indicators.  We have weight-for-age, length-for-age, weight-for-length and head circumference-for age.  Both sets are sex-specific.  In neither case did we have any separation according to the parental anthropometry, race, ethnicity, infant feeding mode, different things that might impact on the growth of the infants.

    Ed just described for us kind of the difference between the idea of a reference and a standard or a descriptive reference and a prescriptive reference.  These clearly are references, not standards.  The only kind of movement toward a standard is that very low birth weight infants, that is, less than 1,500 grams, were not included in the new charts, the CDC 2000 charts.

    These charts reflect attained size, not incremental growth, and in both cases, we have accessibility of percentile scores and z-scores.  In the new charts, the z-scores are one and the same.

    The differences between the old and the new are, first of all, that in the 2000 charts, the data for infants are now nationally representative.  These represent a broader spectrum of race and ethnicity across the United States, a broader spectrum of socioeconomic status, and there's an increased representation of breast feeding in the charts.  That doesn't mean that they are primarily breast-fed children, but there's a mixture of formula-fed and breast-fed children.

    The 2000 charts are based on a pooling of several data sets coming together, whereas the 1977 NCHS charts were all based on the Fels data set.

    There were some minor changes to the smoothing techniques, which I will mention briefly.  As I said, the z-scores now are one and the same as the percentiles.  There's a one-to-one match on those, so it doesn't matter whether you use computer software or you're using printed charts.  You're going to be looking at the exact same cutoffs.

    Another minor difference is that the length now extends down to 45 centimeters rather than 49 centimeters when we're looking at weight-for-length, and when ow have on the clinical charts the accessibility of 3rd and 97th centiles.  Of course, based on normalized curves, you can get any kind of centile that you're interested in, but the difference is that the clinical charts that are produced actually do extend out to the 3rd and 97th percentiles, and the smoothing-out rhythm is intentionally extended out to those centiles to make sure that we're incorporating the original data out that far.  And, finally, as I mentioned before, very low birth weight infants are excluded.

    So what are the data sources?  Well, in 1977, I'm sure you're all familiar with the Fels Research Institute study.  It was done in Yellow Springs, Ohio, primarily represented Caucasian, middle-class families.  And while not exclusively formula-fed, this group is considered to be almost all formula-fed infants.

    There was a longitudinal follow-up study.  The children were followed from birth, 1 month, 3 months, 6 months, and at three-month intervals after that, but we're focusing on the first 6 months today.  The data were collected between 1929 and 1975, and there were a total of 867 infants.

    Now, the data for the CDC 2000 curves, as I said, represents a number of different data sources.  What I've put up here is a graphic showing you how at different ages, different data sets come in.  I'm going to talk about kind of these different data sets at different points.

    The primary data source is the NHANES III data here when we're talking about children birth to 6 months of age.

    Now, this is represented by the long line here in light blue.  The NHANES III was the only data set for which we had nationally representative data prior to 6 months of age.  Starting at 6 months, the NHANES II data were also available, and starting at 12 months the NHANES I data were also available.

    Now, the reason that these are important for us today is that because we're smoothing these curves across age, the influence of those older data sets does come in at 6 months of age and at 12 months of age.  The smoothing is across all ages, and so the curves below 6 months are also affected by those other data.

    However, the NHANES III data started at 2 months of age, and among 2-month-olds, it was a fairly small sample size.  In order to extend these curves down to birth, we had to look to other data sets.  And in each case, whether we're talking about head circumference-for-age, length-for-age, weight-for-age, weight-for-length, in each case we have to turn to different data sources.

    I'm going to start in the middle here to talk about in the weight-for-age.  In this case, it was fairly straightforward to use a birth point coming from the national birth certificates.  We had all of the birth certificates during the years that the children in these NHANES surveys had been born and had the birth weights available on all of those.  So it's a huge sample size, a very precise point that is truly not only nationally representative but a census of all births in the United States.  And so in creating the curves, we were able to connect that particular point, actually anchored the curves to that point, and then smoothed it with the data starting at 2 months of age from the NHANES.

    However, we don't have national data on any of the other indicators, head circumference or length.  With regard to length, we did find that there were two states that routinely collect length data at birth in a representative fashion.  We analyzed the data in those states for their birth weights against the national birth weight distribution and found that they were quite representative--those states were Missouri and Wisconsin--and felt that because the birth weight distribution matched the national distribution, we could expect that they should represent the national birth length distribution, even though we do not have data on the national birth length.

    So when we looked at the weight-for-length curves, those are based on connecting the dots between the Missouri and Wisconsin data on weight-for-length versus the NHANES III data weight-for-length, and those curves were connected together.

    Initially, we intended to do the exact same thing with length-for-age, use only the data from Missouri and Wisconsin to connect these curves across age and with the NHANES III data.  We did that in our first pass but analyzed--as we were evaluating the curves against alternative data sets, we found that we were comparing against the Chicago data set, the WHO pooled data set that Ed just described for us, as well as some of CDC's surveillance data, and found a common pattern in all three of those that the curvature between birth and 6 months did not match what we found in external data sets and felt that this was partly an artifact of the fact that we only had about 35 infants from the NHANES III data that clearly were not matching the normal pattern of growth.  And so the curves were being pulled in the direction of those NHANES III data from a very small sample size.

    So what we opted to do was to choose an additional data set to add in here between just beyond birth--these were not birth points, but at the first visit to a clinic--up through 5 months of age from the CDC's Pediatric Nutrition Surveillance System.

    These are data on low-income infants.  However, we didn't use a representative sample of low-income infants.  Instead, what we did is we chose clinics that matched the national distribution in terms of their mean, standard deviation, and skewness at each age from birth--from 3 months of age through 11 months of age compared to the NHANES III data.  So we were pulling out clinics that the children in that clinic happened to look exactly like the national distribution and chose those clinics and assumed that they would also look like the national distribution would have looked between birth and 3 months, and then added those data to the curves here and connected using the Missouri and Wisconsin data, the CDC nutrition surveillance data, as well as the NHANES III, and continued the curves using that.

    Finally, for head circumference, we had no national data, and we returned once again to the Fels Institute data for the head circumference at birth point and, again, connected that with the NHANES III data.

    So you see the picture here is one of bringing together multiple data sets.  We had a number of comparisons to try and make sure that these were valid comparisons to make, but it certainly leaves us with a difficulty using multiple data.

    So what are some of the other differences?  First, with regard to the smoothing, the old curves were smoothed with cubic splines, with knots at birth, 6 months, and 18 months, for those of you who work on these statistical arenas.  What that meant for us was that there were six independent parameters that characterized growth between birth and 3 years of age.

    In the CDC 2000, a completely different approach to smoothing was applied.  Here we used fractional polynomials that had been used previously in other growth studies, primarily used in the Fels data as well as some Canadian data.  And so they had kind of been proven methodologies for assessing growth during the first three years of life.  However, there were a smaller number of parameters that described growth, really just three independent parameters--roughly three, because there were some other things that were done to the curves to get them to fit.

    Finally, on weight-for-length, there were no set models, and so we used a 5th degree polynomial to maximize the flexibility of the curves there.

    I mentioned before that in 1977 the standard deviations were estimated as a separate path and created a separate set of curves.  In that case, we had two different standard deviations.  There was a standard deviation above the median and a standard deviation below the median.  And so if you think about kind of plotting the standard deviation as it goes across the curve, it was a low standard deviation that instantaneously at the median rose to a higher level.

    We calculated these scores in the normal way, taking the measure minus the median divided by the standard deviation.

    In the CDC 2000, this kind of immediate change in the standard deviation at the median was thought to not be desirable, and so instead we had a more continuous change in the standard deviation.  In this case, we transformed the data with a Box-Cox transformation, a power transformation.  People understand how you take the log transformation of a data set or a square root transformation.  The Box-Cox is a continuous set of transformations that you can then choose a parameter to say how much you want to transform that data to make it symmetrical.  And then once it is symmetrical, you can fit parameters to normalize that curve.

    Some other differences, clearly this group would be interested in the differences between the infant feeding in the groups.  As I mentioned before, the old charts were virtually all formula-fed infants.  The quality of the infant formula across that time, of course, has changed, so it is a mixture of a number of different kinds of feeding across those year '29 through '75.

    In the CDC 2000, we have a mixture of breast feeding and formula feeding, but it still is primarily a formula-fed group.  At 2 months of age, about half were formula-fed, half breast-fed, but by 6 months of age that was down to 28 percent currently being breast-fed.  And we're not talking about exclusive breast feeding.  As you can see, the exclusive breast-feeding rates are much lower than that.  Down to less than 10 percent by 6 months of age were exclusively breast-fed.

    If I can just take a couple of minutes to show you a comparison between the old and the new curves with regard to breast feeding, and we'll stop there, Ed showed a number of things as to how the old curves performed against WHO pooled breast-feeding data set.  We did some additional comparisons seeing whether the new curves have actually improved that situation.  So we've compared the WHO pooled data set that came together from six different studies of exclusively breast-fed children and pooled those data together.  And instead of plotting the means, which is what Ed was showing us, here I'm going to show the percent below the 10th percentile using the old curves versus the new curves.

    When we're looking at weight-for-age, you can see that there really has not been much of a change between the old curves and the new curves.  We have this same problem that as children get older, we're going to diagnose more of them as being underweight.  Just as Ed was showing us that the means go down, the percent that would be low is going to get higher.  And we still have that problem with the new curves, that it is considerably--we would have considerably more older children being considered underweight in this breast-fed data set compared to the younger children.

    However, with regard to height for age, we've improved the situation somewhat.  With the newer curves, there still is a tendency toward increasing the percentage that would be considered low as we get to older infants, but it's not as steep a trend as we had with the older curves.

    And, finally, with regard to weight-for-height, whereas the old curves showed that same pattern of worsening nutritional status over the first year of life, on weight-for-height the new curves have pretty much wiped out that problem.  We see a much flatter distribution across age.

    I'm going to skip over these because Ed is going to come back to many of these points in his description of the analytic issues.

    Conclusion:  The interpretation of the new charts is really not widely different from the old charts.  We're still using the same kind of way of thinking about growth and the way we analyze growth as very similar in the old curves to the new curves.

    There are a number of enhancements that argue for changing over to the new curves.  I'm not arguing that we haven't made enough of a difference to adopt these new curves.  However, I do think that the WHO reference that Ed has described would relate to a more substantive change in our interpretation of growth parameters, and we really need to give much more thought to different ways of thinking about growth than we have so far.

    Thank you.

    DR. GARZA:  Thank you very much, Larry.

    Any questions?  Dr. Stallings?

    DR. STALLINGS:  I actually have a series of questions, lots of things jotted down.  One, also, thank you and your team for doing this.  It was a monumental change to give us this from the clinical point of view.  It's wonderful to have the charts revised.

    In that last set of slides where you were showing the less than 10th percentile, just so I'm thinking about it correctly, the perfect outcome would be 10 percent would be less than the 10th percentile, because if we were looking at a population study, would you by definition expect 10 percent to be less than the 10th percentile?

    DR. GRUMMER-STRAWN:  Yes.  What I'm more concerned about is the pattern of growth there than the actual level.  We're comparing curves that are based on formula-fed infants mixed with some breast-fed infants against a group of breast-fed infants.  And so I wouldn't have been surprised if the level was somewhat different in a group of breast-fed children that might have less malnutrition than a general U.S. population.  But I would expect the pattern should be representative of the pattern of growth.

    DR. STALLINGS:  But, still, when I was looking at it, when there was almost zero less than the 10th percentile, I saw that as unusual as when we have 20 percent less than the 10th percentile, that that's sort of the concept.  If we were getting--whatever right is, but if we were getting it right, the population would go along that.

    While we have all the experts in the room, we've heard three different people say three different things about whether the Fels data included a lot of breast feeding or not.  And before you guys leave today or tomorrow, I'd like that to be readdressed.  You know, you were very helpful in showing in your slide the percentage of children who were breast-fed and then exclusive, and so obviously it's not one number.  Where you are at 2 weeks of age is different from 6 months.  But I think it would be helpful for the committee and for the FDA to have an understanding of what we believe the Fels data represented as exposure to breast feeding and what you believe your 2000 charts represent.  This is--

    DR. GRUMMER-STRAWN:  I'd like to have that, too, because I certainly--

    DR. STALLINGS:  Okay.  Well, then, you guys aren't leaving until we get this right.

    [Laughter.]

    DR. STALLINGS:  The other part of that is I think getting a handle on that will be even more important as the WHO new charts come out and those data are there.

    My last question, which really is a question, is:  If I understand, then breast-fed babies grow faster during the first 4 months of life compared to our usual reference data and more slowly between 4 and 6 months, if we were to look at the zero to 6-month period that we're really supposed to be focusing on.

    DR. GRUMMER-STRAWN:  I think it's more like around 3.  Is that about the peak?  Three months of age.  So faster in the first 3 months of life and slower after.

    DR. STALLINGS:  So that's part of the pattern that we're trying to capture that has made individuals difficult to look at.

    Thank you.

    DR. GARZA:  Any other questions?

    [No response.]

    DR. GARZA:  I have two.  In thinking through the reasons why the WHO took more frequent weight measurements, one of the principal things that drove that was that very often the pattern of growth was used to assess the nutritional management of infants, the first 3 to 6 months.

    To what degree can the present reference between used with that amount of detail, the first 3 to 6 months?  Were the smoothing techniques in your judgment sufficient to capture the differences in growth patterns that Ginanne just described?  Or did the smoothing eliminate much of that?

    DR. GRUMMER-STRAWN:  First of all, you have the whole difficulty of dealing with the cross-sectional data as opposed to longitudinal data.

    DR. GARZA:  I know.  That was Part B to 1.

    [Laughter.]

    DR. GRUMMER-STRAWN:  So all of those issues come in there.  You have not a very large sample size in that age range, and so there is a fair amount of noise.

    In addition to that, you're fitting basically a three-parameter model to the first three years of life.  That doesn't give you a whole lot of degrees of freedom to really let that first few months take on a particular shape.

    That said, we did examine what the curves were doing in that age range, and they looked like they fit fairly well.  But there's a large amount of noise in the cross-sectional data that bounces from month to month, and you look at the curves and say, you know, I think we did the best we can given the data that we have.

    So I don't want to blame the three-parameter model, but I would say that if one had better data and wanted to capture really what is the pattern of growth in the first year of life, I probably would not do the smoothing in the way that it was done.

    DR. GARZA:  And the second, since you answered Part B of 1, we've been asked to look at various control groups, either historical or using specific references as controls in clinical studies.  You also make the distinction between standards and reference.  To what degree can, in fact, one use the present reference in making judgments, value judgments for clinical studies and making comparisons between feeding groups that might be placed on new formulas and the current CDC reference?  Is it sufficiently robust to be used as a standard in making that clinical judgment that control groups normally play in clinical studies?

    DR. GRUMMER-STRAWN:  I think the question is one of what do you want to compare to.  What is right?  In comparing to these curves, you are implicitly saying I am comparing this child's growth or this group of formula-fed infants' growth to the way children have grown in the United States essentially over the last 10 years, maybe 15 years.  Are you comfortable with that kind of a comparison?

    You would say based on formula--in assessing a formula, you are saying this particular formula that we are evaluating generates a pattern of growth similar to the way children grow in the United States, whether they are formula-fed or breast-fed, fed on all kinds of different formulas, mixtures of solid feedings coming in at various ages.  If you are comfortable with that kind of a comparison, this formula produces a pattern of growth like children in the United States, like a cross-section of all the children in the United States, then I say yes, this is a perfectly fine comparison to make.

    If you want more of a prescriptive statement that this formula produces a pattern of growth that is the most healthy, I don't think that you can say that.  I don't think that we can infer that a cross-section of infants from the United States with all of the variety of primarily infant feeding patterns--there are also varieties in terms of socioeconomic status, the kinds of conditions children are running up against.  I think that in terms of the impact of those on these curves, those are not as great.  But the variation in feeding patterns, I don't think that we can say that we have the best pattern of growth here.

    DR. GARZA:  Any other questions?

    [No response.]

    DR. GARZA:  Thank you again.

    We'll move on to the next paper.  I don't think Dr. Fomon needs an introduction.  I can't think of a more senior person in pediatric nutrition than Dr. Fomon, and I'm very pleased that he was able to join us today and was willing to leave lovely Texas for Washington for this purpose, from one native Texan at least.  And he will be addressing the Iowa data and the Iowa/Fels growth data.

    Thank you very much for joining us, Dr. Fomon.

    DR. FOMON:  As the most senior member of the presenters, I'd like to make a statement for the presenters that we were asked to prepare a 20-minutes presentation, and if we run over and get the zero sign at 15 minutes, we're not very apologetic.

    [Laughter.]

    DR. FOMON:  First I want to get out of the way what are the Iowa data and what are the Fels data, and then we can talk about more interesting things.  The Iowa and the Iowa Fels data, Iowa Fels data is published by Guo, et al.  Term infants measurements all made with highly quality controlled efforts.   Caucasian, we were able--and I will show you that--no, but it's in my paper--that the length and weight of the Fels and the Iowa series were very similar at three months, which gave us some encouragement in combining them.

    Iowa data.  There are other Iowa data, but I wanted to first speak about 8 to 112 days.  These were all formula fed infants, 380 males, 340 females, and mostly infants of upper socio status, but not necessarily economic, because they were mostly UI personnel, University of Iowa faculty and students.

    They were measured within 4 days of 8, 14, 28, 42, 56, 84 and 112 days.  No exceptions.  And the age of measurement by interpolation or extrapolation was made to the target age.  So we adjusted if a baby was measured at 16 days.  We used the 28, 16 and 8-day measurements to compute a 14-day measurement.  It was fully longitudinal.  There is no one of those 380 males or 340 females who was not measured at every time.  This is published, so you can read about it in books.

    Then we did also measure a number of infants from 112 to 196 days of age, and they were 165 males, 188 females, and there was a subsample of these that were also in the other group that I talked about.  There were 63 males and 74 females who went from 8 to 196 days.  We have many more now, but it's too expensive to do the analysis.

    The Fels data, there are a lot of Fels data.  The Fels data that I'm talking about are the Fels data that are included in the Guo, et al. paper, and that included 240 males and 236 females, wide range of socioeconomic status, few measurements during the first 3 months.  The target ages were 1, 3, 6, 9, 12, 18 and 24 months.  Most of the children were measured within 3 weeks of the target ages, and there were some missed points, and they arrived at these ages by mathematic curve fitting.  And there are other people who know a lot more about that here than I know.

    So the Iowa Fels data, we ignored the Fels data during the first 3 months, so it doesn't matter whether they were breast fed or formula fed.  There were, for the first 3 months, just Iowa data, 580 males and 562 females.  The reason that's bigger than the numbers I showed you before was what I showed you before was formula fed.  We included breast fed in this, and I'll tell you why.

    Then we used both Iowa and Fels data, having established that the Iowa Fels size at 3 months was very similar.  We used both for 3 to 6 months of age, and then we used only Fels data for 6 to 24 months of age.

    So those are the Iowa data and the Iowa Fels data.

    The Iowa data and the Iowa Fels data and most other referenced data, except those for international comparisons, have been developed to detect abnormalities of infants, of individual infants, and to detect abnormalities of individual infants you have greatest interest in the outlying centiles.  The more individuals you have, the more confidence you have in those outlying centiles. What you want is early detection of growth abnormalities.  Growth will not tell you whether a child is normal or not, but it gives the most important single clue to telling you that that baby is one that needs closer attention than the general garden variety baby.

    And for that reason, weight gain is more important than length gain, and that's because it gives you this clue much earlier than change in length.  I think I have that on the next slide.  And you need data for at least the first 2 years.  Iowa data are only good, up to at best, 196 days.

    I have to go back, but I don't know how to do that, so I'll tell you that--can you go back?  Just go on back.

    The weight gain is more important than length gain because as I said, weight gain gives you the clue earlier.  Length gain is very difficult to measure accurately enough to be useful for determining changes in length the way it is done in hospitals, clinics and doctors' offices.  So length is not really a very feasible way, and moreover, there are very few instances in which length gain will be abnormal and weight gain normal.  So weight is the most important thing.

    Now, when you evaluate an infant formula--we didn't think about this until much after we published most of the Iowa growth data--you have different criteria for what you need as reference data, and the characteristics are it should be longitudinal.  It's difficult for me to agree that you can use cross-sectional data as a sensitive way of analyzing longitudinal data.  If you're doing an infant formula study, you're examining how the infants grow over the period of study.  For that you need longitudinal reference data.  It should be gender specific.  I haven't heard any argument about that.

    The study integral must include at least part of the neonatal growth spurt, should include all of it or most of it.  Neonatal growth spurt is from 8 to 42, maybe 8 to 112 days of age.  After 112 days of age growth rates are substantially less, and we'll come back to that.  And you need length data as well as weight data because it is possible that you would find--and I'll give you an example of this--a situation in which babies would grow normally in weight or maybe super normally in weight, and the weight to length would be outside of what we see with usual infant formula, suggesting that maybe this formula is not fully adequate.

    The reference population should be similar to the study core, and that's always going to be a problem.  It will never be exactly like the study core, but the question is, how close can you get and how close do you need to get?

    I missed the last one.  If I knew how to use this, probably I'd know how to use my camera too.

    [Laughter.]

    DR. FOMON:  The duration of study should be at least 84 days.  That's a new minimum length.  I made up the old 3 months minimum length that's in the AAP report, and I didn't have any good basis for that, but I think that maybe you can agree to 84 days as well as you can agree to some slightly longer figure, and we'll come back to it.

    I said this.  The most sensitive evaluation of the longitudinal growth study of a cohort in the longitudinal growth study requires longitudinal reference data.  I don't say that this is gospel.  It's just what I believe. Gains in weight and length are more rapid in infant males than in infant females.  The formula may be adequate for females but not for males.  Nobody's arguing about the gender anyway.

    The study integral must include at least part of the neonatal growth spurt.  A formula may be adequate for older infants but not for younger infants.  The reason is that during the period of most rapid growth the ratio of specific nutrient  to energy is highest, and if you get beyond that period, the ratio of protein or calcium or whatever to energy may be down at a lower level.  So if you start a study at 4 months and you get 4 or 5 months of additional data, the formula may be fine, but it doesn't tell you that it will be fine starting at 8 days or at birth.

    Here is an example. These are males, and this is weight gain from 8 to 56 days, and we did a study of a relatively low protein diet based on isolated soy protein with or without a methionine supplement.  We were interested in getting a fix on the requirements for sulphur containing amino acids.  And with the methionine supplement the gain was 42.3 grams per day, and with no methionine supplement it was 38.8 grams per day.  Compared to the reference data, 8 to 56 days, this was not significant and this was significant.  From 56 to 112 days there was no difference.

    I wish I had 3 or 4 more studies to demonstrate this, but this suggests at least that you need to have an early portion where you have the maximum postnatal growth included in your evaluation of a formula.  You need data on length as well as weight, and this was the best example.  I think if I spent more time I could find more examples and then it would be more convincing, but I couldn't do that, because I had to work in my yard.

    [Laughter.]

    DR. FOMON:  These are males and this is the BMI, and this is a low-protein formula and this is the reference.  And at 8 days, when we enrolled them, the low-protein, the cohort receiving the low-protein formula had a BMI of 13.  That was significantly less than the reference data.  At 112 days the low-protein cohort had a BMI of 18.6 which was significantly greater than the reference data.  And all our data, including--if you pardon my expression--skinfolds, indicated that these babies were fat.  And what we speculate is that they were--and they took more volume.  We, in all our studies, record how much the babies eat.  We weigh the bottles in and the bottles out, and they took more energy in and they gained more weight, and we speculated that they ate more because of the low-protein content.  They were eating--I don't know how they knew how to do it--but they were eating more to make up for the low-protein concentration in the diet.  They got enough protein.  They grew normally in length.

    The study cohort must be similar to the reference cohort.  They should be healthy, that is they shouldn't include babies with illness.  They should be term if what you want to know is, is your formula going to be adequate for term infants.  You don't want to increase the noise in your experiment by adding preterm infants.  And then the question of ethnicity, terribly important in international studies, may not be so important in the United States where you're comparing what seems to be similar groups, but that's a question that needs to be carefully examined for each study.

    So the duration of study should be at least 84 days.  I think that 8 to 112 days, which is why our data are mostly 8 to 112 days, or 14 to 112 days, almost as good I think.  At 8 days, there are great advantages of 8 days.  Most formula-fed babies have regained their birth weight.  You can get a really good measurement of length at 8 days.  You're not there to get it when the baby is born, and even if you are there, the hardest time to get an accurate measurement on a baby is at birth.  Howard Meredith, many years ago, showed me some publications on how very hard it was to get an adequate length at birth.

    Now, I think that in the current U.S. climate, where you can hardly recruit any formula-fed babies before 42 days of age, which is still fairly easy, but you may be able to recruit them at 28 days.  Later than 28 days you miss too much of the postnatal growth spurt.  Earlier it's too hard to recruit, so maybe 28 days is the most feasible, and 28 to 112 is 84 days, and that's how I came to 84 days.

    Now, if I give you my recommendations to the FDA, straight out, no hedging, I would say that size data are not relevant.  I mean I know half the people in the audience hate me.  Size data are not relevant.  Data over 6 months of age are not relevant.  I'm not even sure that data over 4 months of age are relevant.  Breast-fed babies are not relevant, but that's not so serious because during the period we talk about they gain about the same.  I just don't like to muddy up a study of formula-fed infants with a mixed group that I can control.  And unless you have--you must match the cohort with the reference group.  If your reference group consists of term infants, then you can't muddy it up by including infants, preterm infants in any number that they might be present.

    So those are my messages and I'm willing to take the flak because I've done it before.

    DR. GARZA:  Thank you very much, Dr. Fomon.  I regret the miscommunication between FDA and the speakers.  We'll try to get that resolved for you in terms of the 20 versus 15 minutes.

    Are there any questions?  Dr. Anderson?

    DR. ANDERSON:  Anderson.  I understand that the Iowa Fels data are longitudinal data.

    DR. FOMON:  Absolutely.

    DR. ANDERSON:  And that the recently published CDC standard is largely from cross-sectional data.

    DR. FOMON:  Absolutely.

    DR. ANDERSON:  To what extent do the percentiles generated from the two sources differ in substantive ways?

    DR. FOMON:  I don't know that offhand.  I think from the point of view of infant formula evaluation, that it's not relevant because you can't get good analysis of longitudinal data from a cohort under study by comparing with size data.  In 1976, just to prove my seniority, I had a dialogue with Peter Hamill [ph] over about 9 months, trying to convince him to call the NCHS charts size charts so that people wouldn't be confused by thinking that they're growth charts, but I lost that argument.

    DR. GARZA:  Dr. Stallings?

    DR. STALLINGS:  Dr. Fomon, I'm interested in your opinion of a couple things.  One, the need for a control group, a contemporary control group when you're actually doing a study, you know, particularly for the kind of things we would be looking at, a change in formulation.

    And secondly, when you said that you thought the breast-fed baby didn't have any role in thinking about evaluation of formula-fed babies, I think rather than is breast feeding--the growth pattern of breast-fed babies a pattern that should be strived for with formula-fed babies?

    DR. FOMON:  Well, let's see, question number 1.  Tell me question number 1 again.

    DR. STALLINGS:  About a contemporary control group.

    DR. FOMON:  I think there are circumstances under which a concurrent control is essential, and one was the study that I mentioned to you, where we had a formula, a low-protein formula fortified with methionine or not fortified with methionine.  In that case we had to demonstrate that with methionine it maps the reference group.  But I think in general it's probably not really essential if you have good reference data for comparison, don't think it's really essential to have a concurrent control.  It greatly increase the number.  The number that you need to compare one cohort with the reference data is considerably less than the number that you need if you're going to compare it with a concurrent control.

    So just from the practicality of making it possible at all to study new formulas, I think it's a reasonable compromise not to require a concurrent control unless something about the ingredient change or whatever suggests that a concurrent control would be valuable.

    And on the other question, should a cohort fed a new infant formula be compared to a breast fed control reference group, I think that's more a philosophic matter than a scientific matter, and my own conclusion is that if you want to study a new formula, you should study it in comparison with old formulas, and not with some group that we think might represent ideal growth.  That's just what I think.  Other people think other ways.

    DR. GARZA:  Any other questions?

    [No response.]

    DR. GARZA:  Thank you very much.

    It is 5 minutes to 12:00.  We're going to try to make up the 20 minutes from this afternoon's schedule because we've been running a bit over time in the presentations, so that rather than coming back at 1:35 as the schedule suggests, I'm going to ask people to come back at about 1:10.  So that we can make sure we can started by that time, having everybody here at 1:00 o'clock would be ideal.  So we will ask the Committee to reconvene at 1:00 and we'll get started after that as possible.

    Lunch for the Committee and the speakers is in the room where you had coffee, and the cafeteria is available to everyone else.  1:00 o'clock.

    [Whereupon, at 11:57 a.m., there was a luncheon recess.]


A F T E R N O O N   S E S S I O N

(1:07 p.m.)

    DR. GARZA:  We can get started.  Our next speaker--can I have all our guests please take their seats?  I think the Committee is at the table.

    Our next speaker is Dr. Jon Tyson, who is a professor of pediatrics, obstetrics, internal medicine and epidemiology at the University of Texas Medical School in Houston and the School of Public Health.  Dr. Tyson will not be able to stay with us tomorrow, so I want to make sure that the Committee members ask all questions or clarify any outstanding issues that they might have related to the topic of growth data for preterm infants, because in fact Jon will be leaving soon after his presentation.  So it's important that you try to get your questions to him before he returns to Houston.

    Jon, thank you very much for coming.

    DR. TYSON:  Thank you very much.

    Well, as a neonatologist and epidemiologist, I'm going to try to make the case today that if what you mean by normal growth is desirable or healthy growth, that the evaluation of early growth in preterm infants will necessarily involve evaluation of health and development.  I'm going to try to go through this in a orderly fashion to promote a rational and evidence-based decision making.

    The first question is:  can the growth rate sustained by a new formula be adequately assessed using published growth norms?

    This is a growth curve from the Neonatal Research Network, observed rates of growth, are weight--sizes, Dr. Fomon would say--in babies according to their birth weight.  Throughout their hospital stay there is serial information assessed.  Also in length and head circumference and mid-arm circumference for these babies.  There's no data for babies greater than 1,500 grams and no data beyond discharge.  I would add that there in my little handout, there's a website that you can go to where you can, for an individual baby, enter the measurements at birth, and print out a growth curve for that particular baby.

    For preterm babies beyond discharge, this shows what I think is the best available data from the Infant Health and Development Program which had 985 preterm babies, and it provides data for 3 groups:  2,000 to 2,500 grams, 1,250 to 2,000 grams, and less than 1,250, plotted according to post-conceptual age--I think that's really post-menstrual age--up to 3 years of age.

    Now, do these both norms describe normal growth?  And I think we have to be really careful what we mean when we say normal values, because I think that's a term that often causes confusion.  I actually try to avoid that word.  I see it used sometimes as referring to values that are expected or typical, typical values, values that are not associated with adverse outcomes or low-risk values, values that do not cause adverse outcomes, healthy or optimal values, and values for which intervention has not been demonstrated to be beneficial.  I think it's really important that we keep clear what we're talking about.  At best these growth grids describe what are typical values.

    How should they be used?  I think that they're a useful clinical tool to assess the growth pattern of individual babies.  Whether they are appropriate regulatory standard to evaluating the formula, I'm sure the answer to that is no.  They provide neither the optimal values nor an adequate basis to compare the growth with new formula to conventional formulas.

    As I've spelled out in the handout, if you compare the growth for the new formula to one of these growth grids, what you describe as an effect on growth might be due to any of a large number of factors, including intervening changes in care and outcome since the growth grid was developed, the fact that in most studies you're using selected patients, whereas the growth grids are based on all patients; a myriad of differences between centers, and the opportunity for bias in patient care, selection, care and assessment in evaluating new formulas.  And I think it's particularly important when the sponsor or the investigator has a financial or even a professional interest in the outcome of the studies to attempt to avoid bias.

    I think we also have to ask ourselves whether the statistical tests that are commonly done in evaluating growth studies are misused and then misinterpreted in assessing interventions using historical controls.  The babies in feeding studies and the babies in these norms that we see for preterm babies and for term babies, are clearly not a random or even a representative population of the sample of the same population.  Moreover, a p-value of less than .05 is often taken to mean that the difference is due to the intervention, when it may be due to any of a number of factors a difference in population, differences in the way the populations are assessed, et cetera.

    So I think we should be asking the question, why use historical controls at all to evaluate new formulas for preterm infants or for term infants as well.  For every other intervention that we talk of in medical care, the randomized trial is the gold standard.  The concurrent cohort, carefully done, carefully studies, is a silver standard.  Historical control is a bronze standard.  Why do we want to use the bronze standard?  And these are also issues in concurrent controls as well.

    Should carefully designed randomized trials be required?  I think the answer to that is clearly yes with a number of other features designed to minimize random error or systematic error and increase the signal-to-noise ratio, which would include mass caregivers and evaluators, well-standardized evaluations shown to be reliable by the people who do the assessments in the study, effective procedures to avoid attrition, and intention to treat analysis predefine stopping rules in an adequate sample size.  And I would add to that I think a commitment to publish the data at least on the website, no matter what the data show.

    Now, there has been opposition to the use of clinical trials by formula companies that would be charged with this responsibility, and I think part of that opposition is the expense and the feasibility of such trials, and I think with the progress in organization of research effort, that this is more feasible now with lots of neonatal research networks out there, some that already include follow up evaluations in at least some if not all the centers, and by the recognition that you don't--you need only do simple management trials.

    Most people, when they think about randomized trials are very expensive, are thinking about the usual traditional kind of explanatory trial, which are designed to determine whether therapies work in ideal or restricted circumstances, or that are designed to define the mechanisms of action.  Management trials, on the other hand, or so-called effectiveness trials, are designed to determine whether therapies work under routine clinical circumstances, so all the effort that goes into trying to control all the co-interventions in explanatory trials is inappropriate in a management trial.

    Who should be enrolled?  Who should be excluded?  I think you want to enroll representative sample of the babies for whom the formula is intended or at least the highest risk group.  So you want to include any babies who make up an important part of that population, small for gestational age babies, very sick babies, twins, et cetera.  You would want to exclude relatively few infants, say the babies who have major congenital anomalies or overt nonbacterial infections.

    Should infants fed their mothers' milk be included?  I don't think it's absolutely necessary, but I think it's highly desirable to increase the generalizability of the results of the trial because a large proportion of all preterm babies are fed at least some of their mother's milk, and also to help identify limitations of the formula, and areas for potential advances based on the benefits of mother's milk over formula after adjusting as best feasible for other factors.

    What assessments should be performed?  I think, obviously, body composition or biochemical, physiologic or functional variables need be considered, but most of the time those would have been studies in prior explanatory trials.  There might be some need to get some of that in some of the patients though.

    What about health outcomes?  Well, I think one variable that has to be considered is the percent of infants with necrotizing enterocolitis.  This is a serious disorder with a mortality exceeding 50 percent in surgically treated extremely low-birth weight babies, and it may well be related to feeding.  Death is a competing variable for necrotizing enterocolitis.  You have to live long enough to get NEC, so you would need also to look at the composite outcome of death or necrotizing enterocolitis.  We also have to be worried when we feed babies with chronic lung disease that rapid growth may not be attainable or even desirable.  If you have marginal pulmonary sufficiency, how fast do you want that baby to grow?  So you would like to know about the combination of death or prolonged mechanical ventilation.

    Neuro developmental outcomes I think are at least highly desirable if not mandatory, because first growth and development may be differentially affected.  You need to exclude adverse outcomes on development, even in the presence of good growth rates.  I think also we need this information to better define the optimal growth rate and the appropriate goals for growth rates sustained by formulas for infants with or without serious illness.

    While growth assessments, weight, length, head circumference and weight-length ratio I think are essential, there may be others as well.  What minimum period of assessment is needed, I think we should remember that a reliable identification of major neural developmental impairments is probably not possible any earlier than 18 months adjusted age, that is, post term, and this would allow evaluation of potential late effects beneficial or hazardous on time-limited interventions given in the NICU or later.

    What standard should be used in judging the growth of preterm infants fed new formula?  The American Academy of Pediatrics has said that the goal should be to achieve rates in an extrauterine environment like those that would have been achieve in utero had the baby not been delivered early.  But we have to wonder if this is really the right goal.  We have some uncertainty about what this rate is currently and we can discuss why.  If we can only measure gestational age well, we could do this better.  Currently we think that it's about 15 to 17 grams per kilo per day weight gain, about 1.1 centimeters per week in length, and about .7 centimeters increase in head circumference.  We again need to ask is this an appropriate goal for infants with severe lung disease, and we also had this observation of persistent growth deficits after reaching full feedings.  So even though we can get babies to grow rapidly, once they get to full feedings, there's this long period of time when they're growing poorly as they recover from illness after birth, and this shows you data for the neonatal research network for babies of different gestational ages, 24 to 25 weeks, 26 to 27 or 28 to 29 in relationship to a so-called growth grid that Alexander published.  And you can see that the babies don't do that bad once they start growing and taking a full intake, but they end up with most of them smaller than the 10th percentile for babies developing in utero with the same gestational age.

    Whatever goal we try to take we have to think of it as provisional, but we could ask, should the current standard for judging preterm formulas be the formula that sustains the best catch-up growth, and that could be that the weight, length and head circumference and the body proportions would be most like that of term infants of the same adjusted age, providing there were no adverse effects on the health or development through 18 months as identified in a well-designed trial.

    How many infants would you need to study to assess a new formula?  This is a really complex and important generic issue in assessing intervention, any intervention where there may be an uncommon but serious potential hazard like necrotizing enterocolitis, and I'm going to spend some time on this even though this may seem to you like a statistical issue only, I think it's an important practical issue, because the kind of things I've said would make feeding studies so large that formula companies or indeed the NIH may be unwilling to fund these.  So I want to try and see if we can find some way to address this.

    So somebody may say, well, wait a minute.  The old formulas have not actually been tested that well, and I've got a new formula here that has strong a priori evidence and rationale for using it.  Say it has a component that's provided before birth across the placenta and in human milk after birth, not given in prior formulas.  It's not well synthesized from precursors in preterm babies.  And we think it's important for healthy development.

    I think even in that circumstance you still need to rule out the possibility that there are important unrecognized hazards of this formula and I'm going to try to list what I think those are or the most important.

    The first would be an absolute increase of at least 3 to 7 percent or more in major adverse neonatal outcomes, particularly necrotizing enterocolitis.  A 3 percent absolute increase corresponds to what's called the number needed to treat of 33.  That is, for every 33 babies that you this formula to, you would cause one baby to have necrotizing enterocolitis.  I think that would be unacceptable even if all the other babies benefited in growth or perhaps even in development.

    A second would be a reduction in developmental quotient at 18 months of a quarter of the standard deviation or more.  That's the mean developmental quotient of 18 months and reduction of a quarter of a standard deviation or more.  If you observe that, that would substantially increase the number of preterm infants with a deficient or marginal IQ that would be eligible later for educational intervention programs.  In the neonatal network this would correspond to a reduction of almost 5 points on either of the Bayley subscores.

    Third would be a reduction of a quarter of a standard deviation in length or head circumference at 18 months.  And this of course is arbitrary, but at least after recovery from serious illness, there's no apparent benefit of slow growth so that I would think a modest decrease in length or head circumference, if not weight, could be seen as presumptive evidence of harm.  And this would correspond in the neonatal network in 18 months to about 250 grams in weight, 1-1/4 centimeters in length and a half a centimeter in head circumferences.

    Now, you may think that this is too small to look at, but I would reassure you that the sample size needed to assess necrotizing enterocolitis, if you use a sample size that's large enough for that, you can evaluate very small effects on growth.

    Now, if you take a conventional approach to sample size, you would need 315 per group to have 80 percent power to identify a quarter of a standard deviation difference in either development or growth or size at 18 months, and an alpha error of .05, assuming you lose fewer than 20 percent of kids to follow up.  The power to identify an increase in necrotizing enterocolitis would be 78 percent, for a large increase, 7 percent, that would be a doubling of the right of necrotizing enterocolitis in the neonatal research network.  It would fall to only 22 percent for a 3 percent increase, so a really small power to look at a clinically important increase.  If you said you'd like 90 percent power to identify a quarter of a standard deviation difference of 18 months, you'd need 421 per group.  Your power to identify an increase in NEC would still be only 30 percent for a 3 percent increase in NEC.

    What can you do about this?  One potential way to address this is a non-inferiority trial.  For the sake of time I'm not going to talk about that.

    Another, and I think this is, practically speaking, the most attractive option, is to increase the p-value considered statistically significant in evaluating a serious hazard.  As you know, the same p-value, usually p less than .05 is used for benefits and hazards in studies, and this is an arbitrary and not well justified practice.  For a serious hazard like necrotizing enterocolitis, a higher p-value might be justified on multiple grounds.  First is, we know in clinical studies the direction of bias is toward finding benefits rather than looking for harms.  There's a lot more effort put into it in general, and the studies are powered to evaluate benefit rather than harm usually.  But the hazard may be much more important than the benefit.  And as pragmatic evidence, we know that data safety monitoring committees that review the accruing evidence in clinical trials will stop a clinical trial at a much higher value of p for hazard than for benefit.

    The appropriate p value shouldn't depend in part on the cost of drawing the wrong conclusion.  For a serious hazard like NEC, I would contend that we might select a p less than .30.  That would still result in a 70 percent chance or higher that a difference of that magnitude would not occur by chance under the null hypothesis.  If you did this, what you're doing is you're increasing the risk of a false positive conclusion, that is, that you would conclude that the formula causes NEC when in fact it doesn't.  In order to reduce the risk of a false negative conclusion, that is, a conclusion that the formula doesn't cause NEC when in fact it does.

    So if we go back to the numbers we calculated before for benefit, at 315 infants per group, again, that was for 80 percent to look at a .25 SD difference, the power to identify an increase in NEC would be 96 percent very high power for a large increase, 58 percent for a 3 percent increase.  So you're slightly more than 50 percent likely to identify it.  If you use 421 per group, you then get down to a power of about 2 in 3 to identify a 3 percent increase.

    If you found hazards at a p of .30 and benefits at a p less than .05, what would you do?   Well, I think you wouldn't recommend the formula, you'd recommend further study, and that would be a departure from what has been done in usual practice.

    I'm going to skip that one, and just conclude by saying that I hope I've convinced you that the growth of preterm infants should not be assessed in isolation from effects on health and development, that a large trial evaluating growth health and development to 18 months or more is needed to assure that the benefits of any new formula outweigh any hazards in preterm babies and to better define the effects of different growth rates, and the growth rate that we should be looking for in deciding how to design preterm formulas.

    Thank you.

    DR. GARZA:  Thank you very much.  Questions or comments?  Dr. Stallings?

    DR. STALLINGS:  It sounds like you've silenced us pretty well.  You know, we at the beginning, talked about dividing up preterm from term infants completely, and it sounds like, from your presentation, that you really, in the area of growth, that there really isn't anything you learned from term studies that would influence you on preterm.  Would that be a fair--

    DR. TYSON:  Well, I wouldn't say wouldn't influence you, but I don't think you can determine whether a new formula is appropriate for a preterm infant based on observations in term infants.

    DR. STALLINGS:  The other thing I'd like for you to elaborate on a little bit is I think it's often that--you were telling us a bit about who the sample should be, and in essence, the inclusion/exclusion criteria.  Which infants, if you elaborate, which infants should not be in a growth study?  Which preterm infants should not be in a growth study of preterm?

    DR. TYSON:  Well, I think it would be babies with the kind of problem that's very unusual, and that's going to have an overwhelming effect on growth like trisomy 13, growth and mortality, nonbacterial infection.  Other than those things, I think you're talking about 3 percent of babies or something.  The rest of them I would vote to include.

    DR. THUREEN:  Thureen.  Dr. Tyson, I know that in your paper you said you would include growth restricted infants as part of this because they're such a large portion of the population, but that you would substratify those infants for further analysis.

    DR. TYSON:  Yes, right.  You can of course include explanatory evaluations within a management trial, so it might be that that formula has a different effect on those babies.

    DR. THUREEN:  In terms of neuro developmental outcome, do you think it would be fair to exclude infants who had had very high risk factors for significant neuro developmental outcomes, such as intracranial hemorrhage, prolonged asphyxia, evidence of white-matter disease, before the trial even started?

    DR. TYSON:  If the formula is going to be fed to babies with severe asphyxia, then I think you would want to test it in those babies.  For some of those conditions you had mentioned, they would occur after you started the feeding, so like cystic white matter disease you might not identify till 36 weeks post conceptual age or something, and that's really, that's potentially an outcome variable.

    DR. THUREEN:  Would you pair match those infants then at all with other infants who had similar risk factors or known disease that affects neuro developmental outcome, or would you just do a purely prospective randomized trial?

    DR. TYSON:  If you do a large randomized trial, first of all, it gets really cumbersome to try and pair anybody at birth.  As long as you're stratifying by center and maybe a couple of other things like birth weight less than 750, 750 to 1,000, something like that, that you will end up with an approximately equal number of those babies in the two groups, and then you can go back and do an analysis.  If you try to stratify for birth weight, SGA, gender, birth asphyxia, et cetera, you end up with so many huge strata that the study gets really complicated to do.  And I don't feel as strongly about that as most statisticians, but my understanding of the school, most statisticians are towards the minimal prognostic stratification at randomization, and more toward post hoc looking at individual groups who should have been predefined ahead of time which group you were going to look at.  Does that answer your question?

    DR. THUREEN:  That makes sense.  And would you change any of your ideas about how to conduct a study if you are going to look at patients who this is their exclusive formula fed from initial feeding versus studies started when infants really attained full feeding?  Do you think that it makes any difference on how you conduct the study if you're looking at those two issues?  Did that make sense?  Because yours sound like you're referring to infants who may start minimal enteral feedings with the study of formula, rather than waiting until they attain full feeding and then starting from that standpoint?  Do you think it's preferable to do one or the other, or do all of your idea really apply to--

    DR. TYSON:  It depends on when the formula is going to bed.  If it's a formula that's going to be fed in the first week or something like that, I think you want to test it as it's going to be used.  Let's say that it was a formula that was recommended for us from the first feeding.  Let's say for the sake of argument that it cause necrotizing enterocolitis, and you didn't enroll baby, and you didn't start that formula until 3 weeks of age or something, or at a point when the babies were on full feedings, you might miss that effect, so you want to test it as it's going to be used in the real world.  Does that make sense?

    DR. THUREEN:  Yes.  And then lastly, do you believe that there are really no good reference standards for growth in the preterm infant or at least a certain subgroup of preterm infants that may be extremely low birth weight?

    DR. TYSON:  In the Neonatal Research Network, we have research nurses that are doing standardized--that have done standardized evaluations of anthropometry in intervals.  There are huge center differences.  If you try to take the data from any center to apply it to another center, you could easily be mislead just by the center differences.  So I don't see why you would want to use comparisons that would involve center differences or time differences.  This was gathered data 3 years ago when they were using steroids, postnatal steroids more often or less often than they are now.  Why not randomize and get the cleanest--I think the belief that you don't have to use controls, you don't have to use randomized controls, that you can answer the question with fewer patients is an illusion, that at a given number of patients your ability to get a unbiased answer to the question is going to be greater with randomized controls than with historical controls.

    DR. THUREEN:  Thank you.

    DR. GARZA:  Heubi?

    DR. HEUBI:  I think, Jon, this is all very interesting.  I wanted to actually ask you a couple of questions, and you'd have to follow my line of thought here.

    The number of subjects that you would entertain as being appropriate for a study is about 10 times what a typical current formula study would be.

    DR. TYSON:  Right.

    DR. HEUBI:  With that in mind, knowing what you knowledge is of the Neonatal Network, would the Neonatal Network sponsor studies like this because this is specific to preterm infants and it would be a potential rationale to study with partnering between industry and NIH money to do studies like this?

    DR. TYSON:  The Neonatal Network has a protocol review committee and standard procedures for--there's no reason that couldn't be proposed and seriously considered or accepted if it went through all those things.  There are lots of networks out there.  In Texas we've started a Texas network, and there's an Oxford network, and the Canadians have a network, and the Australians have a network, and I'm sure there's networks developing in the United Kingdom if they're not already in place.  So there are a lot of people willing to do this.  The hardest part is going to be the 18-month follow up.  That's a lot harder than studying NEC, but as more and more people say, "If I'm going to take care of babies this size, I need to know how they turn out.  That means I have to have a really well functioning follow up system.  So I think there are going to be people out there that can do it at much lower cost than if you just went to them on day one and said, "We're going to fund your whole follow-up effort in order to answer this one question.

    DR. HEUBI:  But I was looking at it from the standpoint of it being economically more attractive to industry to do studies through the network that exists through the NIH because some of the infrastructure already existed and was already being paid for in part by federal money.

    DR. TYSON:  Right.  Kathleen Kennedy and I proposed to the network a feeding study, and one of the things--and I'm glad I was involved in this effort--the business about the p-values that I presented today, that was aware stumbling block to us when we got to the--the reviewers really liked it and the statisticians said, "Well, you're going to have to study 6,000 babies or something," and I was working under the same mindset I had been before, well, that's if you want to look at a p less than .05.  But why not accept a higher p-value, or as I was going to say on the last slide, predefine what you think is an acceptable ratio of the number of babies who benefit to the number of babies that are going to be harmed, and then test that and say, does the number of babies who are helped by improved growth or development,  relative to the number who are harmed by NEC or worsening BPD, if indeed that's a hazard, is that an acceptable ratio or not?  And try to design studies not to look at one outcome variable but at the relationship of one or two variables or perhaps even more.  And I think that's a cutting edge issue in the experimental design that the time is ripe to do now.

    DR. HEUBI:  And this is a circumstance where clearly DSMB or some monitoring board, during the--

    DR. TYSON:  Yes.

    DR. HEUBI:  --would be pretty accepted.

    DR. TYSON:  Right.

    DR. DENNE:  Jon, you've made an argument about following preterm infants out to 18 months for neuro developmental outcome and I understand the rationale for that argument.  How do you feel about similar studies in terminants?

    DR. TYSON:  Do you mean randomized trials or--

    DR. DENNE:  No.  I mean the necessity for evaluating neuro developmental outcome at 18 months in studies of new term formulas?

    DR. TYSON:  I don't see why not.  I mean everybody in this room thinks nutrition's really important.  You take these formulas and you feed them to millions of babies.  Why test it in only 50 or 60 babies?  Why allow it to go on the market without knowing does it have beneficial or harmful effects as far as you can tell with an appropriate sized study in term babies?

    DR. ANDERSON:  Anderson.  In your discussion of NEC much of the calculations were done based on a baseline rate of 7 percent.  How would you feel about some of the adverse event monitoring being done not necessarily in the context of a randomized clinical trial, but against some fixed standard?  That is an infant formula would be unacceptable if it produced a rate of NEC above 10 percent.

    DR. TYSON:  There are several problems with that.  One is there is a lot of institution variation in NEC and with any institutions, there are periods when the NEC rate really goes up, and when it comes down, nobody quite understands that.

    And finally you have the potential problem of bias.  There have been studies, for example, they took x-rays of kids with NEC and x-rays of kids thought not to have NEC, going to every pediatric radiologist in California, and they found this incredible variability in what was called NEC and what wasn't.  And so when you're in the context of a study like that, I just think the opportunity for bias is there, and that your ability to relate that to some other institution in a different point in time I just don't think is worth the effort.

    DR. STALLINGS:  Stallings.  A slightly different angle.  And we talked this morning a little bit about, or inferred a little bit about term babies growing too fast, and you know, that's of concern, but certainly for my clinical time, the worry over preterm babies growing too fast, and I can remember bedside debates about too fast and it's only fat and it's no brain and it's no muscle and all of that.  I don't think we have nearly as good a handle on the body composition component of the preterm babies.  And then you add to all of that the concerns that we're all reading more and more about, is early postnatal growth a very--I mean we know it's an important time, but a differently very important time in lifelong health.  Would you make a few comments?  I know you made the caveat about babies with really chronic lung disease and concern, and that's really related to CO2 retention, but put those babies aside, and can we grow preterm babies too fast, and how do we determine when we're approaching that?

    DR. TYSON:  Well, I clearly don't have the answer to those questions.  It seems to me the only way we can get it is to randomize babies to different feeding regimens that produce different growth rates and see who turns out to have the best health and development.  That might be a different answer for the really sick babies and the healthy babies.

    DR. GARZA:  In terms of randomized trials, the implicit assumption is that in fact it will be a concurring group of formula-fed infants.  The other control group that's been discussed by this committee in using breast-fed infants, in this case perhaps infants fed human milk that may be fortified or enhanced with other nutrients, what's more important in your perspective, a randomized trial, which obviously you can't do unless you're willing to go down the hospital hallway and say, "Are you going to breast feed or are you going to bottle feed," which obviously is not feasible.

    DR. TYSON:  So among those whose mothers commit to breast feeding those babies to randomize them to the new formula versus the old formula for supplementation, when they need supplementation, so you can stratify pre hoc for intent to breast feed or not.  Then randomize within those strata.

    DR. GARZA:  And then use the amount of formula you're feeding them as part of the co-variants in that?

    DR. TYSON:  Yes.

    DR. GARZA:  Any other questions?

    [No response.]

    DR. TYSON:  Thank you.

    DR. GARZA:  All right.  Thank you very much, Jon.

    DR. GARZA:  We're going to move from preterm infants to generic analytical issues.  And Dr. Frongillo, who you've met before, is on next.  I remind everybody Ed is from Cornell University Nutritional sciences.

    [Pause]

    DR. FRONGILLO:  So I was asked to look at analytical issues related to the evaluation of formula, and I was asked to look at 5 issues, the sensitivity and usefulness of several types of comparisons, the potential for evaluating a meaningful difference in the growth increments per day, the impact of transformations of raw data measurements into normalized indices, advantages and disadvantages of comparing with various reference data sets, and finally, circumstances that might favor one type of comparison to another.

    So the first issue is sensitivity and usefulness of several types of comparisons.  In 1988 the Academy of Pediatrics had issued some guidelines for determining physical growth, for evaluating new infant formula, and just to remind you, the suggestion there was to look at weight gain over the first 4 months, to look at measurements at 14, 60 and 120 days, and to look at rates of weight gain in grams per day over the intervals that are implied by that, 14 to 60, 60 to 120, and then the whole period 14 to 120.

    Some possible comparison groups to infants who are receiving new infant formula would be infants who are randomize to receive a standard established infant formula or alternately infants whose growth is represented in a reference, infants whose measurements are in a currently available data set, and finally, another possibility, infants whose measurements are in a historical data set.  So I'm going to go through each one in turn.

    The advantages--I'll use pluses for advantage and negatives for disadvantages.  So in terms of advantages for randomized clinical studies, is first of all theoretical close control of the factors that might influence the outcome, in this case, weight gain as an example, the incorporation of design feeders to minimize known sources of bias.  This might be something like doing stratification based on certain characteristics of the infant, perhaps size at birth or something like that, and that the probability statement is justified--this gets to sort of statistical philosophy here--but that the probability statement is justified on the design itself as well as the statistical model that's used.

    And these issues are important, especially the first one in light of the fact that we might expect that, for example, if there was challenges in recruiting infants into a study, if it was difficult to recruit infants into a study whose caregivers are interested in formula feeding, we might be concerned about selection biases or the differential characteristics of people who agree to participate in such a study.  And so being able to control for those would be important.

    The disadvantage is that the sample size is going to be larger than in the other approaches.  I've said here the sample size would be twice as large.  That's under the assumption that the other data set that one might be compared, whether it's a reference or a historical control, contributes no sampling variability.  It's even larger if we're willing to assume that, for example, a reference has no sampling variability.  And then it might be as great as a difference of a factor of 4.  So the sample size then in a randomized control study would be potentially much larger than if we didn't do that.  If we compare it to a reference the advantages would be that we can compare to a known established reference, something we know and love.  One group of infants would be measured.  Therefore, saving on sample size and work.  The negatives would be that the new cohort on new infant formula may be different in some important ways from the reference sample, and that the reference may exhibit somewhat different characteristics in terms of the growth patterns from the new cohort.  And I already showed you an example of that earlier when I was talking about WHO growth reference, and we know breast-fed infants grow differently than formula-fed infants, and we might be concerned about whether the reference is the appropriate sort of comparitor for the current cohort.

    And in relation to the current U.S. reference, it's a cross-sectional reference, and if our interest is in growth increments, we would like something that would characterize the variability in growth increments.

    And then finally, reference data are not free of sampling error.  Reference data have a finite number of infants measured in any particular time, and if we're doing a comparison between a new cohort and a reference, we really should be taking into account the sampling error that's present for both.

    And the other choices had to do with use of currently available or perhaps historical data.  These are data that--existing data that are collected earlier in time, so I'll consider those together, obviously advantages if this minimizes data collection, but the disadvantage would be that the characteristics of this new cohort might differ from the current cohort in some important ways.

    The second issue was the potential for evaluating a meaningful difference in growth increments per day.  So I want to talk about what I call the smallest meaningful difference, abbreviated SMD.  This is not the difference expected or what's previously been found. What it is, is the smallest difference that we think would be substantively important.  It's a judgment about what do we think is the smallest difference that would matter.  And this is inherently--we should be thinking of this as a population rather than individual characteristic.  In other words, the population perspective is most salient.  We're not trying to judge whether one infant is growing differently than another.  We're trying to judge whether a whole group of infants, potentially a large group of infants fed on an infant formula would grow differently than a group or population of infants fed on some other means.

    The previous recommendation from that 1988 guideline was 3 grams per day, which amounts to 318 grams over the 14 to 121-day period.  And I tried in the background paper to sort of get some perspective on how big is this difference.  So if we look at the increments in the Iowa and Fels data, this difference here is about the same difference in those reference data between the 25th and the 50th or the 50th and the 75th percentile, to give you an idea of how big that is.  If we think about high and low altitude, which is one of the factors that causes the largest differences in birth weight, differences at high altitude are in the order of 350 grams or so.  So that difference is about the same or a little bit bigger than this.

    And we're always concerned about the effect of smoking on birth weight, and this difference of 318 grams is about 50 percent larger than that difference.

    Another perspective we can get is if we look at some prior results.  These are some results from a paper by Roche, et al. in 1993.  They compared for males and females growth on 3 different infant formulas.  They also had breast-fed infants that's in the paper, but I left that out here for simplicity.  And this is the growth that occurred from zero to 4 months, and the total number of infants was about 260.  And so these are the values of growth that occurred during those periods for male and females.  What I was particularly interested in was having us look at the differences.

    So the differences among those 3 groups, for males were 210, 270, and 480, and for females were 110, minus 10 and then 100.  So you can see that first of all the differences for the males are much larger than the difference for females, and these differences here are smaller than the 318 grams over roughly the same period, and this difference is quite a bit larger than that.

    And so it seemed to me, just from looking at this, we might say, well, these differences maybe aren't too big, but my guess is most of us would be concerned about differences of around 200 grams over this period.

    The previous recommendation for sample size that was needed per group for the smallest meaningful difference was based on 80 power, testing at .05, a one-tail test and this standard deviation.  And the result that was given was for 3 grams per day, a sample size of 28 was needed.  So I wanted to comment on this.

    First of all, as Jon Tyson pointed out in the talk just before, we're trying to do what's like a bio-equivalence.  We're trying to make a judgment whether things are really producing about the same effect or whether there's evidence that something is really different.  In that regard we're particularly concerned about power, because if there was a real difference in growth, the power represents the probability that we would find that difference.  And 80 percent power means one fifth of the time we wouldn't detect the difference that was really there.  My guess is that's a risk that we're probably not willing top take.  And so I'll argue that 90 percent power is probably better.

    Also this used a one-tail test and there's already been comments this morning that we're probably concerned about differences in growth that are in either direction, whether infants grow too slowly or whether infants are growing too quickly in terms of weight, so a two-tail test is probably more appropriate.  And finally, this figure was based on a personal communication with one of the investigators at the time, and since then a larger standard deviation has been published.  So if we go to 90 percent power, the same test for the p-value, a two-tail test and a slightly higher standard deviation, then I've given a table here.  In the background papers there's even more variance of this.

    But this shows that first of all at 3 grams per day, we would need 67, not 28 under these parameters.  And then if we go, obviously, to smaller and smaller meaningful differences, then we're going to need larger and larger samples to be sure that we would detect them.

    The third issue was the impact of transformations if we go from raw data to normalized indices.  The idea of Z-scores is that we match measurements with reference values for age and sex.  The Z-scorer then takes that measurement, compares it to the reference median for that age and sex, divides by the appropriate standard deviation.  The primary purpose for Z-scores is descriptive.  It allows  combining together ages and sexes so that we can get one overall description of a where a population is or a group is. This assumes that the pattern of growth in the sampled population is expected to be the same as in the referenced population.

    In terms of the application of this to evaluating new infant formulas, first of all, age adjustment would usually not be needed if the measurements are taken at the prespecified ages, and I think Dr. Fomon gave us the motivation and the example we need as to how you do that in getting measurements within plus or minus 4 days, which is remarkable.  If we can do that, then we really wouldn't need to do age adjustment. And then if age adjustment was needed--because we're all not that good--what we would probably prefer to do is include co-variates for age or interpolate and extrapolate the time series rather than converting to Z-scores as a means to deal with this.

    And finally, as I've already said, males and females would typically be analyzed separately because there are sex differences in the growth response.  So the need to do sex adjustment from Z-scores isn't even there.

    The fourth issue was advantages and disadvantages of comparing with the various reference data sets.  References are a tool to provide a common basis of comparison, and as I said before, the referenced population should reflect the growth that's expected for the children that we have under study.  The reviews that were done by NCHS and CDC and also be the WHO in the early '90s, led to the development of new references, and so we need to consider this in the context of those.

    I've made a table here which shows what the possible references might be.  The CDC 2000 reference has both breast-fed and formula-fed infants, but it's cross-sectional, which is a limitation in its application for this particular purpose.  The WHO data has breast-fed infants.  It's longitudinal, which is good, but besides having breast-fed infants, it won't be available for a while.  The Iowa data and Iowa Fels data combine together breast-fed and formula-fed infants in a longitudinal, and so probably at the current time, if we had to do this today, we'd probably choose to use this as a reference if we wanted to make a comparison for descriptive purposes.

    And then finally circumstances favoring one type of comparison to another, it seemed to me that we would probably want to consider using currently available data if we wanted to test several new infant formulas nearing us in time.  If we were doing a set of tests over some period of time that's relatively close, then perhaps it would be efficient to sample from the same population for the whole series of studies, but without having to do repeated sampling for a comparison group.  The concern always will be that the characteristics of later samples might differ from the earlier samples.

    So in summary, this review suggests the for primary analyses, the main analyses to answer the question, that it probably is appropriate to choose a design with a randomized concurrent comparison group.  Otherwise, we're not going to be really sure that the growth of the infants on the new formula would really be expected to be the same as from some previous comparison, and this is of a concern about the characteristics of the samples and the populations, and it's particularly concerned potentially because of selection issues as to who actually gets into and agrees to be in the study.

    For descriptive purposes, it would be useful to compare the attained weight for all groups at each measured age with the current U.S. reference, the 2000 reference, and to compare for descriptive purposes, rates of weight gain with the Iowa or Iowa Fels data.

    And then continuing the summary, the sample size per group of 28 is clearly, in my view, without sufficient power for meaningful differences of even 3 grams per day, and a larger sample size is needed even if we stay with that guideline.  The smallest meaningful difference might be smaller than that.  It might be perhaps 2 grams per day.  I think some might argue for something smaller than that, something larger than that.  That's something that would have to be considered, but in any event, this would imply a much larger sample size, and determining the smallest meaningful difference should be based on the best understanding we have of the biology, but also on the required regulatory, clinical and public health decisions that are going to be made with this information.

    And finally, to Anna Milman, who's an undergraduate at Cornell who helped me in the preparation of the paper.

    DR. GARZA:  Any questions or comments? Dr. Stallings?

    DR. STALLINGS:  Talk tome a little bit about the--we heard some of this in the other talk to, the differences in comparing just the needs and medians and in growth studies looking at the number of children that would be, for example, less than the fifth or less than the third, some cutoff that we also think of?  I've been--you know, we tend to analyze the data just looking at the group means and differences, rather than a secondary analysis, or often we do these things that we don't have any idea because of the way the data presented, how many children, how many infants might have been in the tails of the extreme.  Any thoughts on that from a statistical point of view or an approach to believing that a formula supporting growth properly?

    DR. FRONGILLO:  No, that's exactly right.  This whole discussion that I've just given is based solely on the idea that under a new formula as opposed to an existing formula, say, that what we would be looking for and all we would care about is shifts in the whole distribution that could be captured by differences in the mean.  Now, if there are differences of other kinds like differences in the variability or that if there was concern that maybe with a new formula for most infants it does fine, but there is a small subset of infants that don't do well at all, this approach is not designed to detect that. And Jon Tyson's thoughts about that I think are important, in that if we're concerned about what's happened at the extremes, that's going to inevitably require much larger sample sizes.

    We know that any time we, for example, categorize infants into whether they've done okay versus not done well at all, and we have binary data, the sample sizes go way, way up, because in essence we're looking for those rare, rare events.

    DR. GARZA:  Ed, a related question to the one just asked by Dr. Stallings.  In a clinical study one might be interested in individual outcomes, so that one doesn't have to wait until the end to decide that the child did not grow well.  What sort of analytical issues would this present when you want to be able to monitor for safety reasons the ongoing growth of infants, of individual infants?

    DR. FRONGILLO:  You mean during the conduct of a trial perhaps?

    DR. GARZA:  During the conduct of a trial.

    DR. FRONGILLO:  Well, surely, you would want to have some guidelines in place so that if a child is growing extremely poorly, that some intervention is made.  If you're starting at 14 days, you're not going to want to wait until 120 days to intervene, to decide that this child just is not thriving at all.  So that would have to be built in.

    DR. GARZA:  With that thought in mind then, you talked about perhaps 3 grams being--sample size of 28 for 3 grams being definitely inappropriate or 2 grams obviously more so.  But that assumes a linear rate of growth through this period, which several speakers have described as highly nonlinear.  Should we be looking at narrower periods and different amounts of growth during those narrower periods than 2 or 3 grams over that longer period, where one assumes then the linear pattern that doesn't exist?

    DR. FRONGILLO:  Yes, I would say so.  And the 1988 guideline has suggested that, what was it, 14 to 42--now I can't remember--that intervening interval would be looked at.  The standard deviations there are a little bit higher.  The growth rates on average are higher, and there are also more variables. So if one was particularly concerned about say the earlier part--if there were three measurements and one was concerned about the earliest period, he'd need slightly larger sample sizes based on that.  But, yes, one would certainly want to be able to evaluate growth.  And I think Dr. Fomon's admonition that it's the earliest--the time when probably most concern and maybe most sensitive to different formula characteristics may be the period when the fastest growth is occurring, and that would be the earlier part of that period.

    DR. GARZA:  And yet--with that idea in mind then, when do you have to worry about regression towards the mean?  I mean, so that in fact if a child is born large, do they begin to downsize immediately, or is that not a phenomena that we should worry about in the first 3 months of life?

    DR. FRONGILLO:  No, I think that is a phenomena we should be worried about, and that ties to what I was saying about selection issues.  If, for example, for whatever reason, the parents who agree to enroll their infant into a trial are parents whose infants tend to be small or have lower birth weights, for example.  We would expect infants with lower birth weights to grow faster.  That's what we know, and if they're at the high end of the birth weight distribution, they're going to grow more slowly, and so one of the important advantages of having the concurrent randomized comparison group is if any regression that occurs because of that kind of selection, that will be common to both groups.  Any other way of doing it won't be guaranteed to pick that up.

    DR. GARZA:  Any other questions?  Dr. Stallings?

    DR. STALLINGS:  One last technical question while we have you here.  Like Bert was saying, maybe the 2 grams across the whole period is an optimal.  Should we consider different rates for the gender since that's a pretty well appreciated fact as well to do sample on boys versus girls, different sample sizes in designing the study, or is that not--

    DR. FRONGILLO:  I guess my initial reaction is figuring out what the smallest meaningful difference once is going to be difficult enough.

    DR. STALLINGS:  Forget it, okay.

    [Laughter.]

    DR. FRONGILLO:  And now you want to do it twice.

    DR. STALLINGS:  I was just thinking about how if we were, if we got to design sort of perfect studies in the young group, the youngest age group, and thinking about what we heard this morning, where there's at least one good example where the boys didn't try it as well as the girls, and that if we had been in that unique place where it had been a balanced gender sample, we might have missed the boys.  Anyway, I was just--

    DR. FRONGILLO:  That's right.  I mean from data I've seen, from the Iowa Fels data the girls are slightly less--their growth rates are slightly less variable than the males.  On the other hand, at least that example from the Roche, et al. data--others have far more knowledge than I do--suggest that males are more differentially responsive to differences in feeding mode, so if both of those are in place, that suggests that on the grounds of variability, a smaller sample size would be needed for females, and on the grounds of variability, a larger for males.  On the other hand, the differences we see for males might be larger than the important differences we see for females.  So some thinking about that is probably--certainly would be helpful.

    DR. GARZA:  Dr. Denne then Dr. Heubi.

    DR. DENNE:  I just wanted to clarify what you said about the growth of large infants and small infants, that larger infants grow more slowly and smaller infants grow more quickly.  My understanding of the growth curves is that the 10th percentile and 90th percentile are fairly parallel in terms of their slope.  So I guess I--

    DR. FRONGILLO:  On a population basis infants who are born at high birth weights tend to grow a little bit more slowly than infants who are a low birth weight, so you actually see a convergence of the distributions a bit.  So that the variability at birth tends to be a little bit larger cross-sectionally and little variability after that.  It's not a factor that comes into play anywhere after the immediate neonatal period.  But for this purpose that's relevant.

    DR. GARZA:  Heubi.

    DR. HEUBI:  You showed us an example of a circumstance of where the smallest meaningful difference was in fact probably too large.  I guess the question I have is do we know from formulas that are currently on the market, how many of them would have fallen out if you'd applied more strict criteria to them in terms of their trials?

    DR. FRONGILLO:  I asked exactly that question at lunchtime, and at least the people at my table didn't know the answer to that.

    [Laughter.]

    DR. HEUBI:  So we don't know the answer to that question.

    DR. FRONGILLO:  I think that it would be helpful to have an idea of how often is it that we see differences that are larger than whatever the smallest meaningful difference we happen to be fixated on at the moment, how often is it that when trials are done do we actually see differences that exceed that and therefore would cause worry?  So knowing something about the distribution of the differences that are seen when studies are done would probably be helpful.  The answer I got was that's not something that has to be publicly reported, and so we don't have a database of information available.

    DR. HEUBI:  And I guess my follow up question was, I'm actually surprised that they agreed to a one-tail test to begin with when they actually made these recommendations.  Do you have any comment about that?

    DR. FRONGILLO:  On why they agreed to that?

    DR. HEUBI:  Why they decided that one-tail T-test was adequate.

    DR. FRONGILLO:  Well, I think that if your concern was we have currently available formula that are fine and you're introducing a new formula and you're concerned about under nutrition, that it doesn't meet the infants' nutritional needs, then it would sense to have a one-tail test because you're concerned about deviations that are in one direction.  I guess I'm suggesting that where we are now in the public health concerns we have about obesity, it's probably the case that we would be concerned in either direction.  So at the time it may have been a perfectly reasonable conclusion that isn't necessarily the one we should reach now.

    DR. HEUBI:  I guess it's just been pounded in me so often not to use a one-tail test for anything, that it just seems like it makes no sense.

    DR. GARZA:  You don't have to answer that.

    [Laughter.]

    DR. GARZA:  Any other questions?

    [No response.]

    DR. GARZA:  All right.  Thank you very much.  You may not want to leave the podium just yet.

    We have next on the agenda is an opportunity to bring all the previous speakers forward for a question and answer period.  And what I'd like to ask the Committee is whether you would find that helpful or whether we should just move on to the next two talks and thus bring everybody up at one time.  I need a sense from you as to what you would find most helpful.

    On with the speakers?  Is that the consensus.  And then we'll just ask everybody to come up.  Is there anyone objecting to that on the Committee?

    [No response.]

    DR. GARZA:  If not, Dr. Benton?  Dr. Duane Benton, who is retired.  I don't know whether there is an emeritus ranking within industry, but certainly you would merit that title if it existed.  So we will refer to him as an emeritus industry representative, at least as an industry employee in the past, but one of high distinction.  And he will be talking about product composition considerations, clinical studies.

    DR. BENTON:  I want to make sure you can hear and other things.  One, I'm not going to--at my age and my position, I'm not going to try to impress you with how fancy I can make slides.  So we aren't going to have any.  I hate to a slide, to a screen.

    Also I think all I'm going the be doing is a critique of my own talk, so I've got some things here I don't want you to forget.  I enjoy looking at the audience and if you're falling asleep, I'll start at you, something like that.

    Now, you should have background of why am I up here.  Well, one of my various qualifications is, as Bert said, I'm retired, and they somehow felt that would reduce the conflict of interest.  Obviously one of the senior people.  Sam's a little more senior than I am.

    But in this, what I am drawing from is 25 years when I was director of nutritional research with the Ross Products Division of Abbott Laboratories.  And when I refer to my company, then I'm referring to--some of you used to know it as Ross Labs back in the good old days, and it is now Ross Products Division.

    It was a very good company to work for.  It may not be representative of the industry in its attitude toward physical growth studies, and in fact what I say may not be representative of what the industry or the Ross Products Division presently do.  In other words, when I went into this, I asked the people at Ross.  They were calling me up because Bert couldn't find me, because I was wandering around out in Kansas.  And so they called me up and said, well, wouldn't you do this?  And I said, "Do you want me to?  I could be kind of a loose cannon."

    [Laughter.]

    DR. BENTON:  And they said, "Oh, yeah."  The people on the Infant Formula Council said, "That's a great idea."  We'll see.

    [Laughter.]

    DR. BENTON:  Because these are my opinions.  This is not edited material.  I really sort of identified when Benjamin Franklin was asked whether he would write the Declaration of Independence, and he said he made it an issue of point, that he did not write material to be edited by other people.  I can understand that now that I'm in my--other than my wife did try to read it through and tried to figure out whether it made any sense at all.

    Now, the issues in here are, in my 25 years at Ross I was doing all kinds of different things.  I'm a biochemist by training.  I worked in nutrition throughout my professional career, and I was very concerned about safety of our products.  Now, in this we're not supposed to be talking about safety.  We're talking about nutritional adequacy.  When you are in the position that I was in for 25 years you don't know how to draw a line between those.  You remember back when you were a graduate student and there was SMA out on the market that was vitamin B-6 deficient?  And that company was--well, it was a real blow.  Rudy Tomarelli told me that he spent through his whole career going into court cases in relationship to that.  That's the one thing you don't want to do if you are in the position I am.

    So, safety, nutritional adequacy, et cetera, there is no discrimination in there.  The formula has to be good, and you've got to somehow figure out whether it is or it isn't, and if it isn't, you'd better do something about it very fast.

    I was in a company where the aspect of a physical growth study was something that was assumed.  When we were in a meeting and they were talking with the president about, "Are we going to introduce this?"  You know, the answer was, "Where are the growth studies?"  You didn't want to be in the position of not having that kind of information because that was sort of viewed as something not right with it, and we're not going to get out there, and we're not going to be hit with something very undesirable.

    So it's also very helpful to see what I'm saying here and what I've written in the paper, and I'm going to be talking basically to the paper, is directed toward that.  Now, it is not an aspect of you saying, "Oh, well, this didn't sort of turn out right and maybe we ought to do this and a little more of that."  Within the company, clearly, they want to do something.  You can tell them, "No way.  This stuff doesn't look like it's safe."  But tell them, "Oh, we haven't got that done yet, or "We haven't looked into that," or something like that.  You better have looked into it.  You better know what you can as far as the answers to it.

    Now, what we're basically saying about a physical growth study, from our point of view, is that if there were a nutritional deficiency of any sort, some slight depletion of an essential nutrient, that the first thing we would expect to show up would be a diminution in weight gain.  Now, the other thing that in many cases we would expect to see would be a diminution in intake of the formula.  Dr. Fomon did address to a degree the intake of formula.  Most of the other things are talking about growth of infants and things you can measure on infants and so forth.

    In my evaluation, really, wherever I could, wanted to see if the kids really did take in the formula and would continue to do that, and if I could work that into a study I would.  It's hard.  It's very hard work, but there are ways to find out that you can do that.

    Now, when we progress into--and you now, I'm talking about the studies as such.  Before I talk about the formula and the ingredients, although my job was formula ingredient, I've got to talk about the studies to the degree of what can we expect from them, because it isn't going to make any sense why I would want to run them if I could expect--if I didn't have in mind clearly what we can expect from them and what we cannot expect from them.  And one point that I have tried to make in here, those things that we can measure in a physical growth study, and clearly you've outlined most of them that could be.  I mean we could do innumerable measurements on these infants.  But our attitude, or at least mine was then--I may find somebody that would disagree with me--are that growth is a combination of responses, thousands of them, and those might or might not be measured.  But we need the most sensitive measure of how an infant is growing to detect in this short period of time, and with as small a number of infants as possible, that nothing is going wrong.  And therefore we need the most sensitive, sensitive response within the study.

    With that, one of the things that you people are addressing is, well, shouldn't we use a breast-fed group as the control?  This just sort of knocks me off my seat to think of that.  We know that those breast-fed infants grow differently than any other infants fed on infant formula, that there are real differences of what we can measure as far as growth.  And I'm trying to answer one question.  I'm trying to answer the question, should we put this formula on the market?  I may have made a bad infant formula, and now they do go over--they gain weight or length or whatever at the rate that a breast-fed infant does.  That may be a bad situation.  It's not because the growth of a breast-fed infant is bad.  It is that I may have gotten there for a very, very different reason.  I may have depleted a nutrient.  I may have caused a metabolic imbalance.  I may have done a whole bunch of things because you cannot equate a weight gain or a body weight or something like that in a infant and say, "These two are the same.  Therefore, their metabolic patterns are the same.  Their nutrient contents are the same."  No, they're not.

    If I get one that is like a breast-fed infant, I would probably conclude that there was something wrong with my formula, and that's basically what we do.  I mean we look at all of the information that we can collect and is practical from that point of view.  When you come down and you make a decision and you're going to make advice to your company, you're saying, "Is this safe when you compare it to a present formula that we believe is safe, and we have long experience?"  If it isn't, you don't have a formula, and the issue is that you're going to go back there and you're going to change the formula so you fit into that.

    Now, clearly, there are innumerable research studies that can be done, and you people would probably look at the budgets that we've used for our research studies over the years because there are extensive ones, and you'd think, "Hey, that's a pretty good looking budget."  That wouldn't even come close to trying to answer the question that you are posing in the sense of can we somehow make infant formula feeding similar to that of the breast-fed infant.  That is an enormous research project.  I'm not really sure how I would try to undertake it if I were forced to try that.  I'm sort of glad I'm not.

    The other thing I'm going to say here is that I'm not going to discuss the premature infant.  Dr. Tyson gave a very good talk about the aspect of doing that.  I guess I would say that if he were to implement and be able to implement his thing about the risk of necrotizing enterocolitis, all of our formula bottles would say the surgeon general has determined that this formula may increase the risk of necrotizing enterocolitis, because I have not seen a formula that in some way or other didn't appear in a particular nursery to increase the risk of NEC.

    Now let's proceed on with this, and another thing that I wanted to address, and Dr. Fomon introduced me to that in his paper and in his talk.  He said that these days it's going to be very, very difficult to start a study with formula fed infants before 28 days.  This is very, very serious from my point of view.  We really depended on Sam for our work, and he started those infants--well, it was not within the first week, but certainly the end of the second week.  and if we had some kind of a nutritional inadequacy which represented the depletion of a nutrient, or I guess you could even say accumulation of a toxin.  Those very early part of the growth would be a very serious loss of that period, would be very, very serious in trying to detect a deficiency, because sometimes around 4 months or 3 months or 4 months, solid foods are going to be introduced, the proportional growth rate is decreasing.  If you had had 3 weeks of feeding of a formula or breast feeding, you might have depleted or repleted stores of a particular nutrient, and I'm not certain that I wouldn't miss a serious problem.

    One of the things that I always fall back on is saying that when I was director I was referring to the vitamin B-6 deficiency in a formula.  Would I have been able to detect that in a clinical study?  I seriously doubt if an infant had been fed for 28 days on a totally adequate infant formula that had been fortified with vitamin B-6, that at the 4-month period of time whether you would be able to detect a weight gain difference, you know, using that formula, because they would have received enough Vitamin B-6 that it pretty might likely have carried them through that deficiency, and we would have never known.  And we well know what happened when it really went out on the market and infants were really fed on that product as the sole source of food from birth.  So I think it will be a great loss if we cannot introduce our infants quickly into the formula feeding when we're testing.

    Another thing that I would point out, at least in our studies--and it was tremendously important to our work with Dr. Fomon and most of our other centers--they had an expert nursing staff.  They observed the infants closely, just as the pediatricians observed the infants closely.  They talked with the mothers.  They resolved observations that the mothers had and so forth.  And I personally believe that nutritional or other types of problems will probably be detected by the nursing staff before we get statistically significant weight differences, which, from our point of view, would be important to be able to stop a study where something was going wrong.  And we have had studies where--that were stopped, anyway.  I don't know whether you would statistically ever have found out whether something was going wrong or not.  We didn't want to know.

    One of the things that is in here and that I have not really discussed, but I would just want to mention in this, there are situations where claims are going to be made for infant formula.  And, yes, if someone is making a claim for infant formula, a growth study is important.  It's my personal opinion that there's an awful lot more research that is needed to go out and make some kind of a claim for an advantage or disadvantage of an infant formula.  And no one has ever regulated that.  I certainly would hate to see claims that I see at the health food store for other foods.

    Now, when I went through here, I gave you on page 2--I tried to break down the aspect of, you know, how we make an infant formula, what the steps are in it, and then we'll try to discuss how those go into the aspect of our evaluation.  Certainly the ingredients, which we have had discussions of, and they've asked me to try to discuss all of the other various ones.

    But the aspects of things like batching, in other words, this is where you're going to put together all the ingredients, they have to be put together in a specific order, specific manner, specific temperatures, and specific times of what is going on in there, and it can make tremendous differences in the stability of nutrients, interactions between nutrients, a lot of other things that are involved here.  And we will say later on, you know--and someone mentioned before, hey, we could just test this ingredient in an infant formula someplace, and then everybody could use it.  Well, I have serious doubts about that, and I will try to address those as I have time here.

    Certainly heat processing is very, very important in the interaction between the various ingredients, and, therefore, we're going to have to always think about how much heat processing a particular product has received.  But that heat processing is going to interact with all of the other aspects of things that I'm talking about here.  And it has to do with, you know, aspects of how we get the product into the container, how it's going to hold up over shelf life, innumerable aspects of that.

    Now, let's just go down through and look at a few things in relationship to proteins.  Dr. Fomon addressed it.  Maybe I better not spend too much time on it.  But, clearly, if you think you're going to measure the protein quality of an infant formula as to its adequacy, you must feed a very, very low protein feeding.  In fact, Dr. Fomon, just a little lower than you dared feed in those studies that you showed them here.

    Certainly when you want to use a rat to study protein quality, you get half its maximum growth rate.  I don't think you'd like that for a clinical study on infants.  So if you want to answer about protein quality, you're going to have to do other types of research.  There are such studies that are very, very complicated.

    I also make reference in here to the PER.  This is a rat study.  From my point of view, it has almost nothing to do with how acceptable a new protein source is for the quality of an infant formula because the amino acid requirements of an infant, as far as, say, patterns of amino acids, however you want to address it, are very, very different, certainly must be very, very different between the rat and--but it is a very useful test.  It is useful in the sense of if a formula is different, in other words, if I have done different heat processing, I have an ingredient that's interacting with the protein or something like that, it will tell me that something has changed in relationship to the protein in a PER study.  And you better watch out.  It is just like our attitude on the clinical growth study.  It is a very sensitive test, but it is very non-specific.  It doesn't tell you what is wrong.  It says something is wrong.  And that is very important to our work.

    I gave you some interesting examples in here in relationship to why I think we're a clinical study, and one of them I should mention because it got tossed into that, goat's milk.  Goat's milk is a high quality protein.  Why would somebody run a clinical study if you made an infant formula with it?  Some of you even wondered.

    Well, from my point of view, it is certainly different from bovine milk.  It certainly can interact, as the protein mixtures always will--protein mixtures are complex mixtures, and they have a potential to interact with other ingredients, possibly to produce unsafe material or to change the quality of the protein.  Our feeling would be that we would have to run a clinical study.

    I tried to think through and look at examples of ones that you might look at in relationship to protein quality, and they posed the question of:  What if you used ultra-filtration to clean something out of soy protein isolate?  Well, I said that wouldn't really need a clinical study.  It certainly doesn't damage the soy protein isolate.  It doesn't remove anything that you normally think of as nutrients.  It removes phytoestrogens and assorted other materials.  And so I think you could justify that you didn't need to run a clinical study.

    If we were faced with that situation, would we run a clinical study?  Yes.  I think so, Ross.  You just wouldn't want to be sitting out there without it.

    I also made some derogatory remarks about protein hydrolyzates.  They scare me to death because it makes--and that's not the mild protein hydrolyzates.  Some of you are smiling.  These are the highly hydrolyzed mixtures that taste like mud and so forth.  But they have a very definite use.

    The interaction of those with other ingredients in the formula is unbelievable.  Certainly you would run a growth study of anything, you know, that received that kind of treatment.

    I think my time is running a little short.  Let's go through--because you can ask in questions if you have anything about carbohydrate sources, minerals, et cetera.  To me they are relatively straightforward vitamin mixtures.

    Food additives is something that has bothered me, always has, the aspect of how you approve food additives.  One of them--probably the only one that has been studied adequately for an infant formula from my point of view is carrageenan.  I'm sure there are people that will argue with me about that.  I spent my whole career working with carrageenan.

    Other additives, however, are based on are they safe for adult population at a level that they would be fed there.  The intake in an infant can be so much greater and where it's the sole source of--I put one example in here that I just wanted to show you of a product that is generally recognized as safe, or GRAS, as far as the Food and Drug Administration.  It's considered to be perfectly safe to add to any food.  FDA may disagree with me on that.

    But when we went into using this food additive, we found out that we knew nothing about its metabolism.  We didn't know that the infants were going to be excreting large amounts of organic acids that we knew absolutely nothing about.  And so we started out with rat studies and with labeled compounds to try to find out what in the devil was happening to--in this case it's octenyl-succinic-anhydride-treated starch.  And we found out that there were these organic acids that were being excreted in the urine, and I was proposing some rather extensive animal studies and hopefully to go into the studies later on.  And after a while, the people in companies ended up spending quite a bit of money.  They said, Do you think you're ever going to convince yourself these are safe?  And they decided, oh, let's not keep dumping money into a hole.

    So as far as I know, our company did not use that additive, at least in the way we were talking about.  But it was certainly one that there was no legal bar to its use, and I felt that there was an awful lot that had to be studied about what the material was.

    I also have the situation where at some time people came to me and said we think you have this horrible hydrolyzate formula, and we think it would be a wonderful idea if we could improve the flavor of it because the infants would consume more of it and grow more happily.  Well, one, their food consumption was very good on it, even though it tasted--I mean, you would never drink it.  But you probably would have a hard time when the formula bottle was open and be in the same room with it.

    But, anyway, they wanted me to evaluate flavors and the possibility of feeding it to infants.  And there are more than 2,000 flavoring compounds that are perfectly legal to use in foods, and I spent a lot of time looking over those.  And where I could learn the information I needed about the metabolism and so forth, I had some serious reservations.  Many of them I couldn't even find out what happened to the--what would--how they would be utilized.

    I don't know that we have flavored any infant formula--I hope we don't, or I hope somebody around there goes back and does that over again, because I wasn't too happy with it.

    Now, I was asked to try to get criteria for the aspects of how you would evaluate whether a clinical study would meet--it's right at the end of the paper.  There are five aspects that we have here, and I guess it at least shows that I'm a chemist because I feel I have to know the chemistry and the reactivity of anything that I put into an infant formula.  And if somebody can't explain to me what it's going to interact with, what it's going to do, you know, I can't approve it going in there.

    Then you start proceeding to how is the infant going to metabolize this and do we know anything about that.  And I think we ought to have some data, even before you go into a clinical study, but have a reasonable idea of the metabolism.

    Now, some of these things are easy.  Sure, we use mono- and diglycerides as a multiplier in infant formula.  Those are natural products of the digestion of fat.  That's easy, both in the chemistry and the metabolism.  But there are other things that are much more complex and sometimes require important research studies.

    Then you've got to go through and evaluate your whole process, and this is something to get you to realize that the aspect of the clinical study is something that's going to come last.  All of these things are going to be done.  All of the chemistry, all of the stuff that's done in the pilot plant, is going to have to be worked out before anybody thinks about, you know, feeding it to an infant.  And I think with good reason.

    But any change that we have made in the formula has to be thrown into the mixture and say how could it have changed some nutrient, some interaction of ingredient, or something like that, to damage it?  We certainly have tremendously sophisticated chemists these days that can go in and tell us just an awful lot of things about what has happened when a nutrient is--the state it is in a particular formula.

    And if we started to find out that it was very different from what was in existing formulas, I would have concerns and feel that you have to have it resolved even before you went into the clinical.

    Also, you should know whether you have any historical experience with similar formulas.  In other words, if somebody--you know, we need a new formula or marketing people have some idea that there's reason for one, and you adjust the amount of milk protein down a little and the whey protein up a little, and you, you know, shift a little in the amount of fat and some of these things, you can visualize, well, you know, gosh, we've got all these clinical studies, and we've done things almost identical with that.  Why do you want to run a clinical study?  And it's right.  There isn't any reason if you have historical experience in clinical studies or experience out in the market that a very similar formula was the same way.

    But if you find that, no, it is very different, then you've got to proceed to these other issues and try to answer the question of, you know, do we have enough knowledge to not run a clinical study?  It's asking a negative.  And it has to be that way.

    You also should ask yourself the question of whether there are going to be physiological effects, and as I note in here, people are always coming to us with marvelous ideas about some fancy carbohydrate that we can add or a new fat or something like that, and it's going to have some remarkable physiological effect.  Well, it is?

    From my point of view, that means that's it.  We certainly must at least run a growth study.  No way are we going to, you know, not have that.  But we also ought to be able to within that study try to understand--try to measure something that will tell us whether it has that physiological effect, or any, and can it in any way be injurious?  And don't tell me that, aw, but it's more like human milk.  I do not think that human milk would be safe to feed to an infant if we processed it in the manner that we process infant formula.  I don't think any of you can make it into a product that would be safe.

    Now, that's just something to remember in this.  I'm not saying anything against human milk or breast feeding at all.  I'm just saying we are tough on the stuff that we've put into those bottles.  And, yes, I know you give some heat treatment to the human milk that mothers collect and bring into the hospital.  But you don't give it the kind of heat treatment we give it.  No way.

    So these are the kinds of things that we have to do, and if you do get down through all of those and you're confident of all of these things, then you can draw the conclusion that, well, there isn't any real reason that you have to run a growth study.  The likelihood of it is small.  And often within our company, the consequence was, aw, run it anyway, even though it's expensive, because the last thing you want to do--as I say here, one should use caution in drawing this conclusion.  Why?  Because if you're out there with the product and it doesn't perform the way you want it to, life will be unbearable.

    Can I answer questions?

    DR. GARZA:  Thank you very much.  Yes, I'm sure you will.

    [Laughter.]

    DR. GARZA:  Any questions?  Don't make me--Dr. Heubi?

    DR. HEUBI:  Can you comment about pre-biotics and pro-biotics and what you would think a company would require in terms of proof that they're safe and potentially efficacious?

    DR. BENTON:  Whew.

    DR. HEUBI:  It would take an hour, huh?

    DR. BENTON:  Well, I've thought about the subject.  It is scary.  It probably could be done.  You're dealing with types of materials that in most cases are going to be reactive, that you have no experience with intake of these materials at the kind of intakes that an infant is going to have.  You really ought to be looking for various kinds--for innumerable different effects that you would have on the infant or could have on the infant.

    I would have to have an individual example to try to deal with, and I'd probably spend the next month in the library, at least, because it--those are hard questions.

    DR. GARZA:  Dr. Baker?

    DR. BAKER:  You, I think, have made the best argument I've heard for not including a breast-feeding control in infant trials.  But I still have a little bit of a problem with the issue of a baby who's growing too fast.  I think that we all agree that there are times when babies are growing too fast.  How are you going to differentiate--how are you going to tell whether your formula is giving optimal growth as opposed to good growth?

    DR. BENTON:  Well, I think you can tell, one, I'm not going to.  But even back in the days when, you know, I would have been involved in any decision within the company trying to address that issue, it would have been an awesome process because I really view growth as a very, very complex thing.  I mean, it's not a maximum thing of how much fat do you have in the baby and, you know, what muscles.  I mean, it's down to the cellular level.  It is innumerable things.  And to say what is the right growth, you're going to have to look at those things at least to some degree, and you're going to have to have follow-up for a long, long time, you know, to draw that conclusion.

    DR. BAKER:  I think some of us in this room would say that formula-fed babies are too fat or too big, and so we're trying to figure out is there a way to feed babies with formula just right.  I think that's a real issue.

    DR. BENTON:  Oh, well, Dr. Fomon did a series of studies at different caloric intakes, and these are perfectly adequate formulas.  They just have a little more water added to them.  And he was able to get down to a level that even scared him.  He got down to 11 calories per ounce from 20 calories per ounce.  And you've got a different growth rate.  At that point he was a little scared, and we did quit on some of those things.

    I was scared, too.  But we could reduce the caloric concentration on the formula, and the way mothers would feed it, you would get a slower growth rate.

    I'm fairly confident you probably would reduce the body fat some, although I'm not so sure of that.  Certainly the relationship of growth to body fat and so forth is a very, very complex issue.  Long ago, when I was at Cornell, we studied that in the rat, and we could do tremendous things, although usually the rat followed constant increase of fat increase with time no matter the crazy things that we did to the poor creature.  That relationship seemed to stay constant.  But you would have to address it and do that.

    Of course, you could also reduce it by reducing the protein content of the feeding.  I get scared of that, too.  I guess you would say, hey, he gets scared awfully easily.

    DR. BAKER:  But you wouldn't claim that that was optimal growth?

    DR. BENTON:  I have no idea.  I have no idea of what's optimal growth.  I know that I have to sell an infant formula--well, when I was out there, we had to sell an infant formula next week, and so we could not shut down and say, well, when we've got an optimum formula and an optimum feeding pattern and an optimum--got everything arranged, then we'll start, you know, introducing feeding again.

    I think our company would be very--Ross Laboratories, my ex-company, would be very interested in participating in studies of that.  The idea that as a stockholder in Abbott Laboratories I don't think they could afford it to fund the whole thing because it is an enormous project.  But it might be possible to do it.

    But if, you know, somebody comes to you and says, well, that's easy to do, I don't think they understand the situation.

    DR. GARZA:  We'll take one more question, if there is one more.

    DR. DOWNER:  Thanks for such an interesting presentation.  Did I hear you correctly in saying that in your opinion you didn't think that clinical trials were indicated as this time for the matter at hand?

    DR. BENTON:  For what?  Pardon?

    DR. DOWNER:  Did I understand you to say that clinical trials in your opinion would not be indicated for the matter that we're discussing at hand?

    DR. BENTON:  For formula we were discussing?

    DR. DOWNER:  Did I understand you to say that you did not agree that clinical trials were needed at this time for the matter that we're discussing?  You didn't say that then?  Okay.

    DR. BENTON:  No, I don't think so.  Our comparisons would always be to an infant formula.

    DR. GARZA:  I'm sure we're going to come back to that during the general discussion, Duane, but let's move on to the next speaker.  Thank you very much.

    The last presentation before we move on to that discussion is Dr. Denny Bier, who directs the ARS/USDA Children's Nutrition Research Center at Baylor College of Medicine, where he is professor of pediatrics.  And he will be discussing clinical consideration in determining the need for clinical studies.  Dr. Bier?

    DR. BIER:  Okay.  Duane said that the clinical studies come last.  Well, here I am.

    Whereas I'd like to make a few disclaimers, first, I was fed on evaporated milk formula.

    [Laughter.]

    DR. BIER:  So if I say anything dumb, that's the reason.

    DR. GARZA:  Should we add that your Dad was six feet tall?

    [Laughter.]

    DR. BIER:  Right.  And my mother won the Nobel Prize.

    Two, whereas I did nose around among my knowledgeable friends for opinions, although the ones I express here are my own, and Duane and Sam said they could say what they want because they're retired, and I'm a director and my faculty thinks I'm retired, so I can say anything I want here.  Let's see.  Forward, if I can figure it out.  The round button?  Okay.

    General operational principles for this talk are that as far as I can tell, today's formulas all contain--contain all of the known essential nutrients, and we don't find nutritional deficiencies that we know about, and for this reason new additions to formulas are likely to be those which have other purposes besides simple replacement of nutrient deficiencies.  And I have not considered things like--no, no, that's not it.  Okay.

    [Laughter.]

    DR. BIER:  I'm not going to press it again.

    I have not considered these things which we heard about this morning.  That was good because we weren't supposed to consider them.  I did leave out one important thing here, which is political considerations, which are certainly going to enter into whatever happens here later, I would assume.

    Okay.  Real-life Investing 101.  When the energy and metabolism bills are paid, Mother Nature makes a contribution to a growth fund.  This is why measuring growth is so important, because it occurs when all the maintenance needs are met.  So it's a fundamental indicator of adequacy.  And many people have talked about this, but growth measurements are advantageous because they're simple, they can be measured accurately, precisely.  They're not invasive.  And, most importantly, they're non-specific, that is, they're the best generic indicator of the fact that something unanticipated may have gone wrong or may have happened.

    Factors controlling linear growth are different from those responsible for body weight accretion.  For this reason, I think it's important to measure linear growth.  Now, we had some discussion of this in various ways this morning, but the genetics that control linear growth is different than the genetics that controls accretion of body fat.  The hormones and things and growth factors that control linear growth are different.  So I think if we're talking about growth as a whole, we have to measure both weight accretion and linear growth.

    I do not believe that body composition measurements are mandatory for a clinical study today, for a formula study, because the relationships among body components and childhood--infant, childhood, and adult outcomes are really not well established.  And for that reason I would say it's nice to measure body composition.  It's very important for research purposes, but it's not mandatory for routine clinical formula evaluation.

    My next guideline is that animal studies are never a sufficient substitute for human growth studies.  Several people have addressed this today.  Animal studies are necessary for proof of principle, for preclinical assessments of safety, for all sorts of other things.  But the rat stops there, or the pig or the mouse or whatever.

    Species differences in all sorts of events that deal with growth and developmental characteristics, the hormones, growth factors, metabolic differences among animals, and the species-specific characteristics of milk components and how they respond to them just make it not realistic to substitute animal growth data for human growth data.

    Guideline 4 is that the presence of a substance in human milk is not sufficient in itself to eliminate the need for a human growth study.  First, as I opened with, current formulas are nutrient sufficient, at least as far as I'm concerned.  The non-nutritive components of milk are not well characterized.  Some enter milk passively.  They come along with the movement of fluid and water and electrolytes.  Many don't have any established purpose, and some have profound biological activity but their role in human growth and development is not well understood, for example, all the growth factors in colostrum.

    So another way of saying this would be that the non-nutritive components of breast milk, what their function is in humans today, many of those are hypotheses, and because they're hypotheses, they deserve to be tested.  And then I'd just like to remind you that there are compounds in human milk that you don't want there.  For example, the infant's largest source of dioxin is breast milk.  Now, that's not the mother's fault.  But the infant's least source of dioxin is infant formula.  There's no dioxin in infant formula.  There's dioxin in breast milk and, therefore, we shouldn't use--I'm not trying to imply that anybody wants to put dioxin in infant formula.  But it's just an indicator of the fact that its presence in human milk doesn't mean that it's necessarily safe or should be there.

    Guideline 5, I don't believe that data from post-marketing experiences elsewhere are really sufficient to substitute for a pre-market growth study.  Post-marketing surveillance is largely an uncontrolled, anecdotal experiment.  The validity of post-marketing data is heavily dependent upon the reporting of adverse events, and I think there are well-established and documented cases for reasons why people don't report adverse events.  Sometimes they don't see them.  They frequently attribute them to another causes.  Sometimes they don't talk about them because of potential liability issues.  And then they don't like to fill out the paperwork or get involved in answering questions about this.  So there's a significant underreporting.

    I also don't believe that post-marketing surveillance is likely to detect subtle growth effects because of the range of normal infant growth and development and length, some of the things we heard about earlier, and then the various confounding effects that occur in routine clinical practice, and these are not going to be helpful, I think, in understanding control differences in new formula.

    Okay.  Those are sort of guiding principles, and we were asked to provide criteria, and my first criterion is that an infant growth study is required if a substance is being added to human milk for the purpose of influencing human growth.  Given all the things that are--just, for example, colostrum growth factors, these sort of things that might someday be added to infant formula.

    If the stated purpose is because this material influences human growth, well, it seems to me you need a growth study.  Now, this is a subset of a more generic question, and even though--I forget who this morning said--these are not on the table, I'm going to throw in a few gratuitous comments.  If one claims an effect or benefit for a material added to the formula, then you have to demonstrate the claimed effect or benefit.  I think that should be a fundamental principle.

    The Working Group on the Nutritional Assessment of Infant Formulas of the Committee on Medical Aspects of Food and Nutrition Policy in the U.K. recently issued a statement which said "the goal should be an hypothesized functional or clinical benefit based on defined outcome measures."

    "Any modification which is hypothesized--this should be "or," I guess--"or claimed to have significant advantages should be subject to clinical trial."  This is basically sort of the same generic principle.

    Okay.  The sort of complement to that principle, Criterion 2, is that an infant growth study is required for macronutrients or other components that are known to affect the hormones, growth factors, genes, or metabolites that control human growth.  So we have, you know, a variety of examples of that, things that, you know, for example, growth--amino acids that are growth hormones, secretagogues.  We have fatty acids that are known to have gene regulatory effects, things of this sort.  And if that's the case, then one needs to do a growth study.

    Criterion 3, I would suggest that an infant growth study is required for formula changes that result in nutrient levels that are outside of established ranges.  Now, I didn't quite know how to say this and it make sense, so I'll explain to you what I was thinking.

    First, I think it's time to update CFR 107 because it talks about, as I recollect, minimum levels of 29 nutrients, maximum levels of only 10.  It exempts a variety of others, and I think we now have LSRO reports on term infant formula, preterm infant formula, and the new DRIs.  And I think it's time to start establishing a new set of consensus ranges for term and preterm infant formulas.  And that would also require addressing the exemptions that currently exist for preterm formula.  Should it still be exempt?  I mean, these are legal questions and regulatory issues.  I don't know, but I think it's time to look at them again.  And I think its time to address the issue of the various non-nutritive substances that people are considering or talking about may have benefit in infant formulas.

    Once that's done, you would have consensus ranges.  You know, they might turn out to be regulations.  I don't know.  But you would have some relative established ranges.  If they became statute, then it would be illegal to actually manufacture or to sell a formula that was outside those ranges.  But hopefully science changes along the way or we advance in some way, and there will be a time when someone wants to test things that are outside of those ranges.  And I think growth studies or studies of effect would be required for any proposed formulas that fall outside those ranges.  And this is, in fact, you know, equivalent to the opinion of a recent workshop report from the European Society for Pediatric Gastroenterology, Hepatology, and Nutrition.

    Criterion 4, addition of an entirely new compound to formulas requires a clinical growth study.  I also sort of slipped away from trying to define what I mean by an entirely new compound, but I mean things that haven't appeared in formula before.

    Now, as I mentioned, substances that are present in human milk, in my opinion, is not a sufficient reason for an exemption from this.  Substances not present in human milk, again, for reasons we talked about earlier, animal studies are insufficient for establishing either safety or efficacy.  A human study is required.  If a human study is required, it contains several endpoint variables.  One is growth as the non-specific variable of overall adequacy or identification of untoward effects, and then some primary endpoint variables that are relevant to testing the hypothesis of why you added this material in the first place.

    Criterion 5, all entirely new formulas require a growth study, and, again, I'm not talking about legal definitions here.  These will have to become legal definitions, but formulas that are not simply modifications of products already marketed in the U.S.  For example, if one has a marketed formula and one changes the composition in a modest way--and I don't want to try to define that here--that would not necessarily fall into this category.  But if an entirely new company is formed that's marketing an entirely brand-new formula that's never been marketed here before, even though they've stolen it from Ross or Mead Johnson or somebody else and are now making it themselves, they would have to establish the proof of nutritional equivalency or--equivalency to or superiority to marketed formulas because of the things we just heard about from Duane, that is, the different types of ingredients, ingredient sources, production and processing variables, matrix interactions, differences in absorption, bioavailability, et cetera.  This coincides with the 1988 Academy of Pediatrics Task Force position.

    Then, finally, the last criterion is that formula alterations that are likely to affect GI function or nutrient bioavailability require a growth study, and it seems to me this is, again, sort of a fundamental axiom.  I mean, the gastrointestinal tract is required to assure bioavailability.  And if you put something into the gastrointestinal tract that affects the function of the gastrointestinal tract, you have to show me that the function is still adequate.  There are known matrices that are more or less difficult to extract nutrients from, changes in macronutrient composition that affect absorption, for example, different types of fatty acids, enterocyte function, nutrients that react with enterocyte receptors and, in particular, gastrointestinal flora that have effects both among what the flora do in the intestinal lumen and how they interact with enterocyte receptors.

    Then, finally, something we didn't talk about at all here today that I recollect, things that might affect gastrointestinal motility that may affect the ability to absorb nutrients.

    So, with that, I will close and answer any questions.

    DR. GARZA:  Thank you, Dr. Bier.

    Any questions?  Dr. Stallings?

    DR. STALLINGS:  Thank you.  There were two things that I thought about while I was listening.  Early on you were discussing body composition, and, you know, that's one of the things I think as a committee we're thinking about, the role that might play.  And you made the distinction between body composition measures being appropriate for research and not appropriate--and suggesting not appropriate or necessary for these infant feeding studies.

    And I think as I've thought about this over the last, you know, few months that we've been doing this, part of it is what is the quality of the research that's going to be required to bring products to market.

    So I'm not trying to put you on the spot.  Clearly, you're hearing some of that, too, that what we might be doing in our laboratories under protocols that we derived and are those really--should those be different, we're trying to get at the same thing.  So just maybe elaborate on that in that context.

    DR. BIER:  Well, first, if someone wants to add body composition measurements to studies for licensing of a formula, I think that's entirely appropriate because I think we need far more information about what body composition means.

    Right now I don't know what to do with the data, so if you gave me an old formula, Formula A that's marketed and now you test Formula B and it has slightly more fat and whatever, I don't know what to do with that information.  Does that make it better?  Does that make it worse?  I don't know.  Certainly a lot of people would say, well, if the infant is slightly fatter, well, that makes it worse because they may be fatter later, and I would say that's an hypothesis that we certainly need to test a little bit more.  So I'm not sure I know what to do with the data.  Once I understand from the research studies what to do with the data, then I think it might be required, you know, for a new formula.  But right now I just don't the information's adequate.

    DR. STALLINGS:  But you're making a bit of a distinction between the research and the activities the companies are doing--

    DR. BIER:  Okay.  What we call research--okay.  You know, yes, I'm talking about research for a primary scientific purpose as opposed to research for a licensing.

    DR. STALLINGS:  Well, that leads nicely into the other question.

    DR. BIER:  Good.  I'm glad I'm baiting you.

    DR. STALLINGS:  You did fine.

    [Laughter.]

    DR. STALLINGS:  But the other thing you've heard us contemplate here is what is the role of the pattern of growth of modern-day contemporary breast-fed babies and looking at if not optimum but desirable, or whatever we want to call it, what are your thoughts on that, because we have had 20 years of--

    DR. BIER:  I don't know what optimal is, and I know if I want to test a formula and know that it's adequate, I need to test it against other adequate formulas, not against breast milk, because I know it's going to be different than breast milk.  And I think it was Duane who just said we may be able to produce formulas that are equivalent to breast milk, and until someone shows me that they are better in some long-term way, I don't know what that means.

    DR. STALLINGS:  Thank you.

    DR. GARZA:  Any other questions?

    [No response.]

    DR. GARZA:  Taking Duane's suggestion that, in fact, because one can manipulate formulas in a variety of ways, theoretically at least, to be able to achieve a growth pattern that would be comparable to human milk, what formula would one use as a standard against which to judge, given the fact that formulas are evolving and changing over time?

    DR. BIER:  You mean any new formula with regard to another formula?

    DR. GARZA:  Yes.  And if they're different, how do you decide if it's an improvement?  Does following growth then become a moot point?

    DR. BIER:  If it's an established--if you're making a change in a formula that's compared to an established formula, then the established formula becomes one.  If it's not, then I think what you have to resort to minimally are growth data from, you know, formula that babies, not breast-fed babies--

    DR. GARZA:  Does that in essence then become the desirable growth pattern?  And what evidence is there that that, in fact--

    DR. BIER:  I don't know what a desirable growth pattern is, Bert.  Can you tell me what that is?

    DR. GARZA:  No, but--well, I was trying to follow up.

    DR. BIER:  Well, tell me.  I don't know.  I mean, I think that's what we struggle with.

    DR. GARZA:  That's right.  The logic of the proposition that, in fact, if one were to use a formula, then you have to choose the formula that you're going to be using to compare it with.  And if growth is going to be the criterion and that becomes the control or the standard, then, in fact, a priori, one has decided what is the control or what--a value judgment has been made.

    DR. BIER:  I think that's a fair question, and I'm not sure I, you know, know the answer.  I think now we have a variety of infant formulas that at least by all nutritional criteria are adequate.  I honestly don't know if we have the data sets that allow us to compare those to each other in these critical periods of time to determine what the noise is if we use any one versus the other.  Perhaps the industry people can tell me that.  But that would be something that would have to be established.

    DR. GARZA:  The other implicit assumption is that one would use growth in isolation of any other information to be able to assess manipulations in a formula that would achieve a growth that's coming from human milk.  One could use body composition or one could use growth rates at different periods, or one might be able to use metabolic indicators for adequacy to make sure that nothing was limiting growth in an adverse way.

    Among those three, which would you think would be the most relevant or which others?

    DR. BIER:  Well, I was asked to talk about growth, so that's why I focused on that.  I mentioned body composition because I don't think it's necessary.  I think in the case of many of these ingredients, one would have to have a metabolic measurement.  So if you're changing the iron content of milk, for example, you would certainly want to know what serum iron is or ferritin or something like that.

    I mean, I think it depends on what you're adding, and you have to add other components that allow you to test what the functional changes in the formula are.

    And, by the way, there maybe--you know, I think your question also has in it this generic issue of if we re-establish what the minimum requirements are for testing formulas, which I understand the committee is now going to expand into those questions over time, yes, what besides growth is necessary and what are some of the fundamental things?  We heard about neural development earlier.  That's obviously terribly critical in formulas, and as you know, the latest additions to formulas, the LC PUFAs are in some degree based on changes that might occur in neural developmental outcome.

    DR. GARZA:  We heard about Einstein formulas last time.

    Okay.  Any other questions?

    DR. KUZMINSKI:  I listened to your presentation, Dr. Bier, and to Dr. Benton's, and I'd link--both of which I appreciated very much.  And I try to link material that I've heard and read with the material presented in Table 1 that was presented to us right up front this morning and, coming out of that, conclude that there are very few instances where a change in formulation does not necessitate a growth study to validate the change effectiveness, the effectiveness of the change.

    And I guess I reflect a little bit that this is going to cause--this is a gray area.  I hear comments like depending upon the experience, internal experience of the manufacturer, the experience at the clinical testing organization, that would conclude that a growth test is needed or not, is it a serious change, is it a significant change?

    I guess I finally get to my question, that I suspect that this will cause difficulty for the agency to try and put a rope around to try and harness what is a significant change.  What is a significant amount of internal experience that leads a manufacturer to judge that no trial, no growth trial is necessary?

    I'd be very interested in your comments and Dr. Benton's comments on that observation.

    DR. BIER:  Well, you noticed I avoided all of that in my talk.

    [Laughter.]

    DR. BIER:  No, I mean, that's the nitty-gritty.  I mean, we're going to--there's going to be cases when you're doing something that's so outside of prior conventional practice that everyone will agree that you need a study.  And then there are going to be minor modifications to formulas which I think most everyone in the room, including myself and people who think you need growth studies, will agree it's almost certainly unnecessary in this case if some information X, Y, and Z is established.  That's where, you know, the FDA and the infant formula manufacturers and whoever address this will come to some functional rules.  And I don't think they're going to be very easy to find.  I just don't know--you know, I don't think there's any simple rule.  Well, the simplest rule is to say yes, any change of an infant formula requires a new study.  That's the easy rule.  But that's probably not going to be the functional.

    DR. GARZA:  All right.  Thank you very much, Denny.

    We're going to take a break now and come back at 3:45 and then have the speakers all come to the podium at that time for a more general discussion.  Thank you.

    [Recess.]

    DR. GARZA:  If we can get all the speakers to come up to the podium?

    We will try to go no later than 4:30 with the panel--I'm sorry, no, that's not the whole committee.  Sorry about that.

    [Laughter.]

    DR. GARZA:  We are good, but I fear we're not that good.  We now have everyone assembled, so let's start with questions.

    Maybe we will done by 4:15.

    [Laughter.]

    DR. GARZA:  Dr. Anderson?

    DR. ANDERSON:  Imagine that I've taken a marketed infant formula and added to it a new substance and that the only information that I have subsequently beyond safety is that in a clinical growth study the children who were first measured at 14 days had a distribution which was set right at the median of the CDC 2000 standards, and when measured at 1 months, 2 months, 3 months, and 6 months the weight-for-age, length-for-age, weight-for-length and head circumference was all centered at the 50th percentile with--let's suppose that there were 100 such infants, and 2.5 percent of them were above the 97.5 percentile and 2.5 percent were below the 2.5 percentile at each of the measurement points.  Tell me why that particular formulation should not be approved for marketing.

    DR. FOMON:  Because the ingredient that you've added was supposed to be added for some purpose, and if it didn't accomplish that purpose, there is no point in approving the formula.  If you're going to add something to a formula, you have a reason for adding it.  If that reason is not substantiated, I don't think we're going to let you do it.

    Anybody else?

    [No response.]

    DR. GARZA:  Do you want to follow up?

    DR. ANDERSON:  Yes.  So a natural consequence of that would be that a double-blind, randomized, controlled clinical trial that wasn't focused on--that showed that there was no difference in growth between the new formula and the standard formula, in the absence of any measurements demonstrating that the additive had a desired effect would mean that such a formula would be inappropriate for marketing?

    DR. FOMON:  As a member of the FDA decision team, I say you're right, it would not be appropriate to market it.  There's no point in putting out just a whole series of formulas that are going to be marketed because of some proposed advantage if that advantage isn't present.

    DR. GRUMMER-STRAWN:  Let me address that in a couple of directions.  First of all, you haven't given us enough information to say whether we would want to approve it or not, because all that we've looked at is the growth data.  There may be other purported effects of this new additive that may have been demonstrated to be there.  There may have been particular marketing reasons that the company feels is important.  Those need to be evaluated, so there's a whole set of other characteristics that we might be interested in other than the growth data.

    But with regard to the growth data, how do we interpret that movement along the 50th percentile as the group mean?  Really getting back to the discussion that I had with regard to Bert's question, the question is:  What is the appropriate reference?  What you have demonstrated to me is that this new formula leads children to grow in the way that they grow across the United States.  Does that mean that it's a good formula?  I don't know.

    Suppose that the previous formulation prior to adding this new ingredient had children growing starting at the 50th percentile and then falling off to the 25th percentile, and this new additive now has caused them to have a more rapid growth?  Is that a good thing?  I don't know.  Back to the question Dr. Baker was asking.  As we're in a situation in the United States with a growing epidemic of obesity, starting in young children, is it a good thing to have faster growth in the first 6 months of life or in the first 9 months of life?  I don't know.  But certainly I would question whether the way children are growing right now in the U.S. is the optimal set of circumstances.

    DR. GARZA:  Dr. Bier?  And please identify yourselves as you answer because whoever is recording this will have difficulties.

    DR. FRONGILLO:  Ed Frongillo.  Just to add to that, I guess a concern would be that, for example, the 2000 CDC reference has an admixture of infants who were fed in different ways, and we know that breast-fed infants, at least who have been intensively fed for some time exclusively on breast milk for some time, for a few months, grow quite differently.  And the results that we looked at earlier showed that this has been replicated many times.  In the Euro growth study, they show differences that are similar, not quite as large but very similar in pattern, and that's from a large, multi-country data set.

    So I think one of the concerns--I mean, I think to me there's two parts to your question.  One is you seem to be trying to push our thinking about are there any circumstances when an existing reference might be used as a comparator rather than, for example, a randomized clinical trial.  And I think we should probably consider that part of the question.

    The other part is then what's the appropriate reference if you're going to use a reference.  I would say the 2000 CDC is probably not the appropriate reference because of the admixture of breast-fed and formula-fed infants.  If there was a reference group of formula-fed infants, one could maybe take up your question and think about are there circumstances when that's a good enough comparator that you would avoid the trouble of doing a randomized clinical trial.  Well, I argued in my presentation that it would be hard to justify doing that because in this particular case, things kind of came out nicely.  But when we do the study, we don't necessarily know how it's going to come out.

    DR. BIER:  I'd just like to go back to our mystery formula.

    You know, when you put a new ingredient in infant formula, you're doing a very big experiment on some number of children.  And I think one of the fundamental principles of experimenting on children is there has to be a significant benefit-to-risk ratio or an insignificant risk-to-benefit ratio.  Well, if you don't tell me the benefits, then the risk becomes infinite, or at least incalculable mathematically if you're dividing by zero.

    You know, it's almost impossible to prove that what you're going to add is infinitely safe. So it has to have, you know, the potential for some risk, and until you give me a reason for saying that the benefit outweighs the risk, I'm reluctant to take the chance.

    DR. BENTON:  Duane Benton.  I guess I should comment on it.  I certainly agree, I think, with all of the things that have been said.  There isn't anybody in the industry that wants to get an ingredient in there that isn't there for some purpose.  I mean, that's just going out there and looking for trouble.  Every ingredient is trouble, and you're certainly not going to put something in that you can't find some kind of benefit for.

    Now, if, you know, a regulatory agency were to look at this and think that some marketing people had cooked up something that was off the wall for a so-called benefit, I think they'd be highly justified in saying, Forget it, this doesn't even justify a clinical study.  Don't come back to us and talk to us about it until you can tell me why you're adding it and what the benefit is.

    DR. GARZA:  Dr. Stallings?

    DR. STALLINGS:  To come back to one of the issues I think I've been trying to get us to think of, and as our consultants, I want to pose it in a little bit different way.  I'm still intrigued by the idea of what the standard should be and knowing that none of us at this point in time know what's optimal.  But if we weren't in the current environment where we didn't have the history of the current regulation and the history of a very successful program of, you know, the additive tests, the previous formula, and it was 2002 and we knew the epidemiology of growth in this country, if we were just starting from scratch, what would we think of as a good way to say what the normal pattern of growth is?  Because I think we're struggling with two different things.  One, we have a history of something that's been working pretty well, and in many ways it protects the population, and it's given industry a framework.  But part of what we're being asked to do is to come back and really potentially look at it very freshly.  And I think that's where I keep coming back to is the pattern of healthy children born to healthy mothers who are exclusively breast-fed, say, for the first 4 months of life, is that something as experts in pediatric nutrition and growth and statistics and--you know, is that--would that be a place to start if we didn't have the history that we're currently working on, and the pros and cons of that.  I think it would be helpful for the committee to hear some of this.

    DR. CHUMLEA:  Okay.  I'll take a whack at that.  You said--

    DR. GARZA:  Please identify yourself, Cameron.

    DR. CHUMLEA:  I'm sorry.  Cameron Chumlea.  You said, okay, basically in 2002 we'll start from scratch or we'll reinvent the wheel here.  Okay.  And then from whatever information we collect, we would be able to determine what optimal growth is.  Is that--

    DR. STALLINGS:  I have a standard--

    DR. CHUMLEA:  I have a standard which we could do.  Okay.  We could do that, and we would find out basically pretty much that's what we did with NHANES III.  It was a study just recently done of children, collect information, gives us status information.

    And if that was where we started, we wouldn't know that we have a problem of obesity because it's the previous studies that kind of gave us that.  So we'd be starting off with potentially false information or bad information.

    DR. STALLINGS:  Four months.  First 4 months of life.

    DR. CHUMLEA:  Okay.  In the first 4 months of life.  So if we want to do this again, it will give us what is currently right now.  Okay?  If we did this in 1940, we would have gotten what's then, or 1929, or whatever.  Each of those would be what was at that present time, and they're correct for the present time.  But whether that's really the best or the end to meet all ends, none of us particularly know that.

    So if I was going to want to do that and to come up with the information and assuming that the federal budget is not an issue and we have plenty of money, you could do this by creating a sufficiently representative sample of children, multi-ethnic, both sexes.  It would be very intense to collect the information.  And we could collect, starting from birth and a multi-center study, the information that would give us both status and rate of growth from birth to 6 months.  But you're talking lots and lots of money--well, relatively.

    DR. FOMON:  Okay.  It's 2002, and we're throwing out everything we thought we had and starting over.  And now we have ease of recruiting breast-fed babies.  We know that they don't grow appreciably differently than formula-fed babies, providing we start at 14 days, because it takes the breast-fed baby a little longer to get on that post-natal growth spurt.  So if you start at 8 days, then the breast-fed babies grow faster between 6 and 14 days because they're still in catch-up.

    So we start at 14 days, and we compare breast-fed babies and formula-fed babies, and there's nothing too much the matter with that.  And if we demonstrate that we can--that the growth of breast-fed babies now is the same as it was during the past 20 or 30 years, we can use our big cohort of reference breast-fed babies.  We have to--if we're going to do it for a 4-month study, then the breast-fed babies presumably should have some rules like they're all solely breast-fed, and the formula-fed babies should have some rules like they don't get anything else except formula.  But that would be feasible.  I mean, I can't argue against that just because we're comparing formula-fed with breast-fed babies.  For 4 months--between 14 and 112 days it really doesn't make that much difference.

    DR. GRUMMER-STRAWN:  Larry Grummer-Strawn.  I think that the appropriate reference is largely as you described it, that I think we need to have a real prescriptive reference.  I disagree with Cameron in terms of we could do this based on going out and sampling the U.S. population, not only because of the problem of obesity but because we have an admixture that Ed was talking about of a variety of ways of feeding, and we cannot conclude that somehow mixing all of that together is going to give us what's optimal.

    I think that in developing formulas, if we're starting from scratch and we don't have them in front of us, if we're developing formulas, we would want to develop those formulas to best match what Mother Nature offers us and what we have defined at this point in time as optimal feeding.

    It is true that 50 years from now we might think differently of what is truly optimal and we will have to re-evaluate that decision.  But that is the nature of public health, that we always work with the best information that we have available to us.  So I think the appropriate reference is to find the population that is feeding in the way that we currently think is the best way to feed.

    How we get there from where we are now I think is a very difficult question.  I think we've raised interesting issues in terms of we already have a formula that is producing a different kind of growth.  It is not automatically better to--any formula that moves in the direction of breast feeding is the right formula.  So we can't immediately assume that that is a better formulation.  But I think that somehow we need to figure out how to get back onto the track that Mother Nature would have us on.

    DR. BIER:  Dennis Bier.  I thought Sam was going to give us the history of this in the 20th century and it would save my guessing at it, because he was there, you know.

    [Laughter.]

    DR. BIER:  I only guess.  And my guess is we started out the 20th century with infants who were formula-fed, you know, not doing as well as breast-fed infants.  And people developed the formulas, and, in fact, what they used as their standard was the health and growth of the breast-fed infant.  And at some point in the middle of the century or slightly thereafter, they found out, yes, by God, we can make infants grow just as well and be just as healthy as breast-fed infants, in fact, even a little bit better.

    Now, if we weren't concerned about the hypothesis that these infants have more fat, maybe, and are getting fatter on that basis later in life, the hypothesis that this is true, what would be arguing about here as far as breast and formula feeding?  They're as healthy.  They grow slightly better, if not the same or slightly better.  So, you know, our concern here about too much growth is based on accretion of body fat.  And any standard relative to that is going to be an arbitrary number based on someone's belief at this moment, in my estimation.

    DR. FRONGILLO:  Ed Frongillo.  Virginia, if I understand the question you were asking, if we could start over and dismiss the 20th century and we're now in the 21st century, what would be an appropriate reference?  But then I think that begs the question of:  For what question, for what purpose?  What is one trying to evaluate?

    For example, if our only concern is are the nutrients adequate and we're not really interested in growth itself but we're interested in the fact that growth represents the summation of a whole series of processes that include having adequate micro- and macronutrients, that's one question.  That's a very different question than does it lead to growth per se that we think is optimal.  And even that could have a time frame of the first 4 months, say, or first 6 months, or it could have a much longer time frame.  I mean, there is some evidence--I don't think it's terribly compelling yet in its magnitude, but there's certainly evidence that exclusively breast-fed infants tend to be less obese later one.

    Now, one could pose a question that we would like to have the growth that sort of represents not just satisfying the nutrient needs of the infant and some pattern of growth that we think represents optimal for now, but something that tells us something about what's going to happen with this infant in the future.  And so how we think about will probably determine what choice we make of the reference.

    I think Sam is right in the sense that there are certain windows--if you took perhaps the mid maybe 1 month to 4 months--I don't know exactly, somewhere in there--you'd probably find on average that the amount of growth of breast-fed infants and formula-fed infants, the growth that's accrued during that time is about the same.  They probably got there differently over that period.  If you extend it to 6 months, you'd probably find out they're not quite the same.

    One of the things that we should recognize is that there's been lots of studies done now that have shown that breast-fed and formula-fed infants don't--their pattern of growth isn't the same through the first year.  Everything we know tells us that by 24 months they've come back together.  But those studies have been done.  A lot of that has been motivated by--and I should have said this when I talked about the WHO Multi-center Growth Reference Study this morning--has been motivated by the management problems that have resulted from this discrepancy and pattern so that the concern has been that infants who are breast-fed appear to be faltering relative to the current international previous U.S. reference.  That management problem has been a major motivator in going ahead with a breast-fed based reference.

    So many of the studies have examined the differences in patterns in light of that question.  Here we're asking a different question.  And it may be that some effort to evaluate perhaps the Iowa data that have infants from both feeding modes might tell us something about this question that we've kind of overlooked in asking a different question.

    DR. GARZA:  A related question that is useful for the panel to discuss relates to Virginia's question.  Given the fact that we don't have long-term data to try to assess the functional consequences of different growth patterns in the first year of life, where should the burden of proof lie in terms of should we assume than that differences, until proven otherwise, are significant?  Or should we assume that differences, until proven otherwise, are not?

    DR. FRONGILLO:  Can I just ask, for my benefit, at least, for clarification?  Differences from what to what?

    DR. GARZA:  Growth patterns, the first year of life, breast versus formula, I mean, or one formula versus another formula.

    DR. STALLINGS:  But you're getting into long-term--

    DR. GARZA:  Long-term--given the fact that we keep referring to the fact that we can't--we don't have long-term data that try to assess the functional consequences of growth differences in the first year of life--

    DR. CHUMLEA:  What's long term?

    DR. GARZA:  Seven years, 15 years, 30 years, I mean, however you choose to define it, there are data, for example, now suggesting that certain growth patterns may predispose children to obesity.

    DR. FRONGILLO:  But I think--I'll turn this to somebody else who has an answer, because I don't, but--

    [Laughter.]

    DR. GARZA:  We agree to do the difficult, the impossible, but we're contracting out to you guys.

    [Laughter.]

    DR. FRONGILLO:  But it seems to me that question you're asking, Bert, there's at least two aspects to the scope of that.  One could be--has to do with formulas versus breast feeding, and if we started again at the 21st century, would we ever agree to have infants fed on any formula?  That's a very different question than saying given that we've already accepted that infants can be, should be in many cases fed on formula, about differences among the formulas.  Those are different issues.

    DR. BENTON:  I may be sort of throwing a question back at you instead of trying to answer it.  But I am really perplexed in the sense that you people seem to think that what you are seeing in difference in growth patterns has something to do with nutrition.  And I have no confidence in that at all.  I mean, clearly, breast feeding is an interaction between the mother and the infant.  And what the infant consumes is very determined in that process.  And if that is the case, what is changed is in the breast-feeding process and how it changes the baby and how it changes the mother and her outlook on feeding and so forth.  Our fooling around with formula to try to duplicate that is utterly hopeless.

    Now, clearly, if you want to have studies to do this, you're going to have to have randomly assigned feedings.  And you know--

    DR. ANDERSON:  Yes.

    [Laughter.]

    DR. GARZA:  Are you saying, Duane, just so I understand, that you think it's due to feeding behaviors between--differences between the two groups?

    DR. BENTON:  Because [inaudible - off microphone] I mean, they're not at that critical [inaudible].  It is later you're seeing the striking differences.  And that is the point at which, you know, it's the mother and the infant and how they're interacting.  I guess the other way you study it is you trade in [inaudible] mothers.

    [Laughter.]

    DR. GARZA:  You think that would be true cross-culturally, I mean, the fact that these differences are seen in all settings, that, in fact, it still reflects behavior.  Okay.

    DR. FOMON:  Fomon.  First time.  Bert, in answer to your question, and putting it, if I may, in the context of testing a new formula, if we don't know the consequences of more or less rapid growth during the first 4 months, all we have left is the comparison of how they grow in the first 4 months.  And that's all we can do now.  Either they grow as we anticipate on the basis of a cohort of breast-fed babies or a cohort of formula-fed babies, or they grow the same, and that's all we can do.  There is very little in the literature that tells you how growth during the first 4 months relates to any subsequent time.

    DR. GARZA:  Let me follow up because you've made an important point.  Let's assume, then, that one can--given the fact that in Duane's presentation, he clearly pointed out that one can manipulate growth in a variety of ways by formula changes.  How useful would body composition be in assessing both differences as well as similarities in growth pattern given what we heard in Dr. Ellis' presentation and DXA?  Would that be of any use in helping assess the nature of the differences or similarities, or do you feel that that would be either trivial or not very useful?

    DR. FOMON:  I think it would give you very little information at considerable cost in what you would get from measuring length and weight, if done properly.

    DR. FRONGILLO:  Ed Frongillo.  Comments to add to what Duane said.  I think that we know--Bert knows this more than just about anybody--that human milk is a biologically active substance.  There are regulatory factors there that we don't really understand very well.  So in addition to the perhaps maybe behavioral differences, it could be that some of the hormonal substances in milk play some role in determining the pattern that ensues after that.

    And then in response to your last question, I agree with Sam that for the kinds of questions that are being asked, it seems that having both weight and length information would be important.  Whether--

    DR. GARZA:  No one argues with that.  Body composition--

    DR. FRONGILLO:  And whether or not there'd be any additional value of body composition on top of that, I would be skeptical that that would bring much more information relative to the work that would be required.

    DR. CHUMLEA:  I'd just add, going back one question, to the effects within the first 4 months on some subsequent event, since we have recorded information rather than anecdotal information, and the issue, say, 15 years ago was that babies became fat adults type thing.  In the analysis of Fels data, there was no indication that anything before, in terms of skinfolds and weight before 2 years of age had any impact upon subsequent levels of obesity by 5 or 6 that did affect, and, of course, Sun's been happily able to show that changes in weight stature, BMI stature are very predictive of--in childhood and adolescence are very predictive of subsequent risk for obesity in adulthood, but nothing that shows up within the first 2 years.  Going to kind of what's called the Barker hypothesis, we've got blood pressure.  We've looked at birth weights.  There's nothing within Fels where we actually have recorded information and any information of adulthood that has any link whatsoever.

    From a nutritional standpoint, we actually have seven-day dietary records that were collected from the children.  What we have noticed in the preliminary analysis of that is that nutrition potentially in late childhood, say 8 to 12 years of age, and early adolescence, 12 to 16, does have predictive values on bone mineral density in terms of milk consumption in adulthood.  But nothing before that has any relationship--and we've got dietary records back as early as 6 months of age.  So nothing there links up.  So I think frankly that the infant is extremely plastic and can absorb a lot of insults or whatever that goes on and nature has it so that pretty much it can take whatever abuse we're going to give them, and they're still going to probably turn out pretty good.

    [Laughter.]

    DR. ELLIS:  It's just like we're saying, that we don't know what the translation is at this point for babies between, say, in the first 4 months or 6 months of life really what that translation is down the road.  We do know that, like Cameron has said, when you get to age 3, that there are relationships.  Of course, the closer you get to adulthood, the stronger the correlations would be, which is understandable.  Of course, again the 3 to 5, whatever, the main factors going between the first 6 months and there, environmental, genetic issues, even behavior issues and so forth, that are going to compound a simple relationship between what goes on the first 6 months and later.

    However, most people do argue that unless there's something unusual about a child, they tend to track along the same percentile.  And so presumably then if you have a high percentile, in terms of being percentiles, you're a high fat baby or a high probability of any increased fat, and, therefore, presumably those are the ones who also appear in the higher percentiles later on.

    Again, today can you say for certain a baby was 30 percent fat versus one who's 15 percent fat, that he or she has a higher probability of developing cardiovascular disease or obesity or diabetes?  The answer is no, we don't know that.

    DR. SIGMAN-GRANT:  Sigman-Grant.  My two questions are going to just demonstrate my ignorance, but be that as it may, nobody mentioned head circumference.  Is there any difference between, say, formula-fed and breast-fed babies in relationship to head circumference?

    DR. FOMON:  Head circumference in a normal baby is proportional to length.  And head circumference--increase in head circumference is proportional to increase in length.  The value of head circumference is in detecting discrepancy between gains in head circumference and gains on length, which has neurologic, diagnostic implications.

    DR. SIGMAN-GRANT:  And my other question is:  Is there any difference in organ weight betweens breast-fed babies and formula--and I know that there's only one way to look at that.  However, I'm just curious because we talk about gross--you know, gross weight and gross percent fat versus--I know, but that's--the reason I ask that is because of the potential bioactive compounds in human milk and its relationship to stimulating GI growth and that kind of thing.

    DR. FOMON:  I can answer that question.

    [Laughter.]

    DR. FOMON:  The answer is I don't know.

    [Laughter.]

    DR. FOMON:  And I don't think anybody knows.

    DR. GARZA:  Although there are some scattered reports at least that have been done with sonography for thymus differences.  But I don't know whether they've been replicated, and for thyroid, I think as well.  But I don't think those have been replicated, and they've been done indirectly rather than by direct examination.

    DR. SIGMAN-GRANT:  Well, I bring that up not to be facetious.  It's to ask a question.  There's just another thing we don't...

    DR. FOMON:  Any other questions that none of us can answer?

    [Laughter.]

    DR. GARZA:  Is their pay dependent upon whether they can answer these questions?

    Any other questions from the group?  This group will be here tomorrow, but we should not be shy.  Dr. Anderson?

    DR. ANDERSON:  Supposing the development of the infant formulas containing very long chain fatty acids, no growth studies have been done, but there was evidence of the benefit of neurologic development at 24 months or 36 months and the additives were generally recognized as safe.  Anybody willing to proceed without a growth study?

    DR. BIER:  No, because there is a priori evidence that fatty acids, particularly polyunsaturated fatty acids, can have effects on acosinoid (ph) metabolism.  They have effects on activating fatty acid oxidation genes, you know, reducing fat synthesis genes.  So we have a lot of a priori information that suggests we should do a growth study.  So the answer would be no.

    DR. GARZA:  I certainly want to thank each of you for being so patient with this committee and its unreasonable demands of you.

    [Laughter.]

    DR. GARZA:  But we will continue being unreasonable.  I hope that each of you can join us tomorrow.

    We're going to turn now--and you are certainly welcome to join us as we begin going through the seven questions.  You all have these in your packets.  I'd like to see if we could get through the first section on metrics for the evaluation of normal physical growth before we adjourn, and if possible, to take on at least part of No. 4 and 5.

    Before getting into that discussion, though, there is one item of take-home work for each of us, unless you can figure out a better way of doing it.  That is, if you look at Question 7, what I'd like each of you to try to do this evening--on your own, it doesn't have to be done as a group activity--is to--well, it can't be done as a group activity, anyway--is to look at some specific--pick one or two specific changes in infant formula that you think would reasonably be expected to be accompanied by a clinical study.  That is Part B to that question.  And then use that as a basis for trying to come up with general principles and criteria that led you to that decision.  Or you can do it in reverse, but if you can choose a specific example and guidelines and criteria, I think that if each of us brings those to the table tomorrow, then we might be able to agree on general principles or criteria that we would be able to answer Question 7 within the time frame that we've got.

    In the absence of that preparation, I think it may be more difficult for us to get there.

    You can pick an original one or go to Table 1.  It's whatever--so that if I want to deal with it in less than the abstract--Roger?

    DR. CLEMENS:  Roger Clemens.  To that point, Item 7, first of all, I was a 17-kilo baby at 12 months.  And you'd probably say I should have been very big right now.

    DR. GARZA:  Well, you do look big.

    [Laughter.]

    DR. CLEMENS:  I must admit, I am taller than anyone in my family, if that has any merit.  Just to help with Item 7, the IFC, the International Formula Council, provided each member here on the panel with a diagram--a decision tree for documentation of nutritional adequacy.  This was to have been mailed out to everyone, but I wasn't certain that everyone received a copy.

    DR. GARZA:  We received it about a week ago.

    DR. CLEMENS:  That's great, because I did not, Bert.

    [Simultaneous conversation.]

    DR. CLEMENS:  --copy of that decision tree.  Thank you very much.  I appreciate the fine work that Jeanne has done on that point.

    Also, just a comment, because Moski (ph) had made a comment from the CDC on examples that would not necessarily require clinical trials.  Again, on the handout that was provided on the clinical protocol that has been followed by the IFC members for the last 20 years, fundamentally, you'll see that on page 6 there is a list of about eight items, of which under the current regulatory guidelines do not require additional clinical trials.  That doesn't mean that the companies won't do them.  It just says they're currently not required.  I just want to bring that to light to everyone.

    DR. GARZA:  If some of you did not get the e-mail, you should let--

    DR. CLEMENS:  They were all distributed.  It's just a case of e-mails--

    DR. GARZA:  If you haven't received that type of information, then either the e-mail system is not working--but it was sent out.  I know that I received it about a week ago along with three or four other items on the attachments.

    DR. CLEMENS:  I appreciate that, Bert.  I just bring out No. 7 in particular because you talked about the items, and I want to be sure I have that as a guideline because it talks about those changes, what changes would require clinical trials under this current environment versus those which would not.

    DR. GARZA:  I'm glad you raise that.  I'm just more concerned about whether information committee members may not have received that was shared--

    DR. HEUBI:  I have this letter from IFC but didn't get this document.

    DR. CLEMENS:  See, that was my fear that they did not receive all the documentation.  Thank you for the opportunity to share that.

    DR. GARZA:  All right.  Then is dealing with Item 7 in the way that I've described acceptable to the group so I don't surprise you tomorrow morning?  I'll turn to each of you and say no.

    All right.  Then with that housekeeping item, let's turn to Question 1.  And we agreed that we would do this for term and preterms this morning as we reviewed the questions.  It's asking us to group the following metrics in terms of clinical usefulness as endpoints, and I would suggest that we try a grouping scheme that says extremely useful or mandatory or whatever, of moderate use, C, of no use, or, D, it's still in the research area and we really can't comment on whether it's of high, medium, or no use without additional research because there's enough biological plausibility or there's enough potential interest in the item that we would want to get it, but we just don't have enough information to make a judgment.

    That's one suggested grouping.  I don't know whether you may--and it doesn't mean that we need to fill in each of those cells.  But that's the grouping that came to my mind as I reviewed this question when they were first sent.

    Is that acceptable to the group?  Okay.  Then would you like me to call on each of you, or do you want a period of discussion before we do that?  All right.  Then why don't I start with Dr. Baker.  Is that all right?

    DR. BAKER:  Let's see if I've got this charge properly.  You want me to go through each of these measures and say whether they're mandatory for a growth study, whether they would be useful for a growth study, or not so useful, and then if they are experimental and not--

    DR. GARZA:  Without further research, you really can't put them in any box, but of interest, but, in fact, the research should be done.

    DR. BAKER:  Well, I think the first two should be mandatory without a doubt.  The third, head circumference, it's certainly not in the experimental stage.  I think it's accurate, but I don't see where it's useful for a growth study alone.  So I would say it would be helpful but not mandatory.

    I think the same thing goes for skinfold thicknesses.  I think that the major amount of information would be--is obtained through the body weight and length.  Skinfold thicknesses would be additional information that's useful.  The accuracy is somewhat less, and I would say it's probably not mandatory.

    Bioelectrical impedance, I don't think it's a stage where you can clinically use it, so scientifically I don't think it's appropriate yet.

    Stable isotopes--

    DR. GARZA:  I'm sorry.  Would you put that in the third category, No. 3, research?

    DR. BAKER:  Research.  Stable isotopes I think are at the stage where you can use them but not practical, so I would put that as...

    DR. GARZA:  Can you define "practical" for the group?

    DR. BAKER:  It's not practical in terms of the money that it would require, the testing, the exposure, the instrumentation, the--

    DR. GARZA:  Is that relative to the benefit?

    DR. BAKER:  --the repeatability of the study, so I don't think it's practical to use it for a growth study.

    The final one--

    DR. GARZA:  DXA is--

    DR. BAKER:  DXA.  DXA I think is--would be in the category of very helpful but probably not required.  Again, I think it relates to the feasibility of doing it, the availability of the testing, the time that's required, and, therefore, the money that's involved.

    DR. GARZA:  Any other physical body measurements or body composition measurements that are not in that list that you would like to add?

    DR. BAKER:  I'm not coming up with any.  Underwater weighing--

    [Laughter.]

    DR. GARZA:  Mothers may object.  Okay.  Dr. Stallings?

    DR. STALLINGS:  Body weight and recumbent linked, would those be essential?  I have more interest in head circumference, and so I put that in the second category of moderate interest.

    The two skinfolds that we discussed, triceps and sub-scapular, I would put in moderate interest.  I think bioelectrical impedance will not play out, so I have it in no use in the age group we're looking at.

    Stable isotopes research, DXA in moderate interest because I think it may be a method that will help us in the future if we go down that way.

    And the only other physical measurement that I felt like was missing would be the mid-arm circumference, and there's a history in neonatal care of using a head circumference, mid-arm circumference measurement that at times has been helpful in the research.  So I would put that with the skinfolds under research interest.

    DR. GARZA:  Can we get that down?  Dr. Heubi?

    DR. HEUBI:  Can I say "ditto"?

    [Simultaneous conversation.]

    DR. GARZA:  --literally with everything, but--

    DR. HEUBI:  I don't have really much--

    DR. GARZA:  But who's ditto because they differed a little bit?

    DR. HEUBI:  Ginanne's.

    DR. GARZA:  Okay.  I think I almost totally agree with what she said.  I don't know that I'll add much by making any more comments than that.

    DR. GARZA:  That's fair.  Dr. Anderson?

    DR. ANDERSON:  Obviously, this is not an area of expertise for me, but based on today's discussions--

    DR. GARZA:  This is for term infants.  We're going to do this again for preterm, so I just want to make sure everybody understands that.

    DR. ANDERSON:  It seemed to me that my enthusiasm for head circumference is somewhat greater than what I've heard so far in light of what we heard about the relative correlation of length to head circumference and its ability to potentially identify deviations from standard neurologic development.  Beyond that, I don't have anything to add.

    DR. GARZA:  You would classify head circumference then in the necessary--

    DR. ANDERSON:  Yes.

    DR. DOWNER:  Body weight, recumbent length, and head circumference would all be mandatory.  Skinfold thickness I would put moderate.  Bioelectrical impedance, do some more research on that.  Stable isotope as well as the DXA, additional research.  And for other physical body measurements, I too think that arm circumference may be of interest.

    DR. GARZA:  And you would put it in which category?

    DR. DOWNER:  Not moderate, but additional research.  Additional research.

    DR. GARZA:  All right.  Thank you.

    Dr. Sigman-Grant?

    DR. SIGMAN-GRANT:  The first three essential, body weight, length, and head circumference.  Skinfold thickness, I think it's moderate interest at this point.  Bioelectrical impedance, I'd say no.  Same thing for stable isotopes.  It's a research, but it wouldn't be mandatory.  I think there's potential in DXA, and I can't think of any other physical body measurements, but I think we should start thinking about getting out of the box, and maybe there could possibly be.

    DR. GARZA:  Any examples?

    DR. SIGMAN-GRANT:  Nothing that wouldn't be like MRI or--I don't even know, echograms or something like that.  We can look at--but it's not my area particularly.

    DR. GARZA:  All right.  Yes?

    DR. MOYER-MILEUR:  Moyer-Mileur.  I would agree that the first three, body weight, recumbent length, and head circumference, should be mandatory.  Skinfold thickness would be of moderate use.  Bioelectrical impedance I don't think has a valid reason for use in infants.  Stable isotopes for research, DXA I think would--could prove to be very useful with a number of caveats.  And then there are other physical body measurements such as air displacement and TOBEC that potentially would be of moderate usefulness.

    DR. GARZA:  Okay.  Dr. Kuzminski?

    DR. KUZMINSKI:  Sure.  Thank you.  Kuzminski.  Again, this is not my area of expertise.  I only know what I've read in the book and what I've heard in the discussions today.  As mandatory, I would think body weight, recumbent length, and head circumference.  Interesting, but not mandatory, certainly, skinfold thickness, but interesting.

    I agree with the comment of looking towards the future, thinking out of the box in terms of traditional measurements, and maybe DXA falls into that category.  And the others I would classify as research.

    DR. GARZA:  So you would put DXA into the research category or into a category that ought to be requested or--I'm--

    DR. KUZMINSKI:  Useful category.

    DR. GARZA:  Okay.  Now, just to help the recording, it may be difficult.  We've got, gee, it should absolutely be of use, moderate, might be useful, so you would put it in that second category then, DXA, along with skinfold thickness?  Okay.  Good.  I wanted to make sure I understood that.  Thank you.

    DR. DENNE:  Denne.  I would also include body weight, length, and head circumference as mandatory, and I would put head circumference there because it really is our only surrogate measure of neurologic outcome and reflective of brain growth.  So I think it belongs there.

    Skinfold thickness is sort of moderate.  Bioelectrical impedance, as discussed before, probably not useful, and the other techniques, research.

    DR. GARZA:  Okay.

    DR. THUREEN:  Thureen.  I agree that the first three are essential.  Skinfold thickness is of moderate interest, and I always do mid-arm circumference with skinfold, so I'd put those in the same group.

    Bioelectrical impedance I think is probably of no use at this point.

    Stable isotopes and other physical measurements are research tools, and DXA I have mixed feelings about because I think in most instances it's a research tool because it's not widely available.  On the other hand, I think it's probably going to be of some significant use in the future, so it's kind of between the second and fourth category for me, but at this point I guess I'd call it a research tool.

    DR. GARZA:  Okay.  Thank you.

    Dr. Briley?

    DR. BRILEY:  Briley.  The first three I think are of great use, skinfold thickness moderate, and the bioelectrical impedance, not yet.  And the isotope and the DXA are research tools.  I just wish that industry could get it down to it would be less costly piece of equipment.  And the last is research also.

    DR. GARZA:  All right.  Roger, would you have any comments?

    DR. CLEMENS:  I certainly agree with the [inaudible - off microphone].  I certainly agree with the rest of the [inaudible] research tool [inaudible] more cost-effective, technology will change [inaudible].

    [Laughter.]

    VOICES:  Say it again.

    DR. GARZA:  No, we won't ask him to do that.  That's all right.

    There is a fair consensus on the first category in terms of body weight, recumbent length, and head circumference.  I realize that it isn't unanimous, but generally as close as we come.  Would anyone like to object to it being characterized in that way on the committee?  With skinfold thickness as being of moderate interest, DXA at times was placed there, at other times it was put with research, but predominantly as a research tool.  Is there any objection if the minutes record that in the end the discussion said, well, it's more of a research tool right now but one that we would recommend highly to FDA to get more information on because of its potential usefulness in future assessments?  Or do you want to say no, it's of moderate interest?  Ginanne?

    DR. STALLINGS:  I'd--

    DR. GARZA:  You need a mike.  I'm sorry.  Otherwise--I may get away with not asking you to repeat once, but possibly not twice.

    DR. STALLINGS:  Stallings.  I'd like to pause and have a little bit more discussion of that, because I think it is going to be very available and is in most centers--and if you do some of the study kinds of things, like having the phantom and centralize the assessment and that sort of thing.  I think it's moving forward fairly quickly.  So I'm lobbying to keep it on the--out of the research only and into the moderate.  Certainly it might not be appropriate for everything we're doing, but there's so much interest both in body fatness and the childhood issues related to osteoporosis that I think those two--this is the measurement that give you some assessment of bone health and some assessment of body adiposity.  So that was why I was putting it in the moderate rather than research only.

    There's a comment from the floor.  I don't know if that's not possible at this point.

    DR. GARZA:  Not unless you want to ask them a question.

    DR. STALLINGS:  I don't.

    [Laughter.]

    DR. STALLINGS:  I don't want to break the rules.

    DR. GARZA:  That's certainly within the procedure.  You can ask if you want.

    DR. STALLINGS:  So that was why I put it in--sort of started that.

    DR. BRILEY:  Could I ask you a question?

    DR. GARZA:  Yes, Dr. Briley, that is certainly within the procedure.

    DR. BRILEY:  What percent of the centers in the United States currently have access to this kind of equipment?  This is not something I know about, so kind of fill me in about how many are already using it.

    DR. STALLINGS:  Well, it's like what we've been doing today.  You'll have to define "centers."

    DR. BRILEY:  Whatever you--

    DR. STALLINGS:  But I think most children's hospitals are going to have them or have them.  I think that Level 3 nurseries, which is where a lot of the sicker babies are often at adult hospitals, and they certainly have them.  We will probably never see, you know, private pediatric practices having them, but they would have access to them in their community, generally under the auspices of women's health centers and things like that.  But you've heard all the caveats about if you really are going to do this, you have to have a technician, an operator who's really good with kids and moms, and so they're all things that flow from that.

    But the current DXA technology is fairly widely available in middle-size towns and up and in all major medical centers.

    DR. GARZA:  Virginia, related to that, what percentage of centers that have an interest in doing growth or nutritional studies to you estimate now have DXA?  Is that more than half or less?  In terms of groups that might be engaged in this type of assessment.

    DR. STALLINGS:  Well, I think the groups that really define themselves as interested in growth, both in little--you know, failure to thrive and obesity, are going--either have it or are approaching 100 percent, because it is--it was a breakthrough technology in childhood body composition and growth studies.  So I think if they don't have them, it will be hard to stay in the field, and most of us who have gone from a lot of stable isotope work to this find this a much easier technology to work with with children of all ages.

    DR. GARZA:  Dr. Heubi?

    DR. HEUBI:  My comment was, although this is widely available, not everybody has pediatric software, and that becomes a bit of an issue.  But Ginanne is absolutely--she's absolutely correct about it.  Most centers that have GCRCs that are in pediatric centers have them, and they're broadly available.  And they're often free and at low cost in the GCRC--

    [Laughter.]

    DR. GARZA:  That sounds like a paid political announcement.

    DR. HEUBI:  This is an infomercial.

    DR. GARZA:  An infomercial, that's right.

    DR. STALLINGS:  He's the center director.

    [Laughter.]

    DR. HEUBI:  But it's true that most of them have--actually, I'd say--there are eight pediatric centers in the country, and I'd say pretty uniformly they all have them.  So it's widely available.

    DR. GARZA:  Would anyone like to speak against Ginanne's proposition?

    DR. DENNE:  I guess I'd just want to raise the question about what are you going to do with the data.  You know, how are we going to interpret the DXA scans?  Again, I'd love to have that as a reference, but how are we going to, in the context of a formula study, interpret the results?

    DR. STALLINGS:  Well, you know, we all can see that we don't have great reference data right now.  And, in fact, there's a major NIH-funded study looking at reference data down to a certain age that's going to address that in older children.  And I think one of the gaps that remains is having similar quality--I think it's a five-center study?  I think one of the challenges that remains is the reference data, but I would propose this would be used in the randomized trial setting where you would have enrolled groups randomized, and then you really could start to see are we having differences.

    My interests really are in all three compartments, actually, that--are we making children fat to the detriment of the fat-free mass?  Or are we making them fat and fat-free mass is fine?  And there are a number of issues that are coming along in the infant formula world and in the antecedents of osteoporosis that really directly have to do with bone mineral accretion.

    Now, it would be completely misrepresenting it to say that what goes on at 4 months we know has something to do even with 2 years being now ambulatory or at 7 or at that critical time sort of 10 to 14.  So--but, I mean, it is, I guess from the committee point of view, you know, it's futuristic, but I'm already pleased that we've added length to this.  You know, so we are trying to think about where do we want to go in the future.

    So, to me, the big deal is we've changed from just worrying about children not growing, which is failure to thrive, to trying to look at both sides of the growth spectrum.

    DR. DENNE:  And, again, I don't have--I think that data would be useful, but specifically if you find a difference in one compartment or another, what is it that you will do with that between formula?

    DR. HEUBI:  I think what it's going to require is the more longitudinal view, because one of the comments was made about not knowing anything below age 2 in terms of what its relationship to adult obesity is.  This would give us an opportunity to actually go another step further with follow-up studies in the future.  Again, it would require probably out of the scale of an infant formula trial but would be some information that could be added as added knowledge.

    DR. THUREEN:  Thureen.  I agree with what you said, the usefulness as a tool.  I just don't want to--this committee's recommendation come across as meaning that unless you have a DXA you shouldn't be involved in formula trials.  And I'd hate to see that happen because I know at most centers, pediatric GCRCs are at most children's hospitals, but there are still a lot of people out there who don't who have been involved in formula trials.  So I think we have to make it very clear that this--when we say moderately useful, it shouldn't be looked on as that kind of a center would have a preference for doing these types of studies if they had a DXA machine, because I don't think at this point in time that's realistic.

    DR. GARZA:  Okay.  There is still is somewhat of a split, is my sense, but if it's not required, then I think we've given the FDA a sufficient breadth of views on this that prolonging the discussion probably will not be very useful.  I don't think that there is a fundamental difference in how this technique is viewed, but where it might be placed.

    Okay.  Bioelectrical impedance was pretty much put into the useless category for right now, and with stable isotopes also being in the research category, and arm circumference possibly being placed alongside with skinfold thicknesses as of moderate use.  Is that...any objections to that summary?

    [No response.]

    DR. GARZA:  All right.  Then we're done with terms.  We'll start with preterms, and I'm trying to find...

    Dr. Thureen, would you like to start?

    DR. THUREEN:  Yes, I will start.  I think that body weight, recumbent length, and head circumference are critical.  I think that skinfold thickness and DXA are of moderate interest, but are of less use at this point in time than they actually are in terms infants because of the technical difficulties in using them.  And I think there is no role for bioelectrical impedance.  I think that stable isotope and other physical body measurements are not indicated at this time, are really in the research area.

    I do want to stress that in this age group, I think head circumference is critical because we use head circumference frequently for detecting significant abnormalities in growth, and it is a major way of looking at--or at least suggesting long-term neurological growth.

    DR. GARZA:  Okay.  Thank you.

    Dr. Denne?

    DR. DENNE:  I don't think I would add anything different to that.  That seems a reasonable position.

    DR. GARZA:  Okay.  Dr. Kuzminski?

    DR. KUZMINSKI:  I have to defer to the experts on this and agree to the same thing.

    DR. GARZA:  Dr. Moyer-Mileur?

    DR. MOYER-MILEUR:  I would just concur for preterm babies that head circumference is probably more critical than recumbent length and easier to obtain.

    DR. GARZA:  I didn't sense in the previous discussion--just to make sure that the minutes reflect the group's sentiments correctly--that head circumference, recumbent length and body weight were prioritized.  We said all three are required at the same time.

    DR. SIGMAN-GRANT:  I agree with everyone else so far.

    DR. GARZA:  Sigman-Grant.

    DR. DOWNER:  Downer.  Ditto for me, too.

    DR. GARZA:  And that's Dr. Downer.

    DR. ANDERSON:  Anderson.  The same.

    DR. GARZA:  We're on a roll.

    DR. HEUBI:  Heubi, except that I do want to say I'm glad that Patti came around to thinking that DXA was of moderate importance.

    [Laughter.]

    DR. STALLINGS:  And I would agree, and I was one of the ones who didn't put head circumference in priority for term, and I agree completely it's essential for preterm.  And I was just trying to have some distinguishing characteristics.

    DR. GARZA:  Okay.  Thank you.

    DR. BAKER:  I think I agree with everyone.  I would vote for head circumference being essential for preterm growth studies.

    DR. GARZA:  All right.  Dr. Briley?

    DR. BRILEY:  I agree with Patti.

    DR. GARZA:  All right.  Thank you.

    Roger, would you like to make any comments?

    DR. CLEMENS:  I certainly concur with the group.

    DR. GARZA:  Okay.  Well, gee, all right.

    [Inaudible comment off microphone.]

    DR. GARZA:  You can repeat it this time.

    DR. CLEMENS:  Officially, yes, I concur with the group.

    DR. GARZA:  You have to identify yourself.

    DR. CLEMENS:  Roger Clemens.  I'll get it out yet.

    DR. ANDERSON:  Then I think we've answered Question 2 in the way the groupings were made.  Does anyone want to address any aspect of two that you don't think we've addressed?

    DR. ANDERSON:  On behalf of the group, I say no.

    [Laughter.]

    DR. ANDERSON:  This is Jim Anderson.

    DR. GARZA:  Thank you, Dr. Anderson.

    All right.  Then let's move on to No. 3, and 3A is that the metrics above can be evaluated as either retained or absolute growth or velocity, rate of change.  Comment on the distinguishing values and merits of each static or variable method in the assessment of normal physical growth.  I'm going to limit the discussion to body weight, recumbent length, head circumference, skinfold thickness, because, in fact, none of the others were either recommended or seen as useful, so I don't see much point in our discussing each of those.

    Who would like to start?  Dr. Stallings?

    DR. STALLINGS:  I think that both attained growth and velocity are essential for the first three, for weight, length, and head circumference.  I think velocity would not be helpful in skinfold thickness because of the very small magnitude that we're moving through.  And in the age range we're looking at, I honestly don't know about mid-arm circumference since we've clustered that one.  I think the change is relatively modest, but I'll defer on that one.

    I think that covers it.

    DR. GARZA:  Well, the only other thing that might be useful, although it may be frosting on the proverbial cake, is whether you feel that velocity, since you said that it would be helpful for body weight, recumbent length, and head circumference, should all be obtained with the same frequency or whether you want to suggest any frequency of measure to be able to accurately reflect velocity.

    DR. STALLINGS:  Well, there are two issues here.  I'm still of the mind that most of the things that merit growth study would have a contemporary, comparable group, so that I would be measuring them--if you were going to existing data, I think the velocity curves are 1 months at this--birth to 4 months?

    DR. GARZA:  I just meant in terms of how often would you want measures of body weight, recumbent length, or head circumference obtained to be able to define velocity.

    DR. STALLINGS:  In a study--

    DR. GARZA:  In a study population.  Are three measurements sufficient, or do you want more frequent measurements?

    DR. STALLINGS:  A baseline and--well, baseline, and then assuming that was before 1 month, because everything we've heard said that that should be, and then 1 month, 2, 3, and 6.  Again, I guess I don't understand how long we're planning to run this.  A 4-month study?

    DR. GARZA:  Four to six months would be my--

    DR. STALLINGS:  Okay.  So baseline, 1 month, 2 months, 3, 4, and 6.

    DR. GARZA:  I'm just basing that on the previous question that said birth to 6 months.

    DR. STALLINGS:  Right.  Good.

    DR. GARZA:  I'm sorry.  Could you repeat that again?  You said birth--

    DR. STALLINGS:  Baseline--well, birth, the 14-day, then 1 months, 2 months, 3 months, 4, and 6.

    DR. GARZA:  So, in essence, monthly for the first 4 months, and then 6 months, the first month having measures at birth and at 2 weeks.

    DR. STALLINGS:  Right, or whatever that earliest--but something--I was calling it baseline.

    DR. GARZA:  Between 1 month and birth.

    DR. STALLINGS:  Right.  But something before--at 14 days as your baseline, and then again at 1 month.

    DR. GARZA:  Okay.

    DR. STALLINGS:  So that might be a 28-day or a 14-day measure.

    DR. GARZA:  I don't know that we have to be that exact, but it gives people a sense of how velocity would be obtained, or measured, at any rate.

    Dr. Heubi?

    DR. THUREEN:  Dr. Garza, can I make a comment?

    DR. GARZA:  Sure.

    DR. THUREEN:  I'd like to ask the committee if they'd agree on the time points that Dr. Fomon suggested this morning, the seven time points over the first 6 months of life, because I think that's what you're referring to.  And unless there's dissension from that, which I suspect there's not but I'd like to ask the committee, that those may be the time points that we'd recommend for longitudinal studies.

    DR. GARZA:  They pretty much coincided with those.

    DR. THUREEN:  But we could just call it the Fomon criteria from now on rather than--

    DR. GARZA:  That sounds good.  Thank you, Patti.

    Dr. Heubi?

    DR. HEUBI:  If you'll give me one moment, I just want to see what he actually said.

    DR. DENNE:  This is Denne.  I think it was Dr. Chumlea who suggested that time frame.  I think Dr. Fomon was sort of talking about 4 months as the criteria.

    DR. HEUBI:  I think birth, 2 weeks, 1, 2, 4, 5, and 6 months.

    DR. GARZA:  So the third month was omitted.  I think that's--we just need a general idea.  I don't know that we have to come up with the exact metrics or exact time points.  That's close enough.  There'd be some wobble.

    DR. SIGMAN-GRANT:  This is Sigman-Grant.  Is there any advantage to having a measure between birth and 2 weeks?

    DR. GARZA:  Between birth and 2 weeks, is there any advantage?  There is quite a bit of wobble, is the sense that I'm getting from the group again at the table because of weight loss during--

    DR. SIGMAN-GRANT:  But is that an important measure?

    DR. GARZA:  There are no standards that I'm aware of.

    DR. STALLINGS:  Right.  I think what you would end up there mostly would be with the individual variation and hydration and immediate pre-partum kinds of things, which wouldn't reflect the feeding experience as much.

    DR. SIGMAN-GRANT:  Okay.

    DR. STALLINGS:  And, also, just--I mean, I don't want to talk about practicality, but, you know, there is the issue of if we could get a birth, true birth measurement, and then 14 days or something like that, that would be great.  But I think what we're doing is just skipping the variability of when you regain your birth weight and establish full feeding.

    DR. GARZA:  In the interest of time, while Dr. Heubi's getting these things together, would you--maybe we can take both preterm and term together?  Would you change any of those recommendations for the preterm, or would you leave them pretty much the same for both term and preterm?

    DR. STALLINGS:  I think I would leave them pretty much the same, again, just recognizing the preterm--that the physical measurements on the preterm are more stressful, but we still have to have them to be able to make the decisions we need.

    DR. GARZA:  Would you recommend--well, going to 6 months post-conceptional equivalency or-

    DR. STALLINGS:  Yes, I would.

    DR. BAKER:  I think that both the static and the velocity measurements should be obtained for weight, length, and head circumference.  I don't think velocity is necessary for skinfold thickness or mid-arm circumference.  And I would agree with the time points as Ginanne said them, and I also think that it would be useful for both full-term and premature babies.

    DR. GARZA:  Okay.  Thank you.

    Dr. Heubi?

    DR. HEUBI:  Well, after all that, I don't think that I disagree.

    [Laughter.]

    DR. HEUBI:  I was looking very feverishly for this information.  I think Jim showed it to me.  So I don't think that--I think that I would agree with the same time points and the same measurement parameters would be quite appropriate, and for preterm infants, I think it would be 6 months post-conceptional age.  I think that's the issue that has to be addressed, and I think that should be left up to Dr. Denne and Dr. Thureen.

    DR. GARZA:  All right.

    DR. ANDERSON:  Anderson.  Agree.

    DR. DOWNER:  Downer.  I, too, agree.  Body weight, recumbent length, and head circumference at birth, 2 weeks, 1 months, 2, 3, 4, and 6 month intervals.  I'd also agree that skinfold thickness and mid-arm circumference, the velocity for that is not important here.

    DR. SIGMAN-GRANT:  Sigman-Grant.  Ditto.

    DR. MOYER-MILEUR:  Moyer-Mileur.  I agree for the term infant on the attained and velocity measures as well as the measurement periods.  But for the preterm baby, I think we need to keep in mind that some of these studies will be done in-hospital versus post-discharge, and I think in the hospital that your intervals need to be different.  And so it would be wherever--a set baseline, and then probably every two weeks until that child is discharged, and then go into the 1-month interval to 6 months post-conception age.

    DR. GARZA:  Okay.  Dr. Kuzminski?

    DR. KUZMINSKI:  Thank you.  I agree.

    DR. DENNE:  Yes, I would agree for the--

    DR. GARZA:  Dr. Denne?

    DR. DENNE:  I'm sorry.  Yes, I would agree for the term, and also agree that preterm needs more frequent measurements in-hospital, probably every 2 weeks, every week to 2 weeks, something like that.

    DR. GARZA:  Dr. Thureen?

    DR. THUREEN:  Thureen.  I agree on the term infant, and, again, the preterm infant, it's going to be really critical looking at what their growth rate is over their hospitalization period, especially when they're sick.  So initially it may need to be every week, but probably no less than every 2 weeks during hospitalization.

    DR. BRILEY:  And I agree.

    DR. GARZA:  I'm going to ask the group the question of whether somebody is willing to comment on the merits and value of having both static and velocity measurements.  I think we've got fairly good agreement, other than on the number of measurements, and the frequency of measurements of premature.  Would there be any objection if we summarize it?  In fact, the in-hospital phase of the premature management should probably be more frequent.  Every two weeks seems appropriate.

    DR. STALLINGS:  That was actually part of my question because the--Stallings.  I think the question you bring up about weekly measurements, while hospitalized--

    DR. GARZA:  She said every two weeks, I think.

    DR. STALLINGS:  No, it was--

    DR. GARZA:  Is it weekly or--weekly.

    DR. STALLINGS:  Both neonatologists considered weekly measurements, so that was why I wanted to bring that back up because I would agree with that, if you think it's merited, because things are happening so fast, and they could either be in a rapid rate of growth or a very clear static phase.  So I was asking the two neonatologists to comment on one versus two weeks, and then the measurement error and all of the things that go from that.

    DR. THUREEN:  This is Thureen.  I would advocate for one week because the fastest rate of growth is actually the in utero rate of growth, and so potentially, if these infants are growing at the in utero rate, you've got it higher than you have during the growth spurt post-natally?  So I think that there are going to be dramatic changes even over a week period, and I would advocate for weekly measurements.

    DR. DENNE:  Yes, I would basically agree.  It depends a little bit on the study design, and when patients get entered and those kinds of things.  I guess the principle would be more weekly measurements early and spacing out later, but--

    DR. GARZA:  Dr. Anderson?

    DR. ANDERSON:  I would only ask that we consider the distinction between what's required, for lack of a better term, good patient care, and what's required for the purposes of documenting normal infant growth in the setting of an infant formula study.  I wonder whether, in the latter, every two weeks might not be sufficient.

    DR. THUREEN:  Thureen.  I think pretty much every neonatal unit measures these parameters on a weekly basis anyway.  Whether the actual research measurement would be more destabilizing I think is difficult to say.  I'd also say that in most units, if your patient is unstable, you don't even do your routine measurements.  You just have to have some latitude for those kind of decisions, so ideally every week, but if the patient is not stable or if it appears to be a difficult measurement for a patient, then I think you may have to be a little bit more flexible.

    DR. DENNE:  Yes, and obviously that partly depends on the study design, but things do happen quickly over a week.  These measurements are clearly possible over a week, as opposed to later on when it's tough to make that difference over a week.

    DR. GARZA:  Roger?

    DR. CLEMENS:  I do want to refer to that.  In terms of body weight and length, we do, the infant formula manufacturers, do conduct these studies, and when they do, they measure length and weight at the 14, 28, 56, 84, 112 days.  Clearly, the Fomon formula, if you will, when we conduct four-month studies, and certainly to include two more data points would not be a big deal if required by statute to go on to six months for these kinds of growth studies.

    We do collect skinfolds, but not as frequently at this point in time.

    DR. GARZA:  Any other comments regarding the one or two weeks for the prematures?

    [No response.]

    DR. GARZA:  All right.  What about the values and merits of attained and velocity, does anyone want to speak to those any more than you may have in your comments?

    Dr. Anderson?

    DR. ANDERSON:  Perhaps I'm not thinking of this correctly, but it does seem to me, if we have the data points at the specified times that we've talked about, then the data can be presented, in either way, simply through a data transformation.  And so if people are normally used to thinking about velocities, they can be recorded as velocities, and if they're used to thinking about weights and specific I points, they can be recorded that way.

    DR. GARZA:  Any other points that anyone wishes to make in response to that request from the Agency?

    [No response.]

    DR. GARZA:  Then why don't we move on to 3B, and here again I think we can take both the preterm and the term together.  The outcomes above can also be evaluated as individual infant data or as group comparative data, comment on the values and merits of using individual or aggregate data in the assessment of normal physical growth.

    DR. DENNE:  Dr. Garza, before we start, I'm not sure I understand that question.

    DR. GARZA:  I think what they're asking is should the Agency be presented with the individual growth data for individual subjects or just the aggregate data for the entire group.

    DR. DENNE:  So in terms of what's actually submitted to the Agency.

    DR. GARZA:  Or here it's phrased, in terms of the values and merits of each one, but I suspect that that's why they'd like to know that.

    Is there value in submitting both and or merit, and would one or the other suffice?

    Dr. Anderson?

    DR. ANDERSON:  I think that either would suffice.  For longitudinal data like this, there's often a choice in terms of how one both presents and analyzes such data, and it reflects essentially the question here; that is, one can analyze parameters at the patient-specific level and then estimate population parameters, considering the patients as random realizations from that population, so each patient would have specific parameters, some showing a strong growth tendency, others less so, and that the parameters, the estimates would be the average of those parameters in the population or one could take what's often called a population-average approach, which is essentially one is not interested in the parameters of individual subjects, but how they're realized in the observed population, and there the focus is more naturally on the population values that are observed at individual time points.

    My own personal view is that the answer that you get from the two approaches are oftentimes largely the same, and that I would think that either approach here would be acceptable.

    DR. GARZA:  But what sort of information would you recommend be obtained if population or group data are being submitted that would describe the distribution?  Would a mean and standard deviation be sufficient or do you think we ought to look at "skewdness" or whether, in fact, it's normally distributed?  Can you elaborate a bit on what you think would be the essential information you would need to assess the distribution?

    DR. ANDERSON:  Well, we're largely getting ahead of ourselves because we haven't talked about what we would be comparing things to.

    DR. GARZA:  Assuming only one group, and you're going to look at individual data versus group data.

    DR. ANDERSON:  If it was a single group, I would think that you would want more than simply means and standard deviations because, as we've seen from the, from the growth charts, the data can be somewhat skewed at various time points, and so either some summary of the data reflects a transformation towards normality at each of the time points or some other type of display which allowed one to see not only the measure of central tendency, but also a sense of the general distribution at the times when the data were collected would be appropriate.

    DR. GARZA:  Drs. Heubi or Downer, would either one of you like to go next?

    It looks like you're the winner, Dr. Heubi.

    DR. HEUBI:  Did I win?  And it's still on.

    I tend to think that it probably would be appropriate to submit all of the data showing individual data points, in addition to some summary data, just because it looks to me like based upon the size of these groups are likely to be such that it would be a value to see what the distribution looks like, in addition to the summary statistics.

    DR. GARZA:  Can you elaborate a bit on the value of seeing both?

    DR. HEUBI:  It would allow someone to look at how normally distributed the data was and how whether there were outliers that were not reflected by the summary statistics.

    DR. GARZA:  Dr. Stallings?

    DR. STALLINGS:  Stallings.  I would like to see both.  The reason for the individual data I think is important to, I mean, I can't give us a good example of what's happened historically, but I think it would be important to see a group of children that would be on both ends of the distribution, and again particularly also by gender.  I mean, I think there are some different things going on, so I'm assuming we get the different segmental analyses that we had talked about.

    So I would be really interested in seeing how many individual babies fell out less than a fifth, less than a third greater than the 95th.  Now, the logical question is exactly what am I going to do with that data, and I don't really know that, except that I think if I were in a regulatory environment and I saw that, yes, most of the kids were doing just fine, but there really was a group that wasn't, I would want to explore that more, and that might start the dialogue to find out more about that.

    DR. GARZA:  Dr. Baker?

    DR. BAKER:  I would certainly want the individual data I think, just thinking about the same sort of things that there are a possibility of outliers that would trigger you to think more about the whole problem.  I don't think we can give guidelines about what exactly you would do, but I certainly think it's worth looking at.

    I think some group data would also be useful sort of, but I think certainly the individual data and a good bit of the group data would be useful.

    DR. GARZA:  Dr. Clemens?

    DR. CLEMENS:  Clemens.  You caught me early.

    DR. GARZA:  I called you earlier, that's right.

    DR. CLEMENS:  I appreciate that.  Actually, the pro forma to manufacturers provide all of the data, to provide individual data, as well as group data, and so they can clearly see, to your comment there, Ginanne, they can determine if there's any outliers, if you will.  Also, if there are any outliers, potential outliers, as there is in any population group, then, actually, the clinical or the medical director provides clinical comments, as well as to the principal investigator, to provide clinical comments specific to that particular child, so you get here exactly each child.

    DR. GARZA:  Thank you.

    Dr. Downer?

    DR. DOWNER:  Thank you.  I'm still pondering this.  I think both data sets would be important.  As Dr. Stallings said, I'm not sure what we would do with the individual data, from a group perspective, but it would provide some information on the substance within the group, and that's important.  So I think both data sets would be important.

    DR. SIGMAN-GRANT:  Sigman-Grant.  I think both sets are important.  I think, if you look at individuals, you might see clustering, perhaps by ethnicity or something else that you might not see if you only saw population data, and that might be helpful information.  So that's like clustering of individuals.

    DR. GARZA:  Dr. Moyer-Mileur?

    DR. MOYER-MILEUR:  Moyer-Mileur.  I would agree that both individual and summative data is important.

    DR. GARZA:  Dr. Kuzminski?

    DR. KUZMINSKI:  Both data, please.

    DR. DENNE:  Denne.  I would concur with the previous comments.

    DR. GARZA:  Both in terms of the--

    DR. DENNE:  Yes.

    DR. THUREEN:  Thureen.  I would concur.  I have nothing to add to the comments that have been made from that side of the table.

    DR. BRILEY:  Briley.  I would concur.  I also think about this might be the first time that we have these data, and the next time around, it would be nice to do some comparison with it.  So it would be nice to have both.

    DR. GARZA:  All right.  Does anyone want to add any comments to what you may have said earlier on this question?

    [No response.]

    DR. GARZA:  If not, there seems to be a consensus that both individual and group data should be evaluated, and the values and merits, having to do with the ability to assess the distribution and potential outliers, et cetera, in a way that perhaps summary statistics may not lend themselves to easily.

    It is 5:30, almost.  We can go ahead and try to do the next one or two questions, if you're up to it, or we can try to get the van here by 6 o'clock, and we will still be here.  Let me confer a bit.  It'll be here at 6:00, so why don't we go ahead and do the next one.

    DR. SIGMAN-GRANT:  It's a really big one.

    DR. GARZA:  It is a big one, you're right.

    Dr. Briley?

    DR. BRILEY:  Don't you think we can do the question now and not have to do the work tonight?  Would that be a trade-off you'd allow?

    [Laughter.]

    DR. GARZA:  Oh, that doesn't buy us more time, Dr. Briley.

    DR. SIGMAN-GRANT:  This is Sigman-Grant.  I think both of these require a lot of, it's going to be a lot of discussion, the difference between reference and standards.

    DR. GARZA:  You notice that I said it with some trepidation, and you're absolutely right.  I'm trying to think how one could use the next 30 minutes most productively, given the fact that the idle mind tends to be the devil's workshop.

    Why don't we then work on Question 7 on your own, as Dr. Briley suggested.

    Dr. Denne, did you--

    DR. DENNE:  Well, I just wonder whether Question 4, whether there wouldn't be reasonable consensus about the answer to that question.  I mean, my sense is there might be, but--

    DR. GARZA:  We have one potential masochist in the group.

    [Laughter.]

    DR. GARZA:  Do I see another one that's willing to start with No. 4?

    DR. STALLINGS:  Take Question 5.

    DR. GARZA:  Well, 5 might benefit from having 4 in the discussion, and that's why I didn't want to take them out of order.  I thought of that, and so why don't we begin and then see whether, in fact, we are making progress, and we don't have to come to a forced conclusion because Dr. Sigman-Grant is right, it's probably going to take quite a bit of discussion, but Dr. Denne may prove us wrong, in the fact that we can come to a consensus in 30 minutes, and that would save us an enormous amount of time tomorrow.

    So let's, since the bus can't pick us up until 6:00, let's try and do 4:00.  It's for adequate evaluation of normal physical growth.  Below are examples of clinically distinct reference groups, and we really have three.  One, our concurrent controls, a reference data uses a control, and historical control, and I'm assuming that by reference, what is meant is a more comprehensive data base with historical controls, perhaps referring to a specific formulation that might have been looked at in the past, but not necessarily information that may be in the public domain.  Is that an accurate interpretation of that?  Dr. Walker is saying yes.

    So a reference data, think of it as something that is in the public domain that is quite extensively documented, as opposed to a historical control that might be more limiting in that it may just be either proprietary information or information that relates to a specific formulation from a specific one single article or one single study.

    So it might be seen as a historical control, but not necessarily as a reference.  I'm at a loss as to what other groups, but let's get those on the table.

    DR. HEUBI:  I guess it's possible that that is referenced as longitudinal data, as opposed to other, which might be cross-sectional.  That would be another option for other.

    DR. GARZA:  You mean that would be under reference or historical that it could be either longitudinal, the substance of those?

    DR. HEUBI:  Well, I'm saying reference data used as controls would be things like the Fels-Iowa data, as opposed to the CDC data for other as cross-sectional.

    DR. GARZA:  I'm sorry.  In my description, I meant to include both Fels and CDC, for example, as reference data.

    DR. STALLINGS:  So not just longitudinal--

    DR. GARZA:  And subsets of that.  It could be cross-sectional.

    DR. HEUBI:  it would be relatively easy to rank these if you separate those out and make it the other way, so it's cross-sectional being your last category.

    DR. GARZA:  Okay.

    DR. HEUBI:  Because that's the least desirable.

    DR. GARZA:  So you would say cross-sectional, and we would interpret concurrent controls, reference data, and historical as longitudinal, and then other would be cross-sectional or would you prefer seeing six cells with concurrent controls being longitudinal or cross-sectional?  No?

    DR. HEUBI:  This may be a long time.

    DR. GARZA:  That's more complicated, but I think it's going to be complicated by just saying all of the above is cross-sectional, but let's try it.

    Yes?

    DR. SIGMAN-GRANT:  I have a question.

    DR. GARZA:  Sure.

    DR. SIGMAN-GRANT:  Sigman-Grant.  Are we referring to this as reference or standard?

    DR. ANDERSON:  This is Anderson.  I think we can't do anything other than refer to this as reference because that's what they are.

    DR. GARZA:  So that the implication of that is that you're not going to be making any value judgment on the basis of the comparison, but that you're suggesting we limit the discussion to just the fact that if it's going to be a comparison, it'll be more or less, not better or worse.  Is that clear to everyone?  Because it's an important distinction.

    Now, we may want to take up the issue of standard in number five, but for right now we would limit this to a reference, and so that what one would say is more or less, not better or worse.

    Dr. Anderson?

    DR. ANDERSON:  With the permission of the chair, let me take a stab at this.

    DR. GARZA:  Yes, please do.

    DR. ANDERSON:  My own personal view is that the least helpful of what we've discussed are historical controls, largely because they tend to be small populations collected in a process which is often difficult to describe, and the temporal changes that can occur make it highly likely that historical controls will--it will be extraordinarily difficult to convince individuals that the historical control accurately reflects an appropriate comparison group.

    And then notwithstanding my attempting to push the envelope during the discussion today, I think that using reference data is not that much better, largely because it's just that.  It's reference data, and so as a result of the conduct of a study, we can say that the results were greater than or less than the standard--sorry--the reference, but I don't know how to interpret that.  So we end up with the study group having a mean weight at six months, which is the 55th percentile of those standard.

    The conclusion of that is I don't know how you get beyond that, and so my view is that if it's felt that it's important to evaluate normal physical growth, it should be done in the context of concurrent randomized controls.

    DR. GARZA:  Dr. Clemens?

    DR. CLEMENS:  Clemens.  I trust that we don't make a decision recommendation this evening on this, Bert.  Tomorrow, we have some public comment available and also the three medical directors for the manufacturer was making comment on this question, and other related questions, which we have not addressed tonight.  So, again, I trust that we don't make a recommendation to the FDA this evening.

    DR. GARZA:  We can hold off on that until tomorrow.

    DR. CLEMENS:  I would trust that--discussion is good, but no recommendation at this time, please.

    DR. THUREEN:  Thureen.  I'd like to note that Dr. Anderson's comments closely reflect Dr. Frongillo's report of earlier today, where he really elegantly laid out the advantages and disadvantages of concurrent reference and historical reference data or controls, and I was already convinced this morning, after his talk, and I'm now more convinced after hearing your opinion.

    DR. GARZA:  One of the things that I'm reading into what you said, Dr. Anderson, is that the absence of standards really limits us in terms of making those comparisons, so that having a concurrent randomized control group makes a lot of sense; is that one of the major factors in your conclusion?

    DR. ANDERSON:  Yes, you said that very well.  I think that if there were data which were generally agreed reflected in an agreed upon standard, and I have not heard that today, then the discussion could be broadened because then it would be possible, for instance, to say that any outcome which led to results which were within .2 standard deviations of the standard would be considered acceptable evidence of normal physical growth, in the absence of a standard, and with the presence of what I gather are generally agreed upon as references, that kind of argument is not permitted.

    DR. GARZA:  Dr. Baker?

    DR. BAKER:  I'd just like to put in my two cents here.  I think that we all agree that the gold standard, the thing we all like to see, is a concurrent randomized control group.

    DR. GARZA:  Is the mike on?  I can't tell.

    DR. BAKER:  What we'd like to see is a randomized control group, and so that we put as number one, and then I would put a longitudinal reference group as kind of the second choice, and then a cross-sectional reference group as a third choice, and most often, the historical control is way down there.

    On the other hand, I would have to say that there might be some circumstances where the historic control group might be reasonable, and that would be in a situation where you were doing rapid sequence studies with very similar formulas and just changing one thing.  So you'd use, for those series of studies, you might use one historic control group, but--

    DR. GARZA:  You may want to hold the mike closer to you.

    DR. BAKER:  --but to think about using a control group from 10 years ago, seems to me, unreasonable.

    DR. GARZA:  Thank you.

    Dr. Heubi?

    DR. HEUBI:  Heubi.  I would agree with Rob.  I think that historical controls are probably only acceptable if they're fairly recent, and whatever fairly recent means, I don't know, within a year, two, three years, but not ten years.  But the ordering, I think I would agree with him totally, concurrent controls should be the gold standard, and a longitudinal reference group, like Fels-Iowa, would be a second, and then a cross-sectional and then finally historical controls.

    DR. GARZA:  Dr. Stallings?

    DR. STALLINGS:  I think the concurrent control group is essential.  I think often when the rest of us are doing things that are research, we wouldn't be able to do research without an adequate control group, and we're not setting policy or regulating anything or making products that, you know, millions of infants will be exposed to.  So I really do feel strongly about that, and I concede that there may be a time, if you really are designing things, that the concurrent control group might be able to serve for a series off studies.

    I feel like the historical, as defined when we started this, the historical group, which might be proprietary and might be specific to a single formulation, a small population, less-well described, I really don't think that's adequate sort of for the modern times, partly because I think it's not in the public domain.

    And if we're saying you don't have to do the study, which might remain proprietary, that somebody really needs to agree that, other than the regulatory group, agree that it was a good choice.

    So I think it does come down to the control groups, if we reach the level where a growth study is required, then it should be an optimal growth study.

    The issue of the reference data, I really learned a lot over the last couple of days reading about this and listening to the comments.  I think I have come to believe that because the reference data represent so many years, and we really don't know what's optimal, and that's, you know, we need many more days conference on that, that I'm less inclined to use reference data.  I would have it available for descriptors because what you would want to know, if there is some complete quirk, and now the control group and the study group happen to all be at the one standard deviation of everything, you would want to know that, that that's the data you would be looking at, but I don't think it would be--and I would worry about how we got into that fix--but that would be a bit different.

    So I'm really voting for the contemporary control groups.

    DR. DOWNER:  Downer.  I, too, agree that the historical control would not have much implications for the growth study at this time and that the gold choice, of course, the concurrent randomized trials, would be the one to select.  The longitudinal reference and cross-sectional reference, too, would not be of great importance here, but it would be good to look at over the long haul just to see what has happened and to make comparisons over time.

    DR. SIGMAN-GRANT:  This is Sigman-Grant.  I, too, agree that the concurrent controls would be the most ideal.

    I'm sitting here reflecting.  It's interesting because when I work prenatally, there's the issue of what happens prenatally determines potentially what might happen postnatally, the so-called metabolic imprinting in Barker, and yet we seem to think that, okay, we're just looking from birth on, and so without a concurrent group and using previous information makes me question if, indeed, metabolic imprinting, as a hypothesis, is explored and expanded.

    There's a lot of difference, maternal differences, between, say, the '50s, '60s and '70s and what's now recommended--weight gain, smoking, alcohol use.  If that, indeed, affects infant growth, postnatally, which it probably does, it might, then if we use data that was presented for those years, are the infants truly representative of the cohort from today?

    DR. GARZA:  Are there any other comments that anyone would like to add?  And we'll come back to this issue tomorrow after the public comment period.  Any other--I'm sorry, go ahead.  I didn't see you.

    DR. MOYER-MILEUR:  That's okay.  Moyer-Mileur.  I would just like to say I assume we're all talking from the term baby point of view at the moment, but for preterm babies, there is no reference.  So I think for preterm studies they always have to have a concurrent control because medical care in the NICUs is changing constantly, and you can't really use a historical control from even a year in the past.

    DR. GARZA:  That's a good point.

    Any other comments?  Your free to speak today.  We're going to just come back to it tomorrow.  I want to make sure that we've got the time.  So you can speak today or tomorrow.

    DR. KUZMINSKI:  Just a comment.  I concur with the comments made by the rest of the committee on concurrent controls, as the gold standard, but I feel that there is place for historical controls.  To me, it depends upon how, an agreement with another comment that's been made, how long back in time that history is, in terms of that data.  I think a driving point also is how relevant was that study population to the current study population that is being proposed in the current study?

    It gets back to my question, I guess, I posed to two of the speakers, Dr. Denton and Dr. Bier today, what is that gray area that we use experience to make a judgment not to use, not to do a clinical growth study, and do we just automatically default to doing a study?  I think that's a very difficult question to answer.

    This hits on it, also, on historical controls, and the reference data, I agree with the others as the third most--

    DR. DENNE:  Denne.  I guess I don't have much to add.  I would agree with the previous comments about the value of concurrent controls and the significant lower value of any other approach.

    DR. THUREEN:  Thureen.  I agree.

    DR. BRILEY:  Briley.  I would like to make a statement in regard to what our current consumer is, in terms of it's different than what we've had in previous years, like Maureen had said, and so I feel like the gold standard is the concurrent controls.  I think it has to be there because society has changed, and the young mothers today have a different role to play than what they did 10 years ago, and 10 years is too long.

    DR. GARZA:  Would any of you have any questions to any of the presenters that might have addressed this topic earlier today?

    [No response.]

    DR. GARZA:  Then, we pretty much have dealt with, A, for term infants, we had Dr. Moyer-Mileur addressed the preterm as being a special case where, in fact, that's even truer in terms of reliance on a concurrent control.

    Does anyone want to speak to preterms or would the discussion that we've had, in your view, pretty much reflect your sense of preterms as well?

    [No response.]

    DR. GARZA:  People are pretty tired, Dr. Sigman-Grant is telling me.

    DR. STALLINGS:  Dr. Stallings.

    Just to clarify, though, I think what Laurie was addressing, we had some diversity about whether concurrent controls were almost the only thing or they are second and third rank.  I think you were suggesting, and correct me, that in preterm you really have to have concurrent controls all the time because of the changes in care practices and how rapidly things can happen and because we simply don't know enough about those babies, for all of the practice environment of a preterm environment.

    So if that's what you were saying, I would agree with that, that it isn't a one, two, three; it's really sort of a one.

    DR. GARZA:  I'll ask the group, as we conclude today, to please think about, B, because we pretty much addressed A.  To a certain degree, we've addressed B, but I'd like to come back to that tomorrow after the public comment period, and then, depending on what we do with B, more explicitly, perhaps even returning to C, but maybe that won't be necessary, and then we'll pick up 5, and 6, and 7 at the remaining time.

    I want to thank the group because, Dr. Sigman-Grant is absolutely correct, you guys have worked hard today, and you have one more chore before tomorrow morning and that is thinking about Question 7.  You may want to do that after dinner, rather than before.

    I want to thank our presenters, because they've obviously made this discussion much, much easier than it otherwise may have been, and staff for getting this organized as well as it has been.  So to each of the players, thank you, and we'll be back tomorrow at 8 o'clock--8:15.  I've been corrected.

    [Whereupon, at 5:51 p.m., the proceedings were adjourned, to reconvene at 8:15 a.m., the next day, Tuesday, November 19, 2002.]