A call for transparent reporting to optimize the predictive value of preclinical research

Story C. Landis; Susan G. Amara; Khusru Asadullah; Chris P. Austin; Robi Blumenstein; Eileen W. Bradley; Ronald G. Crystal; Robert B. Darnell; Robert J. Ferrante; Howard Fillit; Robert Finkelstein; Marc Fisher; Howard E. Gendelman; Robert M. Golub; John L. Goudreau; Robert A. Gross; Amelie K. Gubitz; Sharon E. Hesterlee; David W. Howells; John Huguenard; Katrina Kelner; Walter Koroshetz; Dimitri Krainc; Stanley E. Lazic; Michael S. Levine; Malcolm R. Macleod; John M. McCall; Richard T. Moxley III; Kalyani Narasimhan; Linda J. Noble; Steve Perrin; John D. Porter; Oswald Steward; Ellis Unger; Ursula Utz; Shai D. Silberberg

doi:10.1038/nature11556

Open Access
Published: 10 October 2012

A call for transparent reporting to optimize the predictive value of preclinical research

Nature volume 490, pages187–191(2012)Cite this article

10k Accesses
670 Citations
152 Altmetric
Metrics details

Subjects

Preclinical research

Abstract

The US National Institute of Neurological Disorders and Stroke convened major stakeholders in June 2012 to discuss how to improve the methodological reporting of animal studies in grant applications and publications. The main workshop recommendation is that at a minimum studies should report on sample-size estimation, whether and how animals were randomized, whether investigators were blind to the treatment, and the handling of data. We recognize that achieving a meaningful improvement in the quality of reporting will require a concerted effort by investigators, reviewers, funding agencies and journal editors. Requiring better reporting of animal studies will raise awareness of the importance of rigorous study design to accelerate scientific progress.

Download PDF

Main

Dissemination of knowledge is the engine that drives scientific progress. Because advances hinge primarily on previous observations, it is essential that studies are reported in sufficient detail to allow the scientific community, research funding agencies and disease advocacy organizations to evaluate the reliability of previous findings. Numerous publications have called attention to the lack of transparency in reporting, yet studies in the life sciences in general, and in animals in particular, still often lack adequate reporting on the design, conduct and analysis of the experiments. To develop a plan for addressing this critical issue, the US National Institute of Neurological Disorders and Stroke (NINDS) convened academic researchers and educators, reviewers, journal editors and representatives from funding agencies, disease advocacy communities and the pharmaceutical industry to discuss the causes of deficient reporting and how they can be addressed. The specific goal of the meeting was to develop recommendations for improving how the results of animal research are reported in manuscripts and grant applications. There was broad agreement that: (1) poor reporting, often associated with poor experimental design, is a significant issue across the life sciences; (2) a core set of research parameters exist that should be addressed when reporting the results of animal experiments; and (3) a concerted effort by all stakeholders, including funding agencies and journals, will be necessary to disseminate and implement best reporting practices throughout the research community. Here we describe the impetus for the meeting and the specific recommendations that were generated.

Widespread deficiencies in methods reporting

In the life sciences, animals are used to elucidate normal biology, to improve understanding of disease pathogenesis, and to develop therapeutic interventions. Animal models are valuable, provided that experiments employing them are carefully designed, interpreted and reported. Several recent articles, commentaries and editorials highlight that inadequate experimental reporting can result in such studies being un-interpretable and difficult to reproduce^1–8. For instance, replication of spinal cord injury studies through an NINDS-funded program determined that many studies could not be replicated because of incomplete or inaccurate description of experimental design, especially how randomization of animals to the various test groups, group formulation and delineation of animal attrition and exclusion were addressed⁷. A review of 100 articles published in Cancer Research in 2010 revealed that only 28% of papers reported that animals were randomly allocated to treatment groups, just 2% of papers reported that observers were blinded to treatment, and none stated the methods used to determine the number of animals per group, a determination required to avoid false outcomes². In addition, analysis of several hundred studies conducted in animal models of stroke, Parkinson’s disease and multiple sclerosis also revealed deficiencies in reporting key methodological parameters that can introduce bias⁶. Similarly, a review of 76 high-impact (cited more than 500 times) animal studies showed that the publications lacked descriptions of crucial methodological information that would allow informed judgment about the findings⁸. These deficiencies in the reporting of animal study design, which are clearly widespread, raise the concern that the reviewers of these studies could not adequately identify potential limitations in the experimental design and/or data analysis, limiting the benefit of the findings.

Some poorly reported studies may in fact be well-designed and well-conducted, but analysis suggests that inadequate reporting correlates with overstated findings^{9,10,11,12,13}. Problems related to inadequate study design surfaced early in the stroke research community, as investigators tried to understand why multiple clinical trials based on positive results in animal studies ultimately failed. Part of the problem is, of course, that no animal model can fully reproduce all the features of human stroke. It also became clear, however, that many of the difficulties stemmed from a lack of methodological rigor in the preclinical studies that were not adequately reported¹⁴. For instance, a systematic review and meta-analysis of studies testing the efficacy of the free-radical scavenger NXY-059 in models of ischaemic stroke revealed that publications that included information on randomization, concealment of group allocation, or blinded assessment of outcomes reported significantly smaller effect sizes of NXY-059 in comparison to studies lacking this information⁹. In certain cases, a series of poorly designed studies, obscured by deficient reporting, may, in aggregate, serve erroneously as the scientific rationale for large, expensive and ultimately unsuccessful clinical trials. Such trials may unnecessarily expose patients to potentially harmful agents, prevent these patients from participating in other trials of possibly effective agents, and drain valuable resources and energy that might otherwise be more productively spent.

A core set of reporting standards

The large fraction of poorly reported animal studies and the empirical evidence of associated bias^{6,9,10,11,12,13,15,16,17,18,19}, defined broadly as the introduction of an unintentional difference between comparison groups, led various disease communities to adopt general^20,21,22 and animal-model-specific^6,23,24,25 reporting guidelines. However, for guidelines to be effective and broadly accepted by all stakeholders, they should be universal and focus on widely accepted core issues that are important for study evaluation. Therefore, based on available data, we recommend that, at minimum, authors of grant applications and scientific publications should report on randomization, blinding, sample-size estimation and the handling of all data (see below and Box 1).

Box 1: A core set of reporting standards for rigorous study design

Randomization

•Animals should be assigned randomly to the various experimental groups, and the method of randomization reported.

•Data should be collected and processed randomly or appropriately blocked.

Blinding

•Allocation concealment: the investigator should be unaware of the group to which the next animal taken from a cage will be allocated.

•Blinded conduct of the experiment: animal caretakers and investigators conducting the experiments should be blinded to the allocation sequence.

•Blinded assessment of outcome: investigators assessing, measuring or quantifying experimental outcomes should be blinded to the intervention.

Sample-size estimation

•An appropriate sample size should be computed when the study is being designed and the statistical method of computation reported.

•Statistical methods that take into account multiple evaluations of the data should be used when an interim evaluation is carried out.

Data handling

•Rules for stopping data collection should be defined in advance.

•Criteria for inclusion and exclusion of data should be established prospectively.

•How outliers will be defined and handled should be decided when the experiment is being designed, and any data removed before analysis should be reported.

•The primary end point should be prospectively selected. If multiple end points are to be assessed, then appropriate statistical corrections should be applied.

•Investigators should report on data missing because of attrition or exclusion.

•Pseudo replicate issues need to be considered during study design and analysis.

•Investigators should report how often a particular experiment was performed and whether results were substantiated by repetition under a range of conditions.

Randomization and blinding

Choices made by investigators during the design, conduct and interpretation of experiments can introduce bias, resulting in false-positive results. Many have emphasized the importance of randomization and blinding as a means to reduce bias^{6,20,21,22,26}, yet inadequate reporting of these aspects of study design remains widespread in preclinical research. It is important to report whether the allocation, treatment and handling of animals were the same across study groups. The selection and source of control animals needs to be reported as well, including whether they are true littermates of the test groups. Best practices should also include reporting on the methods of animal randomization to the various experimental groups, as well as on random (or appropriately blocked) sample processing and collection of data. Attention to these details will avoid mistaking batch effects for treatment effects (for example, dividing samples from a large study into multiple lots, which are then processed separately). Investigators should also report on whether the individuals caring for the animals and conducting the experiments were blinded to the allocation sequence, blinded to group allocation and, whenever possible, whether the persons assessing, measuring or quantifying the experimental outcomes were blinded to the intervention.

Sample-size estimation

Minimizing the use of animals in research is not only a requirement of funding agencies around the world but also an ethical obligation. It is unethical, however, to perform underpowered experiments with insufficient numbers of animals that have little prospect of detecting meaningful differences between groups. In addition, with smaller studies, the positive predictive value is lower, and false-positive results can ensue, leading to the needless use of animals in subsequent studies that build upon the incorrect results²⁷. Studies with an inadequate sample size may also provide false-negative results, where potentially important findings go undetected. For these reasons it is crucial to report how many animals were used per group and what statistical methods were used to determine this number.

Data handling

Common practices related to data handling that can also lead to false positives include interim data analysis²⁸, the ad hoc exclusion of data²⁹, retrospective primary end point selection³⁰, pseudo replication³¹ and small effect sizes³².

Interim data analysis

It is not uncommon for investigators to collect some data and perform an interim data analysis. If the results are statistically significant in favour of the working hypothesis, the study is terminated and a paper is written. If the results look ‘promising’ but are not statistically significant, additional data are collected. This has been referred to as ‘sampling to a foregone conclusion’ and can lead to a high rate of false-positive findings^28,29. Therefore, sample size and rules for stopping data collection should be defined in advance and properly reported. Unplanned interim analyses, which can inflate false-positive outcomes and require unblinding of the allocation code, should be avoided. If there are interim analyses, however, these should be reported in the publication.

Ad hoc exclusion of data

Animal studies are often complex and outliers are not unusual. Decisions to include or exclude specific animals on the basis of outcomes (for example, state of health, dissimilarity to other data) have the potential to influence the study results. Thus, rules for inclusion and exclusion of data should be defined prospectively and reported. It is also important to report whether all animals that were entered into the experiment actually completed it, or whether they were removed, and if so, for what reason. Differential attrition between groups can introduce bias. For example, a treatment may appear effective if it kills off the weakest or most severely affected animals whose fates are then not reported. In addition, it is important to report whether any data were removed before analysis and the reasons for this data exclusion.

Retrospective primary end-point selection

It is well known that assessment of multiple end points, and/or assessment of a single end point at multiple time points, inflates the type-I error (false-positive results)³⁰. Yet it is not uncommon for investigators to select a primary end point only after data analyses. False-positive conclusions arising from such practices can be avoided by specifying a primary end point before the study is undertaken, the time(s) at which the end point will be assessed, and the method(s) of analysis. Significant findings for secondary end points can and should be reported, but should be delineated as exploratory in nature. If multiple end points are to be assessed, then appropriate statistical corrections should be applied to control type-I error, such as Bonferroni corrections^30,33.

Pseudo replicates

When considering sample-size determination and experimental design, pseudo-replication issues need to be considered³¹. There is a clear, but often misunderstood or misrepresented, distinction between technical and biologic replicates. For example, in analysing effects of pollutants on reproductive health, multiple sampling from a litter, regardless of how many littermates are quantified, provides data from only a single biologic replicate. When biologic variation in response to some intervention is the variable of interest, as in many animal experiments, analysis of samples from multiple litters is essential. The unit of assessment is the smallest unit (animal, cage, litter) to which the intervention in question can be independently administered³⁴.

Small effect sizes

A statistically significant result does not provide information on the magnitude of the effect and thus does not necessarily mean that the effect is robust, which could account for the poor reproducibility of certain studies³⁵. Therefore, reporting whether results were substantiated by repetition, preferably under a range of conditions that demonstrate the robustness of the effect is encouraged. Also, reporting how often the particular experiment was performed as a means to control for a general tendency to publish only the best results would strengthen the validity of experimental results. To this end, carefully designed and powered animal studies should be budgeted for in the grant applications and funding agencies should consider supporting repetition studies where appropriate.

An important note about exploratory experiments

For the most part, these best practices do not apply to early-stage observational experiments searching for possible differences among experimental groups. Such exploratory testing is frequently conducted using a small sample size, does not have a primary outcome and is often unblinded. However, because such experiments are likely to be subject to many of the limitations described above, they should be viewed as hypothesis-generating experiments and interpreted as such. Potential discoveries arising from the exploratory phase of the research should be supported by follow-up, hypothesis-testing experiments that take into consideration and adequately report on the core standards detailed above (Box 1).

The path to implementation

Improving the transparency and quality of reporting cannot be achieved by a single party, but will require cooperation among all stakeholders, including investigators, reviewers, funding agencies and journals. Calling upon investigators to provide key information about the design, execution and analysis of animal experiments described in grant applications and manuscripts and encouraging reviewers to consider these issues in their evaluations should, over time, increase both the quality and predictive value of preclinical research. Potential strategies for achieving this goal can be adopted from the clinical trials community, which also contended with poor reporting and associated bias. Evidence that clinical trials can yield biased results if they lack methodological rigor^{36,37,38,39,40,41} led to the development and implementation of the CONSORT guidelines for randomized clinical trials (among other guidelines), now adopted by many clinical journals and funding organizations. These guidelines require that authors report whether and how their studies were carried out blind and randomized, how sample size was determined, whether data are missing owing to attrition or exclusion, and supply information about other important experimental parameters^42,43,44. Importantly, the guidelines have improved the transparency of clinical study reporting in journals that have adopted them^45,46,47,48. Additional evidence for the power of such guidelines can be deduced from the observation that, although few animal studies report on randomization, blinding or sample-size determination, most describe compliance with animal regulations, which is required by journals^6,8,9,49,50.

As a first step, we recommend that funding organizations and journals provide reviewers with clear guidance about core features of animal study design (listed in Box 1). The goal is not to be prescriptive or proscriptive, but rather to delineate the minimum set of standards that should routinely be considered in evaluating the appropriateness of a study. Such guidance would make the task easier for reviewers of manuscripts and grant applications who volunteer their time and are often overextended. In addition, investigators and reviewers should be encouraged to consult published generic and model-specific guidelines for designing in vivo animal experiments^{6,20,21,22,23,24,25,26,51,52}. To assist reviewers, editors and funding organizations in making sure that applications and manuscripts contain sufficient information on the core reporting recommendation (Box 1), authors could be asked to append relevant information on a standardized form that accompanies the submission. This form could be as simple as a checkbox indicating the page on which the key reporting standard is addressed. Such a form is currently used by clinical research journals.

In addition to the measures proposed above, better dissemination of knowledge will be greatly facilitated by addressing publication bias, the phenomenon that few studies showing negative outcomes are published^{53,54,55,56,57,58,59,60,61,62}. Such deficiency in reporting contributes to needless repetition of similar studies by investigators unaware of earlier efforts^59,60. There is a widely accepted belief that the scientific community, promotions committees, funding agencies and journals favour positive outcomes, an impression that can lead to bias⁶³. Possible solutions include incentivizing investigators to publish negative outcomes, supporting studies of independent replication, encouraging journals to publish a greater number of studies reporting negative outcomes, creating a database for negative outcomes (analogous to http://ClinicalTrials.gov/), and linking the raw data to publications.

Change will not occur overnight. The importance of training scientists to properly design and adequately report animal studies cannot be overstated. Training and education focused on key features of experimental design should be an ongoing process for both the novice and veteran involved in biomedical research. Attention to better study design reporting should be communicated at major meetings, brought to the attention of reviewers, editors and funders, required by the publishers of peer-review journals, and included in the training program of graduate and postdoctoral students. Furthermore, good mentorship is crucial for developing such skills and should be encouraged and rewarded. Rigorous experimental design and adequate reporting needs to be emphasized across the board and monitored in training grants awarded by the US National Institute of Health (NIH) and other funding agencies. Professional societies can also have an important role by highlighting this issue in their respective communities.

An important gatekeeper of quality remains the peer review of grant applications and journal manuscripts. We therefore call upon funding agencies and publishing groups to take actions to reinforce the importance of methodological rigor and reporting. NINDS has begun taking steps to promote best practices for preclinical therapy development studies. In 2011, a Notice was published in the NIH Guide encouraging the scientific community to address the issues described above in their grant applications, in describing both the project being proposed and the supporting data upon which it is based (http://grants.nih.gov/grants/guide/notice-files/NOT-NS-11-023.html). Points that should be considered in a well-designed study are listed on the NINDS website (http://www.ninds.nih.gov/funding/transparency_in_reporting_guidance.pdf). Furthermore, the reviewers of applications reviewed by the NINDS Scientific Review Branch are reminded of these issues and asked to pay careful attention to the scientific premise of the proposed projects.

We believe that improving how animal studies are reported will raise awareness of the importance of rigorous study design. Such increased awareness will accelerate both scientific progress and the development of new therapies.

References

1
Begley, C. G. & Ellis, L. M. Raise standards for preclinical cancer research. Nature 483, 531–533 (2012)
CAS Article ADS Google Scholar
2
Hess, K. R. Statistical design considerations in animal studies published recently in Cancer Research. Cancer Res. 71, 625 (2011)
CAS Article Google Scholar
3
Kilkenny, C. et al. Survey of the quality of experimental design, statistical analysis and reporting of research using animals. PLoS ONE 4, e7824 (2009)
Article ADS Google Scholar
4
Moher, D., Simera, I., Schulz, K. F., Hoey, J. & Altman, D. G. Helping editors, peer reviewers and authors improve the clarity, completeness and transparency of reporting health research. BMC Med. 6, 13 (2008)
Article Google Scholar
5
Prinz, F., Schlange, T. & Asadullah, K. Believe it or not: how much can we rely on published data on potential drug targets? Nature Rev. Drug Discov. 10, 712 (2011)The first report that many published studies cannot be reproduced by the pharmaceutical industry.
CAS Article Google Scholar
6
Sena, E., van der Worp, H. B., Howells, D. & Macleod, M. How can we improve the pre-clinical development of drugs for stroke? Trends Neurosci. 30, 433–439 (2007)
CAS Article Google Scholar
7
Steward, O., Popovich, P. G., Dietrich, W. D. & Kleitman, N. Replication and reproducibility in spinal cord injury research. Exp. Neurol. 233, 597–605 (2012)
Article Google Scholar
8
van der Worp, H. B. & Macleod, M. R. Preclinical studies of human disease: time to take methodological quality seriously. J. Mol. Cell. Cardiol. 51, 449–450 (2011)
CAS Article Google Scholar
9
Hackam, D. G. & Redelmeier, D. A. Translation of research evidence from animals to humans. J. Am. Med. Assoc. 296, 1727–1732 (2006)A study reporting that a large fraction of high-impact publications in highly reputable journals lack important information related to experimental design.
Article Google Scholar
10
Macleod, M. R. et al. Evidence for the efficacy of NXY-059 in experimental focal cerebral ischaemia is confounded by study quality. Stroke 39, 2824–2829 (2008)A study demonstrating that lack of reporting of key methodological parameters is associated with bias.
Article Google Scholar
11
Bebarta, V., Luyten, D. & Heard, K. Emergency medicine animal research: does use of randomization and blinding affect the results? Acad. Emerg. Med. 10, 684–687 (2003)
Article Google Scholar
12
Crossley, N. A. et al. Empirical evidence of bias in the design of experimental stroke studies – A metaepidemiologic approach. Stroke 39, 929–934 (2008)
Article Google Scholar
13
Rooke, E. D., Vesterinen, H. M., Sena, E. S., Egan, K. J. & Macleod, M. R. Dopamine agonists in animal models of Parkinson’s disease: a systematic review and meta-analysis. Parkinsonism Relat. Disord. 17, 313–320 (2011)
Article Google Scholar
14
Vesterinen, H. M. et al. Improving the translational hit of experimental treatments in multiple sclerosis. Mult. Scler. J. 16, 1044–1055 (2010)
Article Google Scholar
15
Stroke Therapy Academic Industry Roundtable (STAIR). Recommendations for standards regarding preclinical neuroprotective and restorative drug development. Stroke 30, 2752–2758 (1999)
16
Fanelli, D. “Positive” results increase down the hierarchy of the sciences. PLoS ONE 5, e10068 (2010)
Article ADS Google Scholar
17
Jerndal, M. et al. A systematic review and meta-analysis of erythropoietin in experimental stroke. J. Cereb. Blood Flow Metab. 30, 961–968 (2010)
CAS Article Google Scholar
18
Macleod, M. R., O’Collins, T., Horky, L. L., Howells, D. W. & Donnan, G. A. Systematic review and metaanalysis of the efficacy of FK506 in experimental stroke. J. Cereb. Blood Flow Metab. 25, 713–721 (2005)
CAS Article Google Scholar
19
Sena, E. S. et al. Factors affecting the apparent efficacy and safety of tissue plasminogen activator in thrombotic occlusion models of stroke: systematic review and meta-analysis. J. Cereb. Blood Flow Metab. 30, 1905–1913 (2010)
CAS Article Google Scholar
20
Wheble, P. C. R., Sena, E. S. & Macleod, M. R. A systematic review and meta-analysis of the efficacy of piracetam and piracetam-like compounds in experimental stroke. Cerebrovasc. Dis. 25, 5–11 (2008)
CAS Article Google Scholar
21
Festing, M. F. & Altman, D. G. Guidelines for the design and statistical analysis of experiments using laboratory animals. ILAR J. 43, 244–258 (2002)
CAS Article Google Scholar
22
Kilkenny, C., Browne, W. J., Cuthill, I. C., Emerson, M. & Altman, D. G. Improving bioscience research reporting: the ARRIVE guidelines for reporting animal research. PLoS Biol. 8, e1000412 (2010)
Article Google Scholar
23
van der Worp, H. B. et al. Can animal models of disease reliably inform human studies? PLoS Med. 7, e1000245 (2010)
Article Google Scholar
24
Fisher, M. et al. Update of the stroke therapy academic industry roundtable preclinical recommendations. Stroke 40, 2244–2250 (2009)
Article Google Scholar
25
Ludolph, A. C. et al. Guidelines for preclinical animal research in ALS/MND: a consensus meeting. Amyotroph. Lateral Scler. 11, 38–45 (2010)
Article Google Scholar
26
Shineman, D. W. et al. Accelerating drug discovery for Alzheimer’s disease: best practices for preclinical animal studies. Alzheimers Res. Ther. 3, 28 (2011)
CAS Article Google Scholar
27
Unger, E. F. All is not well in the world of translational research. J. Am. Coll. Cardiol. 50, 738–740 (2007)
Article Google Scholar
28
Ioannidis, J. P. A. Why most published research findings are false. PLoS Med. 2, e124 (2005)
Article Google Scholar
29
Dienes, Z. Bayesian versus orthodox statistics: which side are you on? Perspect. Psychol. Sci. 6, 274–290 (2011)
Article Google Scholar
30
Simmons, J. P., Nelson, L. D. & Simonsohn, U. False-positive psychology: undisclosed flexibility in data collection and analysis allows presenting anything as significant. Psychol. Sci. 22, 1359–1366 (2011)
Article Google Scholar
31
Beal, K. G. & Khamis, H. J. A problem in statistical-analysis: simultaneous inference. Condor 93, 1023–1025 (1991)
Article Google Scholar
32
Lazic, S. E. The problem of pseudoreplication in neuroscientific studies: is it affecting your analysis? BMC Neurosci. 11, 5 (2010)
Article Google Scholar
33
Scott, S. et al. Design, power, and interpretation of studies in the standard murine model of ALS. Amyotroph. Lateral Scler. 9, 4–15 (2008)An enlightening analysis of how small sample sizes can lead to false-positive outcomes.
CAS Article Google Scholar
34
Proschan, M. A. & Waclawiw, M. A. Practical guidelines for multiplicity adjustment in clinical trials. Control. Clin. Trials 21, 527–539 (2000)
CAS Article Google Scholar
35
Festing, M. F. W. Design and statistical methods in studies using animal models of development. ILAR J. 47, 5–14 (2006)
CAS Article Google Scholar
36
Nakagawa, S. & Cuthill, I. C. Effect size, confidence interval and statistical significance: a practical guide for biologists. Biol. Rev. Camb. Philos. Soc. 82, 591–605 (2007)
Article Google Scholar
37
Chalmers, T. C., Celano, P., Sacks, H. S. & Smith, H. Bias in treatment assignment in controlled clinical-trials. N. Engl. J. Med. 309, 1358–1361 (1983)
CAS Article Google Scholar
38
Jüni, P., Altman, D. G. & Egger, M. Systematic reviews in health care - assessing the quality of controlled clinical trials. Br. Med. J. 323, 42 (2001)
Article Google Scholar
39
Pildal, J. et al. Impact of allocation concealment on conclusions drawn from meta-analyses of randomized trials. Int. J. Epidemiol. 36, 847–857 (2007)
CAS Article Google Scholar
40
Pocock, S. J., Hughes, M. D. & Lee, R. J. Statistical problems in the reporting of clinical-trials. A survey of three medical journals. N. Engl. J. Med. 317, 426–432 (1987)
CAS Article Google Scholar
41
Schulz, K. F., Chalmers, I., Hayes, R. J. & Altman, D. G. Empirical evidence of bias. Dimensions of methodological quality associated with estimates of treatment effects in controlled trials. J. Am. Med. Assoc. 273, 408–412 (1995)
CAS Article Google Scholar
42
Wood, L. et al. Empirical evidence of bias in treatment effect estimates in controlled trials with different interventions and outcomes: meta-epidemiological study. Br. Med. J. 336, 601–605 (2008)
Article Google Scholar
43
Moher, D. CONSORT 2010 Explanation and Elaboration: updated guidelines for reporting parallel group randomised trials. Br. Med. J. 340, c869 (2011)
Article Google Scholar
44
Moher, D., Schulz, K. F. & Altman, D. G. The CONSORT statement: revised recommendations for improving the quality of reports of parallel-group randomised trials. Lancet 357, 1191–1194 (2001)Revision of guidelines by the CONSORT group to improve the reporting of randomized clinical trials.
CAS Article Google Scholar
45
Schulz, K. F., Altman, D. G. & Moher, D. CONSORT 2010 statement: updated guidelines for reporting parallel group randomised trials. PLoS Med. 7, e1000251 (2010)
Article Google Scholar
46
Plint, A. C. et al. Does the CONSORT checklist improve the quality of reports of randomised controlled trials? A systematic review. Med. J. Aust. 185, 263–267 (2006)
PubMed Google Scholar
47
Kane, R. L., Wang, J. & Garrard, J. Reporting in randomized clinical trials improved after adoption of the CONSORT statement. J. Clin. Epidemiol. 60, 241–249 (2007)
Article Google Scholar
48
Prady, S. L., Richmond, S. J., Morton, V. M. & Macpherson, H. A systematic evaluation of the impact of STRICTA and CONSORT recommendations on quality of reporting for acupuncture trials. PLoS ONE 3, e1577 (2008)
Article ADS Google Scholar
49
Smith, B. A. et al. Quality of reporting randomized controlled trials (RCTs) in nursing literature: application of the consolidated standards reporting trials (CONSORT). Nurs. Outlook 56, 31–37 (2008)
Article Google Scholar
50
Macleod, M. R., O’Collins, T., Howells, D. W. & Donnan, G. A. Pooling of animal experimental data reveals influence of study design and publication bias. Stroke 35, 1203–1208 (2004)
Article Google Scholar
51
Macleod, M. R., O’Collins, T., Horky, L. L., Howells, D. W. & Donnan, G. A. Systematic review and meta-analysis of the efficacy of melatonin in experimental stroke. J. Pineal Res. 38, 35–41 (2005)
CAS Article Google Scholar
52
Gallo, J. M. Pharmacokinetic/pharmacodynamic-driven drug development. Mt. Sinai J. Med. 77, 381–388 (2010)
Article Google Scholar
53
Moher, D. et al. Describing reporting guidelines for health research: a systematic review. J. Clin. Epidemiol. 64, 718–742 (2011)
Article Google Scholar
54
Callaham, M. L., Wears, R. L., Weber, E. J., Barton, C. & Young, G. Positive-outcome bias and other limitations in the outcome of research abstracts submitted to a scientific meeting. J. Am. Med. Assoc. 280, 254–257 (1998)
CAS Article Google Scholar
55
Dickersin, K. & Chalmers, I. Recognizing, investigation and dealing with incomplete and biased reporting of clinical research: from Francis Bacon to the WHO. J. R. Soc. Med. 104, 532–538 (2011)
Article Google Scholar
56
Fanelli, D. Negative results are disappearing from most disciplines and countries. Scientometrics 90, 891–904 (2012)
Article Google Scholar
57
Kyzas, P. A., Denaxa-Kyza, D. & Ioannidis, J. P. A. Almost all articles on cancer prognostic markers report statistically significant results. Eur. J. Cancer 43, 2559–2579 (2007)
Article Google Scholar
58
Liu, S. Dealing with publication bias in translational stroke research. J. Exp. Stroke Transl. Med. 2, 16–21 (2009)
Article Google Scholar
59
Rockwell, S., Kimler, B. E. & Moulder, J. E. Publishing negative results: the problem of publication bias. Radiat. Res. 165, 623–625 (2006)
CAS Article ADS Google Scholar
60
Rosenthal, R. The file drawer problem and tolerance for null results. Psychol. Bull. 86, 638–641 (1979)
Article Google Scholar
61
Sterling, T. D. Publication decisions and their possible effects on inferences drawn from tests of significance—or vice versa. J. Am. Stat. Assoc. 54, 30–34 (1959)
Google Scholar
62
Song, F. et al. Dissemination and publication of research findings: an updated review of related biases. Health Technol. Assess. 14, 1–220 (2010)
Article Google Scholar
63
Sena, E. S., van der Worp, H. B., Bath, P. M. W., Howells, D. W. & Macleod, M. R. Publication bias in reports of animal stroke studies leads to major overstatement of efficacy. PLoS Biol. 8, e1000344 (2010)
Article Google Scholar
64
Fanelli, D. Do pressures to publish increase scientists’ bias? An empirical support from US states data. PLoS ONE 5, e10271 (2010)
Article ADS Google Scholar

Download references

Acknowledgements

Funded by NINDS.

Author information

Affiliations

National Institute of Neurological Disorders and Stroke, NIH, Bethesda, 20892, Maryland, USA
Story C. Landis, Robert Finkelstein, Amelie K. Gubitz, Walter Koroshetz, John D. Porter, Ursula Utz & Shai D. Silberberg
Department of Neurobiology, University of Pittsburgh School of Medicine, Pittsburgh, 15213, Pennsylvania, USA
Susan G. Amara
Bayer HealthCare, Berlin, 13342, Germany
Khusru Asadullah
National Center for Advancing Translational Sciences, NIH, Rockville, 20854, Maryland, USA
Chris P. Austin
CHDI Management/CHDI Foundation, New York, 10001, New York, USA
Robi Blumenstein
Center for Review, NIH, Bethesda, Maryland 20892, USA
Eileen W. Bradley
Department of Genetic Medicine, Weill Cornell Medical College, New York, 10021, New York, USA
Ronald G. Crystal
Howard Hughes Medical Institute, The Rockefeller University, New York, 10065, New York, USA
Robert B. Darnell
Department of Neurological Surgery, University of Pittsburgh, Pittsburgh, 15213, Pennsylvania, USA
Robert J. Ferrante
Alzheimer's Drug Discovery Foundation, New York, 10019, New York, USA
Howard Fillit
Department of Neurology, University of Massachusetts Medical School, Worcester, 01545, Massachusetts, USA
Marc Fisher
Department of Pharmacology and Experimental Neuroscience, University of Nebraska Medical Center, Omaha, 68198, Nebraska, USA
Howard E. Gendelman
JAMA, Chicago, 60654, Illinois, USA
Robert M. Golub
Department of Neurology, Michigan State University, East Lansing, 48824, Michigan, USA
John L. Goudreau
Department of Neurology, University of Rochester Medical Center, Rochester, 14642, New York, USA
Robert A. Gross
Parent Project Muscular Dystrophy, Hackensack, 07601, New Jersey, USA
Sharon E. Hesterlee
The Florey Institute of Neuroscience and Mental Health, University of Melbourne, Heidelberg, 3081, Australia
David W. Howells
Neurology and Neurological Sciences and Cellular and Molecular Physiology, Stanford University, Stanford, 94305, California, USA
John Huguenard
Science Translational Medicine, AAAS, Washington DC 22201, USA
Katrina Kelner
Department of Neurology, Harvard Medical School, Massachusetts General Hospital, Boston, 02114, Massachusetts, USA
Dimitri Krainc
F. Hoffmann-La Roche, Basel, 4070, Switzerland
Stanley E. Lazic
Department of Psychiatry and Biobehavioral Sciences, University of California Los Angeles, Los Angeles, 90095, California, USA
Michael S. Levine
Department of Clinical Neurosciences, University of Edinburgh, Western General Hospital, Edinburgh, EH4 2XU, UK
Malcolm R. Macleod
PharMac LLC, Boca Grande, 33921, Florida, USA
John M. McCall
University of Rochester Medical Center, School of Medicine and Dentistry, Rochester, 14642, New York, USA
Richard T. Moxley III
Nature Neuroscience, New York, 10013, New York, USA
Kalyani Narasimhan
Department of Neurological Surgery, University of California San Francisco, San Francisco, 94143, California, USA
Linda J. Noble
ALS Therapy Development Institute, Cambridge, 02139, Massachusetts, USA
Steve Perrin
Reeve-Irvine Research Center, University of California Irvine, Irvine, 92697, California, USA
Oswald Steward
Office of New Drugs, Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, Maryland 20993, USA
Ellis Unger

Authors

Story C. Landis
View author publications
You can also search for this author in PubMed Google Scholar
Susan G. Amara
View author publications
You can also search for this author in PubMed Google Scholar
Khusru Asadullah
View author publications
You can also search for this author in PubMed Google Scholar
Chris P. Austin
View author publications
You can also search for this author in PubMed Google Scholar
Robi Blumenstein
View author publications
You can also search for this author in PubMed Google Scholar
Eileen W. Bradley
View author publications
You can also search for this author in PubMed Google Scholar
Ronald G. Crystal
View author publications
You can also search for this author in PubMed Google Scholar
Robert B. Darnell
View author publications
You can also search for this author in PubMed Google Scholar
Robert J. Ferrante
View author publications
You can also search for this author in PubMed Google Scholar
Howard Fillit
View author publications
You can also search for this author in PubMed Google Scholar
Robert Finkelstein
View author publications
You can also search for this author in PubMed Google Scholar
Marc Fisher
View author publications
You can also search for this author in PubMed Google Scholar
Howard E. Gendelman
View author publications
You can also search for this author in PubMed Google Scholar
Robert M. Golub
View author publications
You can also search for this author in PubMed Google Scholar
John L. Goudreau
View author publications
You can also search for this author in PubMed Google Scholar
Robert A. Gross
View author publications
You can also search for this author in PubMed Google Scholar
Amelie K. Gubitz
View author publications
You can also search for this author in PubMed Google Scholar
Sharon E. Hesterlee
View author publications
You can also search for this author in PubMed Google Scholar
David W. Howells
View author publications
You can also search for this author in PubMed Google Scholar
John Huguenard
View author publications
You can also search for this author in PubMed Google Scholar
Katrina Kelner
View author publications
You can also search for this author in PubMed Google Scholar
Walter Koroshetz
View author publications
You can also search for this author in PubMed Google Scholar
Dimitri Krainc
View author publications
You can also search for this author in PubMed Google Scholar
Stanley E. Lazic
View author publications
You can also search for this author in PubMed Google Scholar
Michael S. Levine
View author publications
You can also search for this author in PubMed Google Scholar
Malcolm R. Macleod
View author publications
You can also search for this author in PubMed Google Scholar
John M. McCall
View author publications
You can also search for this author in PubMed Google Scholar
Richard T. Moxley III
View author publications
You can also search for this author in PubMed Google Scholar
Kalyani Narasimhan
View author publications
You can also search for this author in PubMed Google Scholar
Linda J. Noble
View author publications
You can also search for this author in PubMed Google Scholar
Steve Perrin
View author publications
You can also search for this author in PubMed Google Scholar
John D. Porter
View author publications
You can also search for this author in PubMed Google Scholar
Oswald Steward
View author publications
You can also search for this author in PubMed Google Scholar
Ellis Unger
View author publications
You can also search for this author in PubMed Google Scholar
Ursula Utz
View author publications
You can also search for this author in PubMed Google Scholar
Shai D. Silberberg
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.F., A.K.G., S.C.L., J.D.P., S.D.S., U.U. and W.K. organized the workshop. R.B.D., S.E.L., S.C.L., M.R.M. and S.D.S. wrote the manuscript. All authors participated in the workshop and contributed to the editing of the manuscript.

Corresponding author

Correspondence to Shai D. Silberberg.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

This article is distributed under the terms of the Creative Commons Attribution-Non-Commercial-Share Alike licence (http://creativecommons.org/licenses/by-nc-sa/3.0/).

Reprints and Permissions

About this article

Cite this article

Landis, S., Amara, S., Asadullah, K. et al. A call for transparent reporting to optimize the predictive value of preclinical research. Nature 490, 187–191 (2012). https://doi.org/10.1038/nature11556

Download citation

Received: 21 August 2012
Accepted: 10 September 2012
Published: 10 October 2012
Issue Date: 11 October 2012
DOI: https://doi.org/10.1038/nature11556

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Main

Widespread deficiencies in methods reporting

A core set of reporting standards

Randomization and blinding

Sample-size estimation

Data handling

Interim data analysis

Ad hoc exclusion of data

Retrospective primary end-point selection

Pseudo replicates

Small effect sizes

An important note about exploratory experiments

The path to implementation

References

Acknowledgements

Author information

Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Rights and permissions

About this article

Cite this article

Further reading

Sex as a Biological Variable in Preclinical Modeling of Blast-Related Traumatic Brain Injury

Human Muscle Precursor Cells Form Human-Derived Myofibers in Skeletal Muscles of Nonhuman Primates: A Potential New Preclinical Setting to Test Myogenic Cells of Human Origin for Cell Therapy of Myopathies

Enhancing quality in preclinical data: Of hot science and cool quality

An Analysis of Mesenchymal Stem Cell-Derived Extracellular Vesicles for Preclinical Use

Unified Behavioral Scoring for Preclinical Models

Comments

Search

Quick links