Sensitivity analysis of the 1998 Large Coastal Shark Evaluation Workshop results to new data and model formulations following recommendations from peer reviews

 

by

Enric Cortés, National Marine Fisheries Service, Southeast Fisheries Science Center, Panama City Laboratory, 3500 Delwood Beach Road, Panama City, FL 32408

 

 

April 2002

 

Sustainable Fisheries Division Contribution SFD-01/02-167

 

 

BACKGROUND

As a result of litigation, the 1998 Large Coastal Shark Evaluation Workshop Report was sent out for external peer-review. Two organizations, the Center for Independent Experts (CIE) and the National Resources Consultants (NRC), were selected to implement the review process, which resulted in a total of seven reviews. The present document addresses some of the criticisms and recommendations contained in those reviews. Specifically, it describes the sensitivity trial results obtained from using the 1998 data to evaluate the recommended range of alternative model and data formulations. Ensuing documents will address additional issues, such as using other model (e.g., age-structured) and data (at least up to 2000) formulations useful for an updated assessment.

Rather than addressing each reviewer’s specific criticisms individually, the present document groups the sensitivity tests undertaken into several general categories. Specifically, sensitivity tests presented herein address (listed in alphabetical order):

 

 

 

 

METHODS AND MODELS

Stock Assessment Models

1. Bayesian Surplus Production Model using the SIR algorithm, two forms of the population dynamics model, and two different weighting schemes

The surplus production model applied in the 1998 SEW using Bayesian statistical techniques was a modified Schaefer model that had been previously used in the 1996 SEW (NMFS 1996). This version of the Schaefer model was proposed by Prager (1994) and includes fishing mortality (F) explicitly in the surplus production function, such that when population abundance is expressed in numbers, it becomes:

equation

 

 

 

equationwhere Nt is stock abundance in year t, r is the intrinsic rate of increase from the logistic equation, K is carrying capacity, and Ft is the instantaneous fishing mortality rate in year t. After integration with respect to time and with at = r-Ft and b = r/K, the equation above can take two forms:

 

 

equationwhen at ¹ 0, or

 

when at=0.

As detailed in Prager (1994) and McAllister et al. (2001), Ft must be solved iteratively from two different equations (when at ¹ 0 or when at=0) in which Ft occurs on both sides of the equation. This version of the model will be referred to as the "Prager form" in this document.

The discrete version of the surplus production model without the Prager modification (which was used by at least one of the two reviewers that attempted to duplicate results reported in the 1998 SEW report) was also used in the sensitivity trials conducted to determine the effect of introducing changes in the form of the stock assessment model. This version is simply:

equation

where Ct is catch in year t. This version of the model will be referred to as the "discrete form" in this document.

The expected catch rate (CPUE) for each of the available time series j in year t is given by:

equation

where qj is the catchability coefficient for CPUE series j, and ee is the residual error, which is assumed to be lognormally distributed. Coefficients of variation (CV) were available in some CPUE series (CVj,t), and were used as weights for each series, such that:

equation

 

where cj is a constant for series j that makes the weights sum to 1, and sj2 is the arithmetic mean for the variance of CPUE series j.

The log likelihood function of the abundance indices is expressed as:

equation

 

 

 

where s is the number of CPUE series and y is the number of years in each CPUE series.

 

The average sj2 and qj for each CPUE series were assumed to follow a uniform distribution on the log scale, but were integrated from the joint posterior distribution using the method described by Walters and Ludwig (1994). In another form of the model, all CPUE data points from all series were assumed to have the same variance (s2=1), which is equivalent to having no weighting or having an equal weighting scenario.

Prior probability distributions, alternative hypotheses, and performance indicators

Alternative hypotheses were generated by drawing alternative values from the parameters assigned priors (r, K, N1974/K, and C0). Performance indicators included the maximum sustainable yield (MSC=rK/4), the stock abundance in the last year of data (N1998), the ratio of stock abundance in the last year of data to carrying capacity (N1998/K), and the replacement yield for 1998 (RY1998).

The prior chosen for K in the base-case scenario was uninformative, as little is known about the carrying capacity of shark populations. The prior distribution for the large coastal shark complex and sandbar shark was uniform on the log of K over the range 1x10-12 to 1x1012 individuals. This prior is proportional to the inverse of K and so assigns less credibility to higher values of K (McAllister and Kirkwood 1998). For blacktip shark, the range of the prior for K was 1x10-12 to 8x108 individuals.

The informative prior chosen for r was based on results from documents presented at the 1998 SEW (appendix 2 of the 1996 SEW [SB-IV-31]; SB-IV-10) and on some ad-hoc calculations by the Committee. The upper bound, or absolute biological upper limit, of the intrinsic rate of increase assuming geometric growth was used in each case as the estimate of the mean. Note that the values used are higher than those used by McAllister et al. (2001). Thus, it is important to note that the values used in the 1998 SEW were high from a biological perspective. Recent estimates of intrinsic rates of increase obtained using both density-independent (Cortés, in press) and density-dependent (Smith et al. 1998) theory support considerably lower values of r for most species of sharks—including the sandbar and blacktip—in the large coastal complex (values of r for the most representative species all <0.07 for both density-independent and density-dependent estimates). While the surplus production model used assumes closed populations, it can be argued that the relatively high values of r used may be considered a proxy for net immigration into the stocks, thus alleviating the violation of a closed population assumption.

The priors for r were lognormal pdfs with mean=0.113, 0.117, and 0.136 for the large coastal complex, sandbar, and blacktip, respectively. The SD in the logarithm of r (sr) was set equal to 0.7 in all cases. It is calculated as (McAllister et al. 2001):

equation

This pdf makes values of r<0 impossible and concentrates most of the density towards the lower values of r. However, it also allows for high values of r that are unlikely for closed populations and even for open populations of sharks (e.g., see Table 30).

Informative priors were also used to describe the ratio of the stock abundance in 1974 with respect to K (N1974/K) and the average catch from 1974 to 1980 (C0). For N1974/K, the prior was lognormal with mean=1 and SD in the logarithm of r of 0.20 in all cases. This prior reduces the probability that N1974/K will be much higher than K since most of the values will be closer to unity. The prior for C0 was also lognormal with mean=487 300, 135 900, and 303,800 individuals (the mean of the observed catches) for the large coastal complex, sandbar, and blacktip, respectively. The SD in the logarithm of C0 was 0.51, 0.53, and 0.43, respectively.

 

Method of numerical integration

Numerical integration was carried out using the sampling/importance resampling (SIR) algorithm (Berger 1985, McAllister and Kirkwood 1998, McAllister et al. 2001). The marginal posterior distributions for each of the population parameters of interest were obtained by integrating the joint probability with respect to all the other parameters. Posterior CVs for each population parameter estimate were computed by dividing the posterior SD by the posterior expected value (mean) of the parameter of interest. The importance function used in the SIR algorithm was the multivariate Student t distribution, with its mean based on the posterior mode of q (vector of parameter estimates K, r, N74/K, and C0), and the covariance of q based on the Hessian estimate of the covariance at the mode (see McAllister and Kirkwood and references therein for details).

It has become apparent after the 1998 SEW report was written, however, that the Student t distribution used to sample from the posterior distribution for blacktip shark was inadequate and likely failed to converge to the posterior distribution. This resulted in the underestimation of expected values of the parameters because their right tail was not sampled appropriately by the importance function (McAllister et al., in prep.). To overcome this problem, one solution is to use a variety of importance functions and compare the marginal posterior distributions of the parameters. McAllister et al. (in prep.) have found that making the importance function more diffuse by multiplying the variance estimated from the Hessian matrix by a factor of 2 or more allows for the importance function to adequately sample from the posterior distribution.

The effect of the importance function is evidenced in the results for blacktip. Unlike for the large coastal complex and sandbar, a large portion of the posterior distribution had not been sampled by the importance function, thus underestimating the results. An additional reason for the unreliability of the results reported for blacktip in the 1998 SEW lies in the data themselves. The CPUE series that the model was fitted to show conflicting trends, with the longer Early Rec series and the Gulf Reef logs series indicating an increasing trend, but the remaining, shorter five series indicating a decreasing trend over time. The effect of the series considered on results will be shown in the sensitivity analysis section.

Decision analysis

Posterior expected values for several indices of policy performance were calculated using the resampling portion of the SIR algorithm, which involves randomly drawing 5,000 values of q with replacement from the discrete approximation to the posterior distribution of q, with the probability of drawing each value of q being proportional to the posterior probability calculated during the importance sampling phase. Details of this procedure can be found in McAllister and Kirkwood (1998) and McAllister et al. (2001), and references therein. Once a value of q was drawn, the model was projected from 1974 to 1998, and then to 2028, while applying one of the constant TAC (total allowable catch) policies (0, 10%, 20%, 30%, 40%, and 50% of 1995 catch) from 1999 on. The projections included calculating the expected value of Nfin/K (with fin=2008, 2018, and 2028), the expected value of the ratio of Nfin to the stock abundance that would result in MSY (Nfin /NMSY), the probability that Nfin were <0.2K, and the probability that Nfin were >N1998.

 

2. Bayesian Surplus Production Model using State-Space methodology and MCMC for numerical integration

A nonequilibrium Schaefer surplus production model was also used to describe the population dynamics of the large coastal shark complex, sandbar, and blacktip sharks using state-space methodology and a Markov Chain Monte Carlo (MCMC) method for numerical integration as an alternative to the SPM originally used in the 1998 SEW and described here under (1). The model used was that described by Meyer and Millar (1999a), originally developed in BUGS, and recoded here in WINBUGS (Spiegelhalter et al. 2000). In this approach, a state-space model accounts for both process error and observation error in a unified analytical framework that uses a MCMC method called Gibbs sampling (Gilks et al. 1996) to sample from the joint posterior distribution.

State-space models can be used to relate observed catch rates (It) to unobserved states (biomass, Bt) through a stochastic observation model for It given Bt. A description of state-space models can be found in Meyer and Millar (1999b) and Millar and Meyer (1999). Millar and Meyer (1999a) implemented a nonlinear, nonnormal state-space model assuming lognormal error structures and a reparametrization by expressing the annual biomass as a proportion of carrying capacity (Pt = Bt/K). In the present implementation, this Bayesian model includes the joint prior distribution of all unobservable quantities, i.e., K, r, N1974/K, C0, q, s2 (process error variance), and t2 (observation error variance) and the unknown states P1,….,Pt, and the joint distribution of the observable quantities, i.e., the CPUE indices I1,….,It. Bayesian inference then uses the posterior distribution of the unobserved quantities given the data (see Meyer and Millar 1999a for a full description of the model).

As in the original model developed by Millar and Meyer (1999a), the present implementation used inverse gamma distributions as priors for s2 and t2, but the MLEs for q in each CPUE time series were used instead of one prior for q for each series. The geometric average of the time series of individual q estimates for each CPUE series was used as an analytic solution for the estimate of q that maximizes the likelihood function (Punt 1988; Hilborn and Mangel 1997):

equation

where y is the number of years in each CPUE series.

The prior for s2 was an inverse gamma distribution with the 10% and 90% quantiles set at 0.04 and 0.08, and the priors for t2 (one for each individual CPUE series) were also described by an inverse gamma distribution with the 10% and 90% quantiles set at 0.05 and 0.15. In an alternative scenario, one single value of t2 was used for all series and given an inverse gamma distribution. No CV2 s were used in any of the scenarios run in Winbugs. All runs were based on two chains of initial values (where the Pt values were set equal to 0.5 and 1.0, respectively) to account for over-dispersed initial values (Spiegelhalter et al. 2000), and included a 5,000 sample burn-in phase followed by a 100,000 iteration phase.

 

RESULTS

Baseline analyses (with two forms of the model and two separate importance functions)

The baseline analyses reported in the 1998 SEW (Tables 7-9 therein) were rerun using the same, but updated, software (Mc Allister et al. 2001; Mc Allister and Babcock, in prep.) and compared to the results reported by the two reviewers who attempted to replicate the analyses (Haist and Punt). The two forms of the surplus production model (SPM) described above (Prager form and discrete form) were evaluated using two different importance functions: 1) the importance function used for the 1998 SEW (which will be referred to as the "original importance function"), and 2) an importance function in which the variance obtained form the Hessian matrix was multiplied by a factor of 2 (as described earlier).

 

Large coastal complex

Results of the baseline analysis for the large coastal shark complex with the original importance function (Table 1) are very similar regardless of the form of the model used. Stock abundance in 1998 (N1998) and N1998/K are slightly lower than reported in the 1998 SEW report, whereas the Maximum Sustainable Catch (MSC) is slightly higher. Results for this baseline analysis are a little more pessimistic than found by the two reviewers, especially the predictions of N1998 and N1998/K.

These and all other tables for the sensitivity trials include a decision analysis section that summarizes the consequences of alternative harvesting policies. Predictions from both the Prager and discrete versions of the SPM model indicate that, with the exception of a no-take policy after 30 years, no management action would result in the stock reaching MSY (MSC, here). Compared to Punt’s review findings, which included predictions of N2019/K with 0%, 30%, and 50% of the 1995 catch TAC options, the present predictions are more pessimistic (0.37 vs. 0.50 for the no-take option, 0.02 vs. 0.17 for the 30% TAC option, and 0.01 vs. 0.04 for the 50% TAC option).

Results obtained using the variance expansion factor of two (Table 2) were nearly identical to those obtained with the original importance function, with the projections being slightly more optimistic. Conclusions for the large coastal complex are thus unaffected by the form of the model used (Prager vs. discrete) or the importance function. The marginal posterior distributions of the most representative parameters obtained using the two importance functions with the Prager form of the SPM are shown in Figures 1 and 2.

 

Sandbar shark

Results of the baseline analysis for the sandbar shark using the original importance function are also almost identical regardless of the form of the model used (Table 3). Stock abundance in 1998 (N1998) and N1998/K are slightly lower than reported in the 1998 SEW, but MSC is slightly higher. As with the large coastal shark complex, predictions of N1998, N1998/K, and MSC for this baseline analysis are a little more pessimistic than found by the two reviewers. Projections continue to indicate that a 50% TAC option would not suffice to restore the stock to MSC levels after 10, 20, or 30 years. A no-take policy would almost achieve MSC after 10 years only, and a TAC of 20% of the 1995 catch would yield MSC after 20 years. The prediction for a no-take option after 20 years is similar to that by Punt (0.62-0.64 vs. 0.72), but the predictions for the 30% and 50% TAC options after 20 years are a little more pessimistic that those of Punt (0.42 vs. 0.56, and 0.28 vs. 0.43, respectively).

Results obtained using the variance expansion factor of two (Table 4) are a little more optimistic, but still a little lower than those found by the two reviewers. Conclusions for the sandbar shark are also unaffected by the form of the model used (Prager vs. discrete) or the importance function. The marginal posterior distributions of the most representative parameters obtained using the two importance functions with the Prager form of the SPM are shown in Figures 3 and 4.

 

Blacktip shark

Results of the baseline analysis for the blacktip shark using the original importance function as reported in the 1998 SEW and in the present analysis, and those reported by the two reviewers are the most dissimilar (Table 5). Stock abundance in 1998 (N1998), N1998/K, and MSC are much lower than found by the reviewers. Consequently, predictions of future trends are also very different: a no-take policy would result in N2018/K=0.64-0.67 compared to N2019/K=0.81 found by Punt, and the 30% and 50% TAC options after 20 years are even more different (0.32-0.34 vs. 0.65, and 0.14-0.15 vs. 0.55, respectively).

As discussed in the Methods and Models section, results for the blacktip shark are highly influenced by the importance function used, largely because the seven CPUE series that the model is fitted to have contradictory trends. When the variance expansion factor of two was used in the importance function, results became much more optimistic, especially with the discrete form of the SPM (Table 6). Results obtained with the Prager form SPM were similar to those obtained by Punt and a little more optimistic than those reported by Haist. When using the discrete version of the SPM, results became considerably higher. Conclusions for the blacktip shark are thus directly affected by the importance function used and, to a lesser extent, by the form of the surplus production model used. The marginal posterior distributions of the most representative parameters obtained using the two importance functions with the Prager form of the SPM are shown in Figures 5 and 6. Note that the distributions for K and N1998 show more density in the right tail and that N1998/K seems to follow a bimodal distribution (Fig. 6).

 

Sensitivity analyses

Sensitivity tests incorporated the following modifications to the base-case scenarios, which used the priors described above for the large coastal shark complex, sandbar, and blacktip sharks. All sensitivity trials used the importance function with a variance expansion factor of two and were run with the Prager form SPM only, except for the scenarios described in section A.1, which included both forms of the SPM and the two separate importance functions.

 

A. Changes in the catch series

A.1. Alternative catch scenarios presented in the 1998 SEW report

Alternative catch scenarios were presented in the 1998 SEW report (Tables 7-9) for the large coastal shark complex, sandbar, and blacktip sharks. These alternative catch series (presented in Table 3 of the 1998 SEW report for the large coastal complex) accounted for potential underreporting in the commercial fishery by multiplying the catches from 1981-1985, 1986-1992, and 1993, by a factor of 1.5, 2, and 1.5, respectively, assuming that discards in the bottom longline fishery were 10,000 individuals during 1981-1986, and assuming that the recreational catch for 1983 was the geometric mean value of the 1982 and 1984 estimates. Similarly, alternative catch series for the sandbar and blacktip sharks were constructed by multiplying the commercial catches from 1986 by a factor of 1.5, those from 1987-1992 by a factor of 2, and those from 1993 by a factor of 1.5. Baseline and alternative catch series are given in Appendix 1. The priors for K, r, and N1974/K in the alternative catch series scenarios were the same as those in the baseline analyses, but the mean for the lognormal prior of C0 was set at 543 500, 187 300, and 381 700 individuals (the mean of the 1981-1997 or 1986-1997 catches) for the large coastal complex, sandbar, and blacktip, respectively. The SD in the logarithm of r was left unchanged.

For the large coastal complex, using the alternative catch series with the original importance function increased the predictions of K, N1998, N1998/K, and RY1998, but decreased the expected value of r and MSC (Table 7). Projections in the decision analysis changed very little, becoming slightly more optimistic. Results obtained using the variance expansion factor of two in the importance function (Table 8) were nearly identical to those obtained with the original importance function. Considering this alternative catch series thus had essentially no effect on the conclusions derived from the baseline analysis.

For sandbar shark, results from the Prager and the discrete SPM forms with the original importance function were slightly different, and they also differed from those in the 1998 SEW report (Table 9), possibly because the prior used for C0 was different. The projections from the decision analysis were a little more optimistic than those from the baseline analysis, with a 20% TAC option allowing recovery to MSC after 10 years according to the discrete form of the SPM. Results obtained using the variance expansion factor of two in the importance function (Table 10) were a little more optimistic in general than those obtained with the original importance function. For sandbar, the main conclusions derived from the baseline analysis were not strongly affected .

For blacktip shark, results from both forms of the model with the original importance function were very similar and also similar to those reported in the 1998 SEW report (Table 11). The projections from the decision analysis were a little more pessimistic than those from the baseline analysis (Table 5). As in the baseline analysis, using the variance expansion factor of two in the importance function yielded much more optimistic results (Table 12) and indicated no need for rebuilding, especially when using the discrete form SPM. Conclusions for the blacktip shark are directly affected by the importance function used and, to a lesser extent, by the form of the surplus production model used, as was also found in the baseline analysis.

 

A.2. Alternative catch scenario proposed by Haist

An additional sensitivity trial was run for the large coastal complex, incorporating the catch series proposed by Haist. This alternative catch series addressed concerns identified in the Industry Position Statement that the alternative catch series used in the 1998 SEW report underestimated true catches and did not take account of catches occurring outside U.S. waters. Haist thus multiplied the baseline catches from 1981-1985 and from 1986-1992 by a factor of 2 and 3, respectively, and included non-U.S. catches of 100 000 individuals per year from 1981-1997 (Appendix 2). The results obtained with the Prager form of the SPM were a bit more optimistic than those from the baseline analysis, but still a little more pessimistic than those found by Haist (Table 13). As indicated by Haist, however, the main conclusion from the baseline analysis remained unchanged.

 

B. Changes in prior distributions

B.1. Decreasing the upper bound of K

Haist considered a set of sensitivity scenarios in which the upper bound of the carrying capacity, K, was lowered. Decreasing the upper bound of K from 1x1012 to 20x106 individuals had no effect on results for the large coastal complex (Table 14). For sandbar, the same decrease in the upper bound of K had no effect on results, and a further decrease to 12x106 individuals had essentially no effect either (Table 15). For blacktip, a decrease of the upper bound of K from 8x108 to 20x106 individuals slightly decreased the predictions, whereas decreasing it to 12x106 individuals had a more accentuated effect, further decreasing the predictions (Table 16). In general, none of the conclusions derived from the baseline analyses changed by applying this set of sensitivity trials, except when decreasing the upper bound of K for blacktip to a considerably lower value, which would then indicate the need for some degree of rebuilding.

 

B.2. Dividing by two the CV of r

Dividing the CV for the intrinsic rate of increase, r, by two (scenarios run by Punt) for the large coastal complex resulted in the same level of stock depletion, reduced the estimate of stock abundance in 1998, but increased the estimates of MSC and RY1998, making the projections in the decision analysis generally less pessimistic than those from the baseline analysis (Table 17). For sandbar, a similar pattern was observed when reducing the CV of r by 50%, and projections became a little less pessimistic (Table 17). For blacktip, the stock was projected to be a little more depleted, but still above MSC (Table 17). In general, the same level of discrepancy with respect to the results found by Punt in the baseline analyses persisted in these sensitivity trials. The conclusions regarding the status of the three groupings remained unaffected.

 

B.3. Decreasing the upper bound of K and the mean of r

A further sensitivity trial for the sandbar shark that reduced the mean value of r from 0.113 to 0.07 and K to 12x106 individuals (one sensitivity trial run by Haist) was run. This resulted in lower expected values of K, r, N1998/K, MSC, and RY1998, but N1998 increased a little, with the ensuing projections becoming more pessimistic (Table 18), but not affecting significantly the conclusions on stock status.

 

 

C. Changes in the weighting schemes for the CPUE series, the form of the stock assessment model, and the method of numerical integration

Changing the weighting scheme by assuming that all CPUE data points from all series had equal variance (s2=1) or, in other words, assuming an equal weighting scenario (see Stock Assessment Models section above) yielded results similar to those of Punt’s equal weighting scenario. Both forms of the SPM model—the Prager form and the discrete form—resulted in higher values of K and N1998, and lower values of r, MSC, and RY1998 than found by Punt, but N1998/K was similar (Table 19). The results and projections were less pessimistic than those found in the baseline scenario, indicating that MSC could almost be reached after 20 years—as opposed to 30 years in the baseline analysis—with a no-take policy.

For sandbar, the SPM models yielded results similar to those of Punt, with N1998 and stock depletion in 1998 being a little higher, and MSC and RY1998 somewhat lower (Table 20). The results were also similar to those found in the baseline analysis, with stock depletion in 1998 being less pronounced. Model projections varied very little (Table 20).

For blacktip, the Prager form of the SPM yielded results similar to those of Punt, with stock depletion in 1998, N1998, MSC, and RY1998 being close (Table 21). Results with this equal weighting scenario were substantially more optimistic than those found in the baseline analysis with the Prager form and the importance function incorporating the variance expansion factor of two. As in Punt’s review, the estimates of N1998, N1998/K, MSC, and RY1998 all increased considerably (Table 21). Results for blacktip were therefore sensitive to the weighting scheme used.

Further sensitivity trials were run utilizing the state-space SPM model described in the Stock Assessment Models section. Two configurations were tested, one in which one value of t2 (the observation error variance) was estimated for each series, and one in which a single value of t2 was estimated for all series combined. This model further differed from the discrete SPM model in that it considered process error (with variance s2), it assumed an inverse gamma distribution for both t2 and s2, and it used a different method for numerical integration (MCMC with the Gibbs sampler instead of the SIR algorithm). Despite these different formulations, results from these trials using Winbugs were generally similar to those found with the SPM/SIR algorithm and those found by Punt for the large coastal complex, sandbar, and blacktip (Tables 19-21). For sandbar, Winbugs results obtained using the first formulation were higher than those found by Punt or with the SPM/SIR models, but results from the second formulation were closer (Table 20). For blacktip, the main difference was that the Winbugs models predicted a much lower value of r, which resulted in a lower value of MSC, but the predictions of K, N1998, and N1998/K were similar (Table 21). Results for the three groupings were thus pretty insensitive to the combined effect of using a different method of numerical integration for the Bayesian models, slightly structurally different population dynamics models, and different prior distributions for the error terms.

 

D. Changes in the structural assumptions of the model

Changing the structure of the model by assuming that the fishery had started in 1969 instead of 1974 (one of the scenarios considered in Punt’s review) affected little the output parameters and projections for the three groupings (Table 22). For the large coastal complex and sandbar, the results became a little more pessimistic, whereas for blacktip they barely changed. These results agree with Punt’s findings and do not affect the conclusions from the baseline analyses.

 

E. Changes in the CPUE time series

E.1. Decreasing the upper bound for K and changing the CPUE series used

Haist ran a sensitivity trial for the large coastal complex using only those CPUE series that the large coastal complex and sandbar or blacktip had in common. Using these eight CPUE series (Shark Observer, SC LL, Virginia LL, pelagic logbook, Early Rec [MRFSS,HBOAT,TX1], Late Rec [MRFSS,HBOAT,TX2], NMFS LL NE, and NMFS LL SE) of the 17 available, removing the 1983 data point from the Early Rec series, while decreasing the upper bound for K to 20x106 individuals (this latter change was shown earlier not to have an effect on results) resulted in no substantial change with respect to the findings of the baseline analysis (Table 23). The expected values of the estimates of N1998 and N1998/K found by Haist in the baseline analysis decreased somewhat, whereas the MSC increased; in contrast, the estimates of N1998, N1998/K, and MSC all increased when using the Prager form SPM. The estimates of N1998, N1998/K, and MSC found by Haist and with the Prager form were very close.

For sandbar, removing the 1983 data point from the Early Rec series while decreasing the upper bound for K to 12x106 individuals (this latter change was shown earlier not to have an effect on results) also resulted in no substantial change with respect to the findings of the baseline analysis (Table 23). The expected values of the estimates of N1998 and MSC found by Haist in the baseline analysis slightly decreased and N1998/K increased, whereas when using the Prager form SPM the estimates of N1998 and N1998/K increased and that of MSC decreased. In contrast to the results for the large coastal complex, the estimates of N1998, N1998/K, and MSC found by Haist and with the Prager form were substantially different, but still indicated stock depletion below MSC levels (Table 23).

For blacktip, removing the Gulf Reef logs series—the only series used for blacktip that was based on biomass—while decreasing the upper bound for K to 12x106 individuals (this latter change was shown earlier to have a substantial effect on results) resulted in a considerably more pessimistic outlook than found in the baseline analysis (Table 23). The expected values of all quantities found by Haist in the baseline analysis decreased very substantially, with the estimates of N1998/K and MSC decreasing by about one third and that of N1998 by over one half. Using the Prager form SPM also reduced all estimated quantities to various degrees. The quantities estimated by Haist and the Prager form SPM differed, but somewhat less than in the baseline analysis. This scenario indicated that the stock was below MSC levels. It can be concluded that inclusion of the Gulf Reef logs (biomass) series coupled with the assumption of a considerably higher upper bound for K (baseline scenario) results in a considerably more optimistic outlook than when that series is ignored and the upper bound for K is considerably lowered.

 

E.2. Removing the shorter CPUE series

Punt considered a set of scenarios in which only those CPUE series with 5 points or more were used to fit the model. For the large coastal complex, this resulted in 6 series being eliminated (Shark Observer, Jax, NC#, SC LL, NMFS LL NE, and NMFS LL SE). Fitting the 11 remaining series resulted in no substantial change with respect to the findings of the baseline analysis (Table 24). The expected value of N1998/K increased a little in Punt’s analysis, and the estimates of N1998, N1998/K, MSC, and RY1998 increased when using the Prager form SPM. The absolute discrepancy in the prediction of N1998/K between Punt’s analysis and the Prager form SPM persisted. For sandbar, five of the eight series were eliminated, leaving only the Virginia LL, and Early and Late Rec series in the analysis. This change had little effect on Punt’s estimate of N1998/K, which increased only from 0.34 to 0.35, and increased the predictions of N1998, N1998/K, and RY1998 with the Prager form SPM (Table 24). The predictions of r, N1998/K, and RY1998 between Punt’s analysis and the Prager form SPM were very close.

For blacktip, three of the seven series were eliminated, leaving the pelagic logbook, Early Rec, Late Rec, and Gulf Reef logs series in the analysis. This change decreased Punt’s estimate of N1998/K from 0.54 to 0.46, but increased that obtained with the Prager form SPM from 0.56 to 0.61, and also increased the estimates of N1998, MSC, and RY1998 (Table 24). In contrast to the findings with the large coastal complex or sandbar, the difference in the prediction of N1998/K between Punt’s analysis and the Prager form SPM increased appreciably. For blacktip, Punt’s results became more pessimistic and those from the Prager form SPM, more optimistic. It can be concluded that the combined effect of the three series eliminated (Shark Observer, NMFS LL NE, and NMFS LL SE) was to affect negatively the results, as will be further discussed in subsequent scenarios. However, the main conclusions from the baseline analyses for the three groupings were not affected by eliminating the shorter CPUE time series.

 

E.3. Removing the two MRFSS (Early Rec and Late Rec) series

Another set of scenarios considered by Punt consisted of eliminating the two MRFSS CPUE series (Early and Late Rec). For the large coastal complex, this modification resulted in little change with respect to the findings of the baseline analysis in Punt’s review (Table 25). The expected value of N1998/K in Punt’s analysis decreased from 0.21 to 0.20, whereas the estimates of N1998, N1998/K, and RY1998 decreased a little further, and that of MSC slightly increased, when using the Prager form SPM. The difference in the prediction of N1998/K between Punt’s analysis and the Prager form SPM changed little. For sandbar, eliminating the two MRFSS series resulted in an increase from 0.34 to 0.39 in N1998/K in Punt’s analysis, whereas N1998, N1998/K, MSC, and RY1998 all decreased with the Prager form SPM (Table 25). As a result, the difference in the prediction of N1998/K between Punt’s analysis and the Prager form SPM doubled.

For blacktip, elimination of the two recreational series yielded substantially lower estimated quantities in both Punt’s analysis and with the Prager form SPM. The expected value of N1998/K decreased from 0.54 to 0.37 in Punt’s analysis and from 0.56 to 0.36 with the Prager form SPM (Table 25). All quantities, except C0, decreased with the Prager form SPM. The estimates of r and RY1998 were almost identical between the two models. This scenario indicated that stock depletion was below MSC levels, and that the two recreational series (due to the strong positive trend of the Early Rec series as will be discussed later) had a strong influence in positively affecting the outcome of the analysis for blacktip (Table 25).

 

E.4. Removing the Virginia Longline fishery-independent CPUE series

To address some of the concerns in the Industry Position Statement regarding the lack of standardization of the Virginia LL series with respect to environmental conditions, Punt also considered one set of scenarios in which this CPUE series was eliminated. For the large coastal complex, this resulted in a generally more pessimistic outlook, with a slight decrease in N1998/K from 0.21 to 0.19 in Punt’s review, and a slight decrease in N1998, N1998/K, and RY1998 , and an increase in MSC, with the Prager form SPM (Table 26). For sandbar, results became much more optimistic, with N1998/K almost doubling in Punt’s review, and N1998 more than doubling, N1998/K increasing by over 75%, and MSC increasing by only about 20% increase with the Prager form SPM. Consequently, projections also became more optimistic, with no need for rebuilding (Table 26). The Virginia LL series thus had opposing effects on the results for the large coastal complex and sandbar, with a slight positive effect on the former, and a larger negative impact on the latter. No Virginia LL CPUE series was available for blacktip.

 

E.5. Splitting all commercial and recreational CPUE series in 1993

Another concern expressed in the Industry Position Statement was that the implementation of management measures in 1993 brought about changes in the fleet in subsequent years that were not accounted for in the commercial CPUE series. To evaluate this potential effect, Punt split all commercial and recreational series that included 1993, in a pre-1993 series (up to 1992) and a post-1993 series (1993 onwards). For the large coastal complex, this meant that the LPS (recreational), charter boat (recreational), and pelagic logbook (commercial) series were split in two. The results of this change were more optimistic, with an increase in N1998/K from 0.21 to 0.28 in Punt’s review and from 0.14 to 0.27 with the Prager form SPM (Table 27). N1998 and RY1998 about doubled, but the MSC increased only a little with the Prager form SPM (Table 27). Projections thus became more optimistic, with rebuilding to MSC levels taking 20 years with a no-take policy (Table 27). No CPUE series spanned 1993 for sandbar. For blacktip, the only change consisted of eliminating the first data point in the pelagic logbook series (corresponding to 1992). This change had no effect at all on output parameters or predictions in Punt’s review and almost no effect with the Prager form SPM (Table 27).

 

E.6. Ignoring all data points past 1992 in commercial and recreational CPUE series

Another related scenario investigated by Punt was to eliminate any data points in commercial or recreational CPUE series past 1992. For the large coastal complex, this resulted in the elimination of the Shark Observer and Late Rec series and the truncation of the LPS, charter boat, and pelagic logbook series in 1992. Results also became more optimistic in this scenario, with an increase in N1998/K from 0.21 to 0.31 and of RY1998 from 106 to 163 in Punt’s review. In the Prager form SPM, N1998 almost doubled, N1998/K increased from 0.14 to 0.26, RY1998 increased from 70 to 123, and MSC increased only slightly (Table 28). Projections thus became more optimistic, with rebuilding to MSC levels taking 20 years with a no-take policy (Table 28). For sandbar, this scenario resulted in the elimination of the pelagic logbook, Late Rec, and Shark Observer series. The outlook became more pessimistic, with a decrease in N1998/K from 0.34 to 0.27 and of RY1998 from 68 to 48 in Punt’s review. In the Prager form SPM, N1998 was significantly reduced, N1998/K decreased from 0.28 to 0.16, MSC decreased from 85 to 54, and RY1998 from 61 to 27 (Table 28). Projections thus became more pessimistic, with rebuilding to MSC levels taking 30 years with a no-take policy (Table 28).

For blacktip, this scenario resulted in the elimination of the Gulf Reef logs, Late Rec, and Shark Observer series and the truncation of the pelagic logbook series past 1992 (so this series was also eliminated because it then consisted only of the 1992 data point). The results of fitting the model to essentially three series only (Early Rec, NMFS LL NE, and NMFS LL SE) were much more optimistic, with an increase in N1998/K from 0.54 to 0.79 and of RY1998 from 173 to 252 in Punt’s review. Results from the Prager form SPM followed a very similar trend, with N1998 increasing appreciably, N1998/K increasing from 0.56 to 0.79, MSC, from 348 to 579, and RY1998 , from 181 to 253 (Table 28). Projections thus became even more optimistic than in the baseline analysis.

 

E.7. Using fishery-independent CPUE series only

A final scenario considered by Punt to address the effect of potential management measures introduced in 1993 was to use fishery-independent CPUE series only. For the large coastal complex, this meant using only the SC LL, Virginia LL, NMFS LL NE, and NMFS LL SE series, of which only the Virginia LL span more than four years (1974-1997). Results became only a little more optimistic in this scenario, with slight increases in N1998/K from 0.21 to 0.23 and of RY1998 from 106 to 130 in Punt’s review, and small increases in N1998 from 1307 to 1508, N1998/K from 0.14 to 0.16, MSC from 160 to 177, and RY1998 from 70 to 88 with the Prager form SPM (Table 29). Projections also became a little more optimistic as a result. For sandbar, the same four series as for the large coastal complex were used. In contrast to the findings for the large coastal complex, results became a little more pessimistic, with small decreases in N1998/K from 0.34 to 0.31 and of RY1998 from 68 to 52 in Punt’s review, and larger decreases in N1998 from 976 to 551, N1998/K from 0.28 to 0.18, MSC from 85 to 55, and RY1998 from 61 to 30 with the Prager form SPM (Table 29). Projections consequently became more pessimistic.

For blacktip, only two short fishery-independent CPUE series were available (NMFS LL NE and NMFS LL SSE, of 4- and 3-year duration, respectively). Results followed the same trend as those for the large coastal complex, becoming more optimistic. N1998/K increased from 0.54 to 0.70 and RY1998 from 173 to 220 in Punt’s review, and N1998 increased from 6942 to 7409, N1998/K from 0.56 to 0.61, and RY1998 from 181 to 197, whereas MSC slightly decreased from 348 to 338, with the Prager form SPM (Table 29). Accordingly, projections became a little more optimistic.

 

E.8. Using only specific CPUE series based on ad-hoc ranking by one of the reviewers

In his review, Trumble ranked the CPUE series according to what he considered: 1, "best data"; 2, "apparent data problems that need more justification before using, but may supply supporting information"; and 3, "apparent serious data flaws that disqualify the data from quantitative use, unless justified by additional information, but may supply supporting information". Based on these considerations, a set of sensitivity trials was run incorporating only Trumble’s "best" CPUE series (rank=1). For the large coastal complex, these included the Shark Observer, SC LL, Virginia LL, charter boat, and NMFS LL SE series. This change in the CPUE series used in fitting the model yielded much more optimistic results, with an increase in N1998 from 1307 to 3192, N1998/K from 0.14 to 0.50, MSC from 160 to 397, and RY1998 from 70 to 364 with the Prager form SPM, with the subsequent projections indicating no need to rebuild (Table 30). Note, however, that the expected value of the posterior for r was very high (r=0.33) for the large coastal shark complex. In contrast to the results from the baseline analyses (Figs. 1 and 2), the marginal posterior distributions for N1998 and N1998/K did not show strong central tendencies (Fig. 7). The marginal posterior distribution for K was also much less informative than in the baseline analyses (Fig. 7), indicating that there was little information in the data about K in this scenario. For sandbar, the "best" series included the same as those for the large coastal complex, except for the charter boat series, which was not available for sandbar. In contrast to the findings for the large coastal complex, results for sandbar exhibited little variation (Table 30).

For blacktip, using only the "best" series meant eliminating all but two of the seven available series. The two remaining series (Shark Observer and NMFS LL SE) span only four (1994-1997) and three (1995-1997) years, respectively. As for the large coastal complex, introducing this change resulted in considerably more optimistic results than in the baseline analysis (Table 30).

Another set of sensitivity trials was considered which incorporated the "best" CPUE series (1) and those ranked as "2". For the large coastal complex, in addition to the five series ranked as "1", these included the Crooke LL, NC#, LPS, Early Rec, and Late Rec series. Results became more optimistic than those in the baseline analysis, with an increase in N1998 from 1307 to 2218, N1998/K from 0.14 to 0.23, MSC from 160 to 172, and RY1998 from 70 to 115 with the Prager form SPM, with the subsequent projections becoming a little more optimistic (Table 31). For sandbar, the additional series ranked as "2" included the two Rec series (Early and Late). As with the scenario in which only the series ranked as "1" were considered, results for sandbar exhibited little variation, becoming just slightly more pessimistic (Table 31). For blacktip, the additional series ranked as "2" also included the two Rec series (Early and Late). As with the scenario in which only the series ranked as "1" were considered, results for blacktip varied considerably, becoming even more optimistic than in the baseline analysis (Table 31).

 

E.9. Using only specific CPUE series for blacktip

As discussed earlier in the Stock Assessment Models section, the CPUE series for blacktip show contradictory trends. To evaluate the effect that the series indicating opposing trends can have on results, two additional trials were run: one using only the longer series that supports a strong increasing trend in abundance (Early Rec series), and another with the six remaining series. Using the Early Rec series only resulted in more optimistic results, with an increase in N1998 from 6942 to 10777, N1998/K from 0.56 to 0.79, MSC from 348 to 592, and RY1998 from 181 to 255 with the Prager form SPM (Table 32). Using all the other series, except the Early Rec series, resulted in more pessimistic results, with a decrease in N1998 to 1886, N1998/K to 0.25, MSC to 149, and RY1998 to 88 (Table 32).

 

Conclusions

The results were not very sensitive to most of the changes introduced, but there were notable exceptions. For the large coastal complex, the only change that significantly affected results and would thus modify the conclusions from the 1998 SEW report was the use of Trumble’s "best" CPUE series. Those five series, however, include a series that was criticized by other reviewers and in the Industry Position Statement (Virginia LL), one commercial and one recreational series that were not standardized to account for the effect of fishing operational variables, and two very short fishery-independent series. It is thus arguable that the series ranked as "1" by this reviewer are actually the best. Fitting this model resulted in a very high, expected posterior mean value of r, especially for the aggregate of species that make up the large coastal shark complex. The marginal posterior distributions of several parameters of interest, particularly K, also indicated that there was little information in the data the model was fitted to. The results from this model fit should thus be interpreted cautiously.

For sandbar, the only modification that substantially affected results and conclusions was the elimination of the Virginia LL CPUE series, which essentially ratified that this series negatively affected results due to its decreasing trend over time. Without it, projections indicate that the status of sandbar is better than in the baseline analyses and there would be no need for rebuilding.

The analyses for blacktip were sensitive to a number of factors. The most important one is probably the importance function used, which can effectively change the conclusions from the 1998 SEW report, indicating that the resource would be a little above MSC. The form of the surplus production model (Prager vs. discrete) also had an effect on results, but not enough to affect conclusions. A substantial change in one of the priors (decreasing the upper bound of K) also changed the outcome and the ensuing conclusions, indicating that the resource would be below MSC. As also concluded by Punt, the method used to weight the CPUE indices had a very large effect. The equal weighting scenario indicated that the resource was well above MSC. This result was ratified by the use of a separate form of the stock assessment model (state space model), different weighting scheme, and use of a different method of numerical integration for the Bayesian analysis (MCMC vs. SIR). The sensitivity of the results for blacktip to computational issues (model, importance function, method to weight CPUEs) is most likely due, at least in part, to the CPUE series used, which showed contradictory trends. The Gulf Reef logs and the Early Rec series, with their strong positive trend, directly influenced the results positively. Elimination of either of these series would affect conclusions by predicting that the resource would be well below MSC. Particular emphasis should thus be placed on selecting the CPUE series for model fitting for blacktip shark.

The CPUE time series to be used in stock assessments should be chosen based on scientific criteria, not on subjective or arbitrary judgment. When multiple time series of abundance are available, it is also customary to assign relative weights to the various series. However, the assignment of weights to CPUE time series in stock assessments is still a controversial issue (McAllister et al. 2001). When the time series suggest contradictory trends, as was the case for blacktip, it has been suggested that the stock assessment model not be fitted to all series combined (Richards 1991), but rather to the two sets of series indicating opposing trends (McAllister et al. 2001), as was done in the present document. However, the issue of the relative credibility of the opposing sets of data persists. The discrepancy should then be formally addressed, for example by using statistical methods as described by Schnute and Hilborn (1993) and McAllister et al. (1999).

 

REFERENCES

Berger, J. O. 1985. Statistical decision theory and Bayesian analysis. 2nd ed. Springer-Verlag, New York.

Cortés, E. In press. Incorporating uncertainty into demographic modeling: application to shark populations and their conservation. Cons. Biol.

Gilks, W. R., S. Richardson, and D. J. Spiegelhalter. 1996. Markov chain Monte Carlo in practice. Chapman and Hall, London, U.K.

Hilborn, R. and M. Mangel. 1997. The ecological detective. Princeton University Press, Princeton, New Jersey. 315 pp.

McAllister, M. K. and E. A. Babcock. In preparation. Bayesian surplus production model with the Sampling Importance Resampling algorithm: a User’s Guide.

McAllister, M. K. and G. P. Kirkwood. 1998. Bayesian stock assessment: a review and example application using the logistic model. ICES J. Mar. Sci. 55:1031-1060.

McAllister, M. K., E. A. Babcock, and E. K. Pikitch. 2001. Evaluating the relative merits of alternative methods to weight different time series of abundance indices in stock assessment. Col. Vol. Sci. Pap. ICCAT 52:1094-1115.

McAllister, M. K., E. A. Babcock, E. K. Pikitch, and R. Bonfil. In preparation. Importance sampling issues in the 1998 large coastal shark assessment.

McAllister, M. K., E. K. Pikitch, and E. A. Babcock. 2001. Using demographic methods to construct Bayesian priors for the intrinsic rate of increase in the Schaefer model and implications for stock rebuilding. Can. J. Fish. Aquat. Sci. 58:1871-1890.

McAllister, M. K., P. J. Starr, V. Restrepo, and G.P. Kirkwood. 1999. Formulating quantitative methods to evaluate fishery management systems: what fishery processes should be modeled and what trade-offs should be made? ICES J. Mar. Sci. 56:900-916.

Meyer, R. and R. B. Millar. 1999a. BUGS in Bayesian stock assessments. Can. J. Fish. Aquat. Sci. 56:1078-1086.

Meyer, R. and R. B. Millar. 1999b. Bayesian stock assessment using a state-space implementation of the delay difference model. Can. J. Fish. Aquat. Sci. 56:37-52.

Millar, R. B. and R. Meyer. 1999. Nonlinear state-space modeling of fisheries biomass dynamics using Metropolis-Hastings within Gibbs sampling. Tech. Rep. STAT9901. Department of Statistics, University of Auckland, Auckland, New Zealand.

NMFS (National Marine Fisheries Service). 1993. Fishery Management Plan for Sharks of the Atlantic Ocean. NOAA/NMFS, U.S. Dept. of Commerce.

NMFS (National Marine Fisheries Service). 1998. 1998 Report of the Shark Evaluation Workshop. 109pp.

Prager, M. H. 1994. A suite of extensions to a nonequilibrium surplus production model. Fish. Bull. 92:374-389.

Punt, A. E. 1988. Model selection for the dynamics of southern African hake resources. MSc thesis, Department of Applied Mathematics, University of Capetown. 395 pp.

Richards, L. J. 1991. Use of contradictory data sources in stock assessments. Fish. Res. 11:225238.

SB-IV-10. Cortés, E. and G. Scott. Rates of increase per generation for large coastal species of sharks from the U.S. Atlantic Ocean and Gulf of Mexico.

SB-IV-31. NMFS. 1996 Report of the Shark Evaluation Workshop.

Schnute, J. T. and R. Hilborn. 1993. Analysis of contradictory data sources in fish stock assessments. Can. J. Fish. Aquat. Sci. 52:2063-2077.

Smith, S. E., D. W. Au, and C. Show. 1998. Intrinsic rebound potentials of 26 species of Pacific sharks. Mar. Freshwater Res. 49:663-678.

Spiegelhalter D., A. Thomas, and N. Best. 2000. WinBUGS version 1.3. User Manual. April 2000.

Walters, C. J. and D. Ludwig. 1994. Calculation of Bayes posterior probability distributions for key population parameters: a simplified approach. Can. J. Fish. Aquat. Sci. 51:713-722.