|
publications > report > using statistical models to simulate salinity variation and other physical parameters in north Florida Bay
Critical Ecosystem Studies Initiative Final Project Report
Using Statistical Models to Simulate Salinity Variation and other Physical Parameters in North Florida Bay
Cooperative Agreement Number 1443CA528001020 Amendment/ Modification 0004
Between
The United States Department of the Interior National Park Service
Everglades National Park
And
Cetacean Logic Foundation, Inc.
FINAL PROJECT REPORT
Critical Ecosystem Studies Initiative
Project Period October 1, 2002 through April 30, 2004
Frank E. Marshall, III
Principal Investigator
DeWitt Smith
Project Coordinator
David Nickerson
Review
April 30, 2004
Table of Contents
I. Introduction
II. Study Area and Data Set
III. Residuals Analysis and Variable Significance Level Evaluation
IV. Observed Versus Model-Produced Data for Model Development
V. MLR Salinity Models for the IOP Evaluation
VI. Additional MLR Salinity Models
VII. Extended Period Models
VIII. Coupling the 2X2 Model and MLR Salinity Models for Salinity Simulations
IX. Model Error Statistics
X. Presentations
XI. Discussion
XII. Summary and Findings
XIII. Literature Cited
APPENDICES - (Appendices are available separately and can be obtained by contacting Frank E. Marshall, III)
Appendix A. Residual Plots
Appendix B. 2X2 Model Calibration / Verification Plots
Appendix C. Salinity Simulations for 95 Restudy
Appendix D. Salinity Simulations for 2000 CERP
Appendix E. Salinity Simulations for NSM 4.5
Appendix F. Salinity Simulations for NSM 4.6
I. Introduction
This report describes the activities of the second year of Critical Ecosystem Studies Initiative (CESI) research into the use of statistical models to simulate salinity in Florida Bay. The activities of the first year of CESI work are presented in Salinity Simulation Models for North Florida Bay Everglades National Park (Marshall, et al 2003a). Because this report describes follow-on activities, reference to the first year report may be needed to understand all of the background for the second year of work. During the first year of this investigation, two types of statistical modeling procedures were found to be well-suited for use with time series data - SARIMA models (seasonal autoregressive integrated moving average models) and multivariate linear regression models (MLR models). SARIMA models were found to be useful for one-step forward predictions, but for other simulation purposes, MLR models were found to be much easier to use and almost as robust to the idiosyncrasies of time series data. Since the models are intended for use with the output from the South Florida Water Management District (SFWMD) Everglades watershed model (South Florida Water Management Model, or 2X2 model), which simulates hydrologic conditions in south Florida beginning in 1965, MLR models were selected for further development.
Tasks in the original Project Description for the Year 2 work are summarized as follows:
- Coordinate with SFWMD to obtain 2X2 model data and evaluate the uncertainty in the 2X2 model simulations;
- Obtain any other data that may be needed to use MLR models with 2X2 model data;
- In conjunction with ENP staff, prepare MLR models of salinity, water level, or flow;
- Simulate salinity, water level, or flow using the 2X2 Natural System Model and other appropriate input parameters;
- Decode a previously prepared SARIMA model; and
- Prepare draft and final reports.
As the project progressed considerable experience and feedback were gained using MLR salinity models. Additionally, when the uncertainty in the 2X2 model output was evaluated, the structure of the MLR salinity models was modified, as described further in the following sections of this report.
From the onset of the CESI work on statistical models, it was hoped that they would prove useful for simulating salinity in Florida Bay for the Initial CERP (Comprehensive Everglades Restoration Plan) Update (ICU) evaluations. However, when the Year 2 work began, it did not appear that the MLR models would be ready for this purpose, and the scope of the Year 2 project was made intentionally broad to investigate the use of MLR models to simulate other parameters besides salinity, such as water level or flow. As the project progressed it became clear that MLR models were capable of making acceptable simulations of salinity such that different water management schemes could be evaluated, and that the revised schedule for the ICU evaluations was going to make it possible for MLR models to be used for the evaluations. Therefore, the work being done was concentrated on the development of MLR models for that purpose (ICU evaluations).
The first tasks were completed as scheduled. Residual plots were examined and the updated dataset was assembled. In the midst of completing the project tasks, a need for the MLR models developed at Everglades National Park for use with the Interim Operational Plan (IOP) evaluation Congressional Report. ENP was tasked with analyzing the water management regimes that had been modified to lessen the impact of flow diversions on the Cape Sable seaside sparrow. MLR models were developed for use with these evaluations, and valuable experience was gained that has benefited the CESI project. In order to complete the IOP evaluations, a six-month extension of this CESI project was requested and granted. The IOP evaluation model development procedure also allowed the project dataset used for model development to be lengthened.
One committee that is charged with completing the ICU evaluations is the Southern Estuaries Sub-team of RECOVER. Beginning in the spring of 2003, the Principal Investigator has been coordinating with the Sub-team, preparing to use the models developed by this CESI project for evaluating the established salinity performance measures for Florida Bay and the southwest coast. At the time of preparation of this report, the models presented herein are intended to be used by the Sub-team in this manner.
When work re-started on this CESI project following the IOP evaluation activities, the tasks to be completed were officially modified to take into account the focus on modeling for ICU evaluation purposes. The revised Project Description included the following revised list of tasks:
- Contact / meet with SFWMD staff to coordinate the acquisition of 2X2 model output and additional information about the modeling procedure.
- Acquire the other data needed to create a complete input data set for running the multivariate linear regression models with 2X2 model output, including the historical record for wind at Key West and Miami weather stations.
- Eliminate flow parameters from MLR salinity models.
- Meet with the Southern Estuaries Sub-team to obtain their needs for MLR salinity models for ICU performance measure evaluations.
- Prepare MLR salinity models for Joe Bay, Little Madeira Bay, Terrapin Bay, Garfield Bight and North River using 2X2 model output to calibrate the MLR salinity models. This task was completed. However, because the SFWMD updated the 2X2 model output subsequent to their development, these models were rendered obsolete. All future MLR salinity models will be developed from real (observed) data.
- Adapt the IOP models prepared from observed data for use with the Southern Estuaries Sub-team performance measure evaluations by expanding the data used for model development, where possible for Little Madeira Bay, Terrapin Bay, Whipray Basin, Butternut Key, and Duck Key.
- Prepare new MLR models using the longest period of record available for Taylor River, Little Blackwater Sound, Highway Creek, and Bob Allen Key.
- Run simulations at all stations using the following 2X2 model runs: NSM 4.5, NSM 4.6, 95 Restudy, and 2000 CERP. These are the same runs being made by the Southern Estuaries Sub-team at other stations.
- Evaluate the level of uncertainty in the models and in the simulations. Some statistical tests that may be used include the mean error, mean absolute error, root mean square error, maximum absolute error, relative mean error and relative absolute mean error.
- Prepare draft and final Project Reports describing the activities that were completed and present the findings.
Details on the activities of these revised tasks are presented in this report.
II. Study Area and Data Set
 |
| Figure 1. The Everglades and Florida Bay Study Area Showing Monitoring Stations [larger image] |
The study area for this CESI project encompasses northeastern, north, and central Florida Bay; the extreme southwestern coast of the Florida; and the Everglades watershed within Everglades National Park. This modeling effort utilized data that have been collected at 15 to 60 minute increments and averaged to daily and monthly values. Salinity data is taken from the ENP Marine Monitoring Network (MMN) data base. The stage data are ENP Physical Monitoring Network Everglades water levels. Details about these data can be found in Everglades National Park (1997a and 1997b), and Smith (1997, 1998, 1999, and 2001). To these data other time series data were added, including wind data from the National Weather Service (Southeast Regional Climate Center), and water level data collected at Key West from the National Ocean Service. Wind data from Key West and Miami were used as these locations had the longest continuous records for wind and were considered to be representative of the regional wind patterns. Sea level data from Key West were considered to be representative of the average effect of oceanic water level influences, and, to some extent, the average water level patterns within Florida Bay.
The locations of each of the monitoring stations where water level and salinity data were collected are presented in Figure 1. The salinity monitoring stations for which MLR salinity models were prepared as part of this CESI study or the IOP evaluation (shown on Figure 1) are as follows:
- Joe Bay
- Little Madeira Bay
- Terrapin Bay
- North River
- Whipray Basin
- Duck Key
- Butternut Key
- Taylor River
- Highway Creek
- Little Blackwater
- Bob Allen Key
- Long Sound.
Continuous water level records in the Everglades begin in the 1950's in some locations, but most stage records date from the 1990's. Continuous salinity data extend back to 1988 at several locations in northeast Florida Bay. Because the shortest data record (for E146) begins on March 24, 1994, the period of data used for most of these modeling activities begins on this date. The period of record extends through October 31, 2002, which means that there are 3143 daily values in a record with no missing data. In reality, most data sets contained some missing values. Information on the parameters that were used for the modeling activities is presented in Table 1.
| Table 1. Summary of information about the dependent and independent variables used in model development and verification, and in simulations. |
| Variable Name |
Dependent or Independent |
Variable Type |
Units |
Data Source |
Location |
| Little Madeira Bay |
Dependent |
Salinity |
PSU |
ENP |
Near-shore Florida Bay |
| Terrapin Bay |
Dependent |
Salinity |
PSU |
ENP |
Near-shore Florida Bay |
| Long Sound |
Dependent |
Salinity |
PSU |
ENP |
Near-shore Florida Bay |
| Joe Bay |
Dependent |
Salinity |
PSU |
ENP |
Near-shore Florida Bay |
| Little Blackwater Sound |
Dependent |
Salinity |
PSU |
ENP |
Near-shore Florida Bay |
| North River |
Dependent |
Salinity |
PSU |
ENP |
Southwest Coast |
| Taylor River |
Dependent |
Salinity |
PSU |
ENP |
Mangrove Zone |
| Highway Creek |
Dependent |
Salinity |
PSU |
ENP |
Mangrove Zone |
| Whipray Basin |
Dependent |
Salinity |
PSU |
ENP |
Open Water Florida Bay |
| Duck Key |
Dependent |
Salinity |
PSU |
ENP |
Open Water Florida Bay |
| Butternut Key |
Dependent |
Salinity |
PSU |
ENP |
Open Water Florida Bay |
| Bob Allen Key, |
Dependent |
Salinity |
PSU |
ENP |
Open Water Florida Bay |
| Cp |
Independent |
Water Level |
Ft, NGVD 29 |
ENP |
Craighead Pond |
| E146 |
Independent |
Water Level |
Ft, NGVD 29 |
ENP |
Taylor Slough |
| Ever4 |
Independent |
Water Level |
Ft, NGVD 29 |
ENP |
So. Of FL City |
| Ever6 |
Independent |
Water Level |
Ft, NGVD 29 |
ENP |
So. Of FL City |
| Ever7 |
Independent |
Water Level |
Ft, NGVD 29 |
ENP |
So. Of FL City |
| G3273 |
Independent |
Water Level |
Ft, NGVD 29 |
ENP |
East of S.R. Slough |
| NP206 |
Independent |
Water Level |
Ft, NGVD 29 |
ENP |
East of S.R. Slough |
| NP46 |
Independent |
Water Level |
Ft, NGVD 29 |
ENP |
Rocky Glades |
| NP62 |
Independent |
Water Level |
Ft, NGVD 29 |
ENP |
East of S.R. Slough |
| P33 |
Independent |
Water Level |
Ft, NGVD 29 |
ENP |
Shark River Slough |
| P35 |
Independent |
Water Level |
Ft, NGVD 29 |
ENP |
Shark River Slough |
| P37 |
Independent |
Water Level |
Ft, NGVD 29 |
ENP |
Taylor Slough |
| P38 |
Independent |
Water Level |
Ft, NGVD 29 |
ENP |
Shark River Slough |
| R127 |
Independent |
Water Level |
Ft, NGVD 29 |
ENP |
Taylor Slough |
| uwndkw |
Independent |
E-W Wind |
N/A |
NWS |
Key West |
| vwndkw |
Independent |
N-S Wind |
N/A |
NWS |
Key West |
| uwndmia |
Independent |
E-W Wind |
N/A |
NWS |
Miami |
| vwndmia |
Independent |
N-S Wind |
N/A |
NWS |
Miami |
| Kwwatlev |
Independent |
Tide Elevation |
Ft, NGVD 29 |
NOS |
Key West |
| Table 2. Comparison of Model Uncertainty Statistics for MLR Salinity Models |
| station |
mean sq error (mse), psu2 |
root mse (rmse), psu |
adj R-sq |
mean error, psu |
mean abs error, psu |
max abs error, psu |
Nash-Sutcliffe Efficiency |
rel mean error |
rel mean abs error |
rel mean sq error |
| Joe Bay |
25.8 |
5.1 |
0.75 |
-0.14 |
3.7 |
20.6 |
0.76 |
0.012 |
0.32 |
0.14 |
| Little Madeira Bay |
20.3 |
4.5 |
0.56 |
0.56 |
3.5 |
15.4 |
0.55 |
0.027 |
0.18 |
0.29 |
| Terrapin Bay |
32.6 |
5.7 |
0.75 |
-0.99 |
5.4 |
5.4 |
0.67 |
0.044 |
0.24 |
0.2 |
| Whipray Basin |
7.2 |
2.7 |
0.8 |
0.11 |
2.2 |
10.1 |
0.77 |
0.034 |
0.07 |
0.12 |
| Duck Key |
9.7 |
3.1 |
0.71 |
-0.18 |
2.27 |
14.4 |
0.71 |
-0.007 |
0.09 |
0.17 |
| Butternut Key |
10.7 |
3.3 |
0.65 |
0.1 |
2.7 |
11.3 |
0.66 |
0.003 |
0.1 |
0.21 |
| Long Sound |
15 |
3.9 |
0.8 |
0.31 |
2.7 |
18.9 |
0.81 |
0.021 |
0.18 |
0.11 |
| Taylor River |
21.4 |
4.62 |
0.78 |
-0.49 |
3.6 |
22.9 |
0.78 |
-0.06 |
0.47 |
0.13 |
| Highway Creek |
18.2 |
4.3 |
0.81 |
-0.95 |
3.7 |
17.7 |
0.76 |
-0.08 |
0.31 |
0.14 |
| Little Blackwater Sound |
14 |
3.75 |
0.75 |
-0.14 |
2.9 |
15.7 |
0.76 |
-0.007 |
0.16 |
0.14 |
| North River |
8.9 |
3 |
0.86 |
0.19 |
2.3 |
18.1 |
0.78 |
0.03 |
0.35 |
0.11 |
| Bob Allen Key |
7.2 |
2.7 |
0.79 |
0.3 |
2.1 |
9.2 |
0.81 |
0.01 |
0.065 |
0.12 |
|
|
| Figure 2. Joe Bay salinity model development plot (IOP) - Calibration is March 24, 1995 - October 31, 2002; Verification is March 24, 1994 - March 23, 1995. [larger image] |
|
|
| Figure 3. Little Madeira Bay salinity model development plot (IOP) - Calibration is March 24, 1995 - October 31, 2002; Verification is March 24, 1994 - March 3, 1995. [larger image] |
|
|
| Figure 4. Terrapin Bay salinity model development plot (IOP) - Calibration is March 24, 1995 - October 31, 2002; Verification is March 24, 1994 - March 3, 1995. [larger image] |
|
|
| Figure 5. Long Sound salinity model development plot (IOP) - Calibration is March 24, 1995 - October 31, 2002; Verification is March 24, 1994 - March 3, 1995. [larger image] |
|
|
| Figure 6. North River salinity model development plot (IOP) - Calibration is March 24, 1995 - October 31, 2002; Verification is March 24, 1994 - March 3, 1995. [larger image] |
|
|
| Figure 7. Whipray Basin salinity model development plot (IOP) - Calibration is March 24, 1995 - October 31, 2002; Verification is March 24, 1994 - March 3, 1995. [larger image] |
|
|
| Figure 8. Duck Key salinity model development plot (IOP) - Calibration is March 24, 1995 - October 31, 2002; Verification is March 24, 1994 - March 3, 1995. [larger image] |
|
|
| Figure 9. Butternut Key salinity model development plot (IOP) - Calibration is March 24, 1995 - October 31, 2002; Verification is March 24, 1994 - March 3, 1995. [larger image] |
|
|
| Figure 10. Comparison of Observed and Simulated Data for the Taylor River MLR Model - Calibration is March 24, 1995 - October 31, 2002; Verification is March 24, 1994 - March 3, 1995. [larger image] |
|
|
| Figure 11. Comparison of Observed and Simulated Data for the Highway Creek MLR Model - Calibration is March 24, 1995 - October 31, 2002; Verification is March 24, 1994 - March 3, 1995. [larger image] |
|
|
| Figure 12. Comparison of Observed and Simulated Data for the Little Blackwater Sound MLR Model - Calibration is March 24, 1995 - October 31, 2002; Verification is March 24, 1994 - March 3, 1995. [larger image] |
|
|
| Figure 13. Comparison of Observed and Simulated Data for the Bob Allen Key MLR Model - Calibration is September 9, 1998- October 31, 2002; Verification is September 9, 1997 - September 8, 1998. [larger image] |
|
|
| Figure 14. Comparison of Observed and Simulated Data for the Little Madeira Bay Extended Period MLR Model - Calibration is August 25, 1988 - October 31, 2001; Verification is November 1, 2001 - October 31, 2002. [larger image] |
|
|
| Figure 15. Comparison of Observed and Simulated Data for the Terrapin Bay Extended Period MLR Model - Calibration is September 12, 1991 - October 31, 2001; Verification is November 1, 2001 - October 31, 2002. [larger image] |
|
|
| Figure 16. Comparison of Observed and Simulated Data for the Whipray Basin Extended Period MLR Model - Calibration is September 12, 1991 - October 31, 2001; Verification is November 1, 2001 - October 31, 2002. [larger image] |
|
|
| Figure 17. Comparison of Observed and Simulated Data for the Duck Key Extended Period MLR Model - Calibration is September 12, 1991 - October 31, 2001; Verification is November 1, 2001 - October 31, 2002. [larger image] |
|
|
| Figure 18. Comparison of Observed and Simulated Data for the Butternut Key Extended Period MLR Model - Calibration is September 12, 1991 - October 31, 2001; Verification is November 1, 2001 - October 31, 2002. [larger image] |
|