Kansas Real-Time Water Quality

Continuous Real-Time Water-Quality Monitoring of Kansas Streams

by Victoria G. Christensen, Andrew C. Ziegler, Patrick P. Rasmussen, and Xiaodong Jian

Originally presented at American Water Resources Association, 2003 meeting. Download as PDF (505 KiB).

Abstract

A continuous, real-time water-quality monitoring system was developed for 13 stream sites in Kansas that eliminates the waiting time inherent in chemical analyses reported by a laboratory and provides continuous estimates of constituent concentrations and loads. The U.S. Geological Survey (USGS) monitoring system is described, and its effectiveness in characterizing water quality is evaluated using discrete water-quality samples collected from 1995 through 2002 and continuous water-quality monitor data from the first 4 years of operation with examples from two stream sites on the Little Arkansas River in south-central Kansas. Because sensor technology currently is not available to directly measure many chemicals of interest in a stream, regression models are developed to relate constituents in laboratory-analyzed samples with in-stream continuous-sensor measurements. Concentration estimates traditionally have used continuous streamflow data only; however, most constituents are more accurately estimated with continuous specific conductance or turbidity measurements. As the hourly sensor measurements are transmitted from the stream sites to the USGS computers in Lawrence, Kansas, the models are applied, and the computed estimates displayed on a Web page at http://ks.water.usgs.gov/Kansas/rtqw/. Currently, continuous estimated concentrations and loads of sediment, major ions, select nutrients, atrazine, and indicator bacteria, and uncertainty of the estimates are displayed. Information from this system is used by water suppliers to modify treatment of water, by State and local agencies in total maximum daily load (TMDL) programs, and to alert recreational water users of potential health risks.

Results

Between 1995 and 2002, the U.S. Geological Survey (USGS), in cooperation with local, State, and Federal agencies, collected numerous discrete water samples from 13 stream sites in Kansas. Two of those sites are the Little Arkansas River near Halstead (site 07143672) and near Sedgwick (site 07144100). Samples were analyzed for sediment, major ions, nutrients, pesticides, indicator bacteria, and other constituents. Samples were collected throughout a range of streamflow conditions and constituent concentrations as described in Wilde and Radtke (1998). Both stream sites also were equipped with water-quality monitors from 1998 through 2002 that record continuous measurements of specific conductance, pH, water temperature, dissolved oxygen, and turbidity. These monitors were serviced about twice per month according to methods described in Wagner and others (2001). Regression models were developed using S-Plus Statistical Software (Mathsoft, Inc., 1999) to describe relations between the discrete samples and the continuous cross-section-averaged water-quality sensor measurements, streamflow, stage, and time (Table 1). Site-specific regression models were developed using plots of each possible explanatory variable against the response variable and visually examining the plots of the residuals for patterns. Explanatory and response variables (except time) were log transformed, if necessary, to remove curvature in the data. An overall model-building method was used (Helsel and Hirsch, 1992, p. 312-314). Generally, if there were several acceptable models (F-test p-value less than 0.05), the one with the lowest PRESS statistic was chosen. PRESS (acronym for "PRediction Error Sum of Squares") is one of the best measures of the goodness of fit of a regression model (Helsel and Hirsch, 1992, p. 248). Explanatory variables were included in a model only if there was a physical basis or explanation for their inclusion. For variables that were log transformed, retransformation of regression-estimated concentrations was necessary. However, retransformation can cause an underestimation of chemical loads when adding individual load estimates over a long period of time. A Duan's bias correction factor or smear factor (Duan, 1983) was applied to the annual load calculation to correct for this underestimation. Cohn and others (1989), and Hirsch and others (1993) provide additional information on interpreting the results of regression-based load estimates. Uncertainty of the estimates for regression models was determined using 90-percent prediction intervals (Helsel and Hirsch, 1992). Probabilities of exceeding water-quality standards, recommended criteria, or guidelines of the State of and U.S. Environmental Protection Agency also were determined. Regression methods are described in Helsel and Hirsch (1992), Hirsch and others (1993), and Christensen (2000).

Table 1. Regression Models for Continuous Concentration and Load Estimates and Load Smearing Factors for Little Arkansas River at Highway 50 near Halstead (site 07143672) and near Sedgwick (site 07144100), Kansas, 1995-2002.
[n, number of samples; R2, square of coefficient of determination; MSE, mean square error; Smear, smear correction factor; Alk, alkalinity in milligrams per liter as calcium carbonate; SC, specific conductance in microsiemens per centimeter at 25 degrees Celsius; Q, streamflow in cubic feet per second; HC03, bicarbonate in milligrams per liter; CaC03, hardness in milligrams per liter; DS, dissolved solids in milligrams per liter; TSS, total suspended solids in milligrams per liter; Turb, turbidity in nephelometric turbidity units; Ca, calcium in milligrams per liter; Na, sodium in milligrams per liter; Cl, chloride in milligrams per liter; F, fluoride in milligrams per liter; SO4, sulfate in milligrams per liter; TON, total organic nitrogen in milligrams per liter; TP, total phosphorus in milligrams per liter; As, arsenic in micrograms per liter; Atr, atrazine in micrograms per liter; D, day of year; FCB, fecal colifom bacteria in colonies per 100 milliliters; ECB, Escherichia coli bacteria in colonies per 100 milliliters]
Constituent Regression Model n R2 MSE Smear
Little Arkansas River at Highway 50 near Halstead (site 07143672)
Alkalinity log10Alk = 0.646 log10Q + 0.509 155 0.949 0.00484 1.015
Bicarbonate log10HCO3 = 0.846 log10SC - 0.179 156 0.912 0.00831 1.021
Hardness CaCO3 = 0.273 SC + 9.85 157 0.960 897 1.000
Dissolved solids DS = 0.551 SC + 25.3 158 0.977 2,310 1.000
Total suspended solids log10TSS = 0.943 log10Turb + 0.0110 48 0.933 0.0294 1.127
Suspended sediment concentration log10SSC = 0.945 log10Turb + 0.132 43 0.948 0.0201 1.047
Calcium Ca = 0.0870 SC + 2.34 157 0.955 102 1.000
Sodium Na = 0.107 SC - 15.4 157 0.966 114 1.000
Chloride Cl = 0.209 SC - 33.7 156 0.957 558 1.000
Sulfate log10SO4 = 0.900 log10SC - 1.07 158 0.885 0.0126 1.035
Total organic nitrogen log10TON = 0.439 log10Turb - 0.816 35 0.850 0.0127 1.034
Total phosphorus TP = 0.000929 Turb + 0.325 36 0.893 0.0207 1.000
Arsenic log10As = -0.227 log10Q + 1.16 33 0.663 0.0223 1.172
Atrazine log10Atr = -0.000790 SC + 0.359 sin (2πD/365) - 0.643 cos(2πD/365) + 0.430 78 0.552 0.326 1.806
Fecal coliform bacteria log10FCB = 1.13 log10Turb + 0.378 23 0.69 0.249 2.122
Escherichia coli bacteria log10ECB = 1.011 log10Turb + 0.439 23 0.63 0.290 2.282
Little Arkansas River near Sedgwick (site 07144100)
Alkalinity log10Alk = 0.671 log10SC - 0.121 log10Q + 0.574 131 0.936 0.00588 1.014
Bicarbonate log10HCO3 = 0.000462 SC - 0.174 log10Q + 2.32 131 0.913 0.00787 1.013
Hardness CaCO3 = 0.318 SC + 0.442 131 0.965 471 1.000
Dissolved solids DS = 0.562 SC + 17.2 125 0.958 518 1.000
Total suspended solids log10TSS = 1.16 log10Turb + 0.000605 SC - 0.831 54 0.911 0.0402 1.119
Suspended sediment concentration log10SSC = 0.715 log10Turb + 0.188 log10Q + 0.185 47 0.908 0.0419 1.141
Calcium Ca = 0.0979 SC - 0.0587 131 0.962 48.5 1.000
Sodium Na = 0.101 SC + 3.79 log10Q - 23.4 131 0.965 41.1 1.000
Chloride Cl = 0.203 SC + 25.7 log10Q - 112 131 0.941 207 1.000
Fluoride log10F = 0.434 log10SC - 1.75 57 0.621 0.0112 1.000
Sulfate SO4 = 0.0592 SC + 1.00 131 0.924 35.5 1.000
Total organic nitrogen TON = 0.00232 Turb + 0.863 39 0.743 0.263 1.000
Total phosphorus TP = 0.000567 Turb + 0.506 36 0.549 0.0385 1.000
Arsenic log10As = -0.250 log10Q + 1.30 34 0.682 0.021 1.041
Atrazine log10Atr = -0.000647 SC + 0.504 sin(2πD/365) - 0.745 cos(2πD/365) + 0.237 205 0.604 0.183 1.442
Fecal coliform bacteria log10FCB = 1.19 log10Turb + 0.198 28 0.79 0.210 1.579
Escherichia coli bacteria log10ECB = 1.17 log10Turb + 0.111 28 0.73 0.278 1.806

Discussion

There are several advantages to a continuous, real-time water-quality monitoring system. The first advantage is that the information is real time. This eliminates waiting time inherent in chemical analyses reported by a laboratory and provides continuous estimates of constituent concentrations and loads. The continuous estimates can assist water managers in the operation of water-treatment facilities and allow for decisions about the sanitary quality of water with respect to water recreation activities and public safety. Concentration and load information is available on the World Wide Web (http://ks.water.usgs.gov/Kansas/rtqw/) for water managers to make decisions relative to rapid changes in water quality. An example of the information available for suspended sediment is presented in Figure 1. Knowledge of the probability that current water-quality concentrations meet or exceed water-quality standards also allows the public to make decisions about the use of a stream for recreational purposes. Additionally, continuous estimates provide information that is essential to obtain accurate constituent load information. Because the data are available on an hourly basis, rapid changes in water quality can be quantified and used to more accurately estimate the daily load during storm runoff when concentrations and loads can vary by as much as four orders of magnitude. Additionally, the continuous data result in a more accurate estimation of the loads for different time periods--daily, weekly, monthly, or annually. The continuous data also can be used to construct frequency distribution (duration) curves to determine percentage of time that concentrations exceed water-quality standards or to assist with development of water-quality standards.

Graph Graph

Figure 1. Hourly Estimated Suspended-Sediment Concentrations and Duration for the Little Arkansas River near Sedgwick, Kansas, January 1, 2002, Through December 31, 2002.

Continuous streamflow information is available nationally at more than 7,000 USGS stream gages, but many constituents in streams are more strongly related to specific conductance or turbidity than to streamflow; therefore, a more accurate estimation of load is possible by using the regression models with the water-quality sensor data as explanatory variables. A previous study for calendar year 2001 indicated that differences between instantaneous measured suspended sediment load and streamflow-estimated suspended-sediment load were as great as 100 percent at some sites (Christensen and others, 2001), whereas the difference beheen measured instantaneous suspended-sediment load and turbidity-estimated suspended-sediment load generally was less than 10 percent. A similar comparison between multiple-regression-estimated and streamflow-estimated atrazine (Table 2) was made to determine if specific conductance and day of year provided better estimates of atrazine than streamflow. For both monitoring sites on the Little Arkansas River, the percentage difference was about twice as large for the streamflow-estimated load than for the specific-conductance and day-of-year-estimated load. Specific conductance and day of year provided a more accurate estimate of atrazine loads probably because the larger values of specific conductance are associated with base flows when ground water dominates and has small concentrations of atrazine. However, both models over estimate atrazine loads probably because of bias toward higher values and the seasonal application of atrazine. Improvements in sensor technology may provide better explanation of atrazine concentrations.

Table 2. Comparison of Measured Instantaneous Atrazine Loads to Streamflow- and Multiple-Regression-Estimated Atrazine Loads in the Little Arkansas River at Highway 50 near Halstead and near Sedgwick, Kansas, 1995-2002.
Variables Little Arkansas River at Highway 50 near Halstead Little Arkansas River near Sedgwick
Number of samples 78 205
Mean measured atrazine concentration (micrograms per liter) 5.19 5.29
Mean measured streamflow (cubic feet per second) 668 918
Mean measured instantaneous atrazine load (pounds per day) 23.8 28.4
Mean streamflow-estimated instantaneous atrazine load (pounds per day) 51.6 49.6
Percentage difference of streamflow-estimated load from measured load -120 -75
Mean specific-conductance-estimated instantaneous atrazine load (pounds per day) 39.6 36.0
Percentage difference of specific-conductance-estimated load from measured load -66 -27

The primary limitations for the continuous water-quality monitoring approach are associated with the water-quality sensors and the consistency of the relations with the sensor. Turbidity sensors have an upper limit of measurement depending on the instrument used (Sadar, 2002). Exceeding this upper limit of measurement prevents accurate estimation of constituent concentrations and loads during the greatest turbidities that generally occur during the greatest streamflows when the largest loads are likely to occur. This is an important consideration in the sediment-rich streams of Kansas, but may not be as important in streams that are less turbid. However, concentrations and loads can be described with greater accuracy for the days when the sensor is not maximized. The turbidity sensor was at maximum for 4 percent of the time during 2002 (Figure 1). As technology improves, sensor maximization potentially can be eliminated, and direct sensor measurement of constituents of interest made possible. Additionally, continued discrete water-quality samples are required to verify that the relations between the sensors and constituents estimated remain consistent.

Conclusions

Site-specific regression models coupled with continuous, in-stream water-quality data can estimate in-stream constituent concentrations and loads continuously and in real time. Estimated hourly constituent concentrations, loads, probability of exceeding water-quality standards, uncertainty of the estimates, and frequency distributions can be rapidly displayed on the World Wide Web. These estimates can provide water-quality information to resource managers and the public that is otherwise not available. In addition to the utility of the regression models for estimating concentrations, they also are useful for estimating total maximum daily loads (TMDLs) and to alert recreational water users of potential health risks. However, continued sampling is required to verify that the statistical models ire consistent.

This innovative approach utilizes the existing USGS stream-gaging network. In the future, real-time estimates can be developed elsewhere in the United States using the USGS national network of more than 7,000 streamflow gages to provide continuous, real-time information on streamflow and water-quality concentrations and loads in streams that can be used to improve the treatment of drinking water and to monitor the environment. This approach offers the potential to improve our understanding of the dynamics of watersheds and nonpoint-source pollution. As new sensors become available for direct measurement of constituents, the use of the network sites to provide continuous, real-time water-quality information will improve. Additional advances in technology are hoped to directly measure chemicals of interest and decrease the data uncertainty and the operation and maintenance requirements for the sensors. The increasing public interest in TMDLs and water quality in general make this approach of regional, national, and international importance.

Acknowledements

The use of brand, trade, or firm names in this paper is for identification purposes only and does not constitute endorsement by the U.S. Geological Survey. Without the dedicated field personnel to collect samples and maintain the water-quality monitors, none of this work is possible. Their efforts are greatly appreciated.

References