Skip to main content
Advanced Search

Filters: Tags: Regression analysis (X)

65 results (11ms)   

Filters
Date Range
Extensions
Types
Contacts
Categories
Tag Types
Tag Schemes
View Results as: JSON ATOM CSV
A generalized least squares (GLS) technique was used with at-site n-day low-flow time series and frequencies, from SWToolbox (Kiang and others, 2018), and 14 basin characteristics (basin perimeter, drainage area, maximum basin elevation, mean total annual reference evapotranspiration, groundwater head, length of longest flow path, minimum basin elevation, runoff-curve number, relief, ruggedness, slope ratio, percentage of hydrologic soil type B, available water capacity, and total length of streams in a basin) for 28 selected streamflow gaging stations in Puerto Rico to calculate regional n-day low-flow frequency regression equations for ungaged stream locations. The GLS approach assigns weights based on uncertainty...
thumbnail
Regional regression equations were calculated in Puerto Rico with generalized least squares techniques to estimate flood frequency statistics at ungaged locations using drainage area as the only explanatory variable. The island was divided into 2 regions to minimize residuals. The region division that resulted in lower and more balanced residuals runs primarily north-south near the center of the island, mostly along an 8-digit hydrologic unit code (HUC8) boundary. The division line runs through a HUC8 polygon on the southern end of the island, but care was taken to include entire watersheds and consideration was given where hydrologic and physiographic properties differed. This data release includes geographic information...
thumbnail
Site-specific multiple linear regression models were developed for one beach in Ohio (three discrete sampling sites) and one beach in Pennsylvania to estimate concentrations of Escherichia coli (E. coli) or the probability of exceeding the bathing-water standard for E. coli in recreational waters used by the public. Traditional culture-based methods are commonly used to estimate concentrations of fecal indicator bacteria, such as E. coli; however, results are obtained 18 to 24 hours post sampling and do not accurately reflect current water-quality conditions. Beach-specific mathematical models use environmental and water-quality variables that are easily and quickly measured as surrogates to estimate concentrations...
thumbnail
Three regression models were developed for describing variability of host-specific flux data in eight Great Lakes tributaries. Models include one for Lachnospiraceae flux, one for human bacteroides flux, and one for ruminant bacteroides flux. Models were developed using the R project for statistical computing with core functionality and the survival, smwrBase, and smwrQW packages. Predictor variables for these models are included in the data file and input files provided. These include sampling dates and times, rainfall depth, percent of watershed underlain by tile drainage, cattle population (for the ruminant marker model) and human population (for the human marker models). Sample collection and laboratory analyses...
thumbnail
This dataset summarizes various sedimentary data from ten U.S. Geological Survey (USGS) surface water sites across the contiguous United States. These sites include: 01648010 Rock Creek at Joyce Road, Washington, DC 05586300 Illinois River at Florence, Illinois 06731000 Cherry Creek below Cherry Creek Lake, Colorado 06807000 Missouri River at Nebraska City, Nebraska 06935965 Missouri River at St. Charles, Missouri 08374550 Rio Grande near Castolon, Texas 08375300 Rio Grande at Rio Grande Village, Big Bend National Park, Texas 09404200 Colorado River above Diamond Creek near Peach Springs, Arizona 11447650 Sacramento River at Freeport, California 12046260 Elwha River at diversion near Port Angeles, Washington The...
thumbnail
Measures used to assess trends in the 10th, 50th, and 90th quantiles of annual peak streamflow from 1916-2015 at 2,683 U.S. Geological Survey stations and within 191 4-digit HUCs in the conterminous United States. Linear quantile regression was applied to the selected quantiles of log-transformed annual peak streamflow to represent trends for a range of flood frequencies from small, common floods to large, infrequent floods. Comparative trends in pairs of quantiles were characterized as coherent, convergent, or divergent by comparing the slopes of linear quantile regression equations.
thumbnail
This data release contains trend results computed on the basis of modeled and observed daily streamflows at 502 reference gages across the conterminous U.S. from October 1, 1983 through September 30, 2016. Modeled daily streamflows were computed using the deterministic Precipitation Runoff Modeling System (PRMS), and five statistical techniques: Nearest-Neighbor Drainage Area Ratio (NNDAR), Map-Correlation Drainage Area Ratio (MCDAR), Ordinary Kriging of the logarithms of discharge per unit area (OKDAR), Nearest-Neighbor nonlinear spatial interpolation using flow duration curves (NNQPPQ), and Map-Correlation nonlinear spatial interpolation using flow duration curves (MCQPPQ). Observed daily streamflow data for the...
thumbnail
These model archive summaries document the surrogate regression models developed to estimate 15-minute suspended-sediment concentrations at three streamgage sites in Colorado: Anthracite Creek above Mouth near Somerset, U.S. Geological Survey (USGS) site number 09132095; Muddy Creek above Paonia Reservoir, USGS site number 385903107210800; and North Fork Gunnison below Raven Gulch near Somerset, USGS site number 385553107243301. The methods used follow USGS guidance as referenced in relevant Office of Surface Water Technical Memorandum (TM) 2016.07 and Office of Water Quality TM 2016.10, USGS Techniques and Methods, book 3, chap. C5 (Landers and others, 2016), and (or) USGS Techniques and Methods, book 3, chap....
thumbnail
These processed data and provisional codes were created to investigate seismic velocity changes associated with the collapse of Kīlauea caldera during its 2018 eruption. Primary data (i.e., seismic waveforms) are hosted at the Incorporated Research Institutions for Seismology (IRIS; https://www.iris.edu/) and are ingested by the codes included here to reproduce the data analyzed in Hotovec-Ellis et al., 'Earthquake-derived seismic velocity changes during the 2018 caldera collapse of Kīlauea volcano.' The included code ('cwire' short for Coda Wave Interferometry with Repeating Earthquakes) takes a catalog of earthquakes clustered by waveform similarity (e.g., REDPy, https://github.com/ahotovec/REDPy/) and processes...
thumbnail
This data release supports the following publication: Senior, Lisa A., 2017, Estimated Fecal Coliform Bacteria Concentrations Using Near Real-Time Water-Quality and Streamflow Data From Five Stream Sites in Chester County, Pennsylvania, 2007–16: U.S. Geological Survey Scientific-Investigations Report 2017–5075 (https://doi.org/10.3133/sir20175075). The U.S. Geological Survey (USGS), in cooperation with the Chester County Health Department (CCHD) and the Chester County Water Resources Authority (CCWRA), has collected discrete stream samples for analysis of fecal coliform concentrations during March–October annually at or near five gaging stations where near real-time continuous data on stream discharge, turbidity,...
thumbnail
This data set contains daily survival rates from 81 studies of passerine obligate grassland bird species that primarily breed in the United States; patch size information was extracted when available. Temperature and precipitation variables were calculated for the study sites and years the data were collected. The studies we used collected data across the period 1978 to 2013.
thumbnail
This data release presents catch and effort data for Cisco Coregonus artedi and Lake Whitefish Coregonus clupeaformis commercial gill net fisheries in State of Michigan waters of Lakes Superior, Michigan, and Huron during 1929-1970. The data were used to determine if Cisco and Lake Whitefish relative abundance (commercial gill net catch per effort) were correlated (positive and negative) during the historical period. The file is in .csv format and contains columns for: (1) lake (LAKE); (2) commercial fishery management unit (MU); (3) year (YEAR); (4) gill net material used to target Lake Whitefish (WF_MAT); (5) gill net material used to target Cisco (CS_MAT); (6) gill net conversion factors (multipliers) used...
thumbnail
The U.S. Geological Survey (USGS), in cooperation with the Bureau of Land Management, used streamflow measurements at 11 partial-record sites and related them to nearby USGS or Idaho Power Company real-time streamgages (index sites) to provide daily mean streamflow values at ungaged (partial-record) sites within the Wild and Scenic River of the Owyhee Canyonlands Wilderness, Idaho. Daily mean streamflow was estimated by developing a regression relationship between streamflow at each partial-record site and the index site for the period of record of the index site. The regressions are then used to estimate annual and semimonthly 20-, 50-, and 80-percent exceedance probability streamflow statistics at each partial-record...
thumbnail
This data release contains input data and programs (scripts) used to estimate monthly water demand for retail customers of Providence Water, located in Providence, Rhode Island. Explanatory data and model outputs are from July 2014 through June 2021. Models of per capita (for single-family residential customers) or per connection (for multi-family residential, commercial, and industrial customers) water use were developed using multiple linear regression. The dependent variables, provided by Providence Water, are the monthly number of connections and gallons of water delivered to single- and multi-family residential, commercial, and industrial connections. Potential independent variables (from online sources) are...
thumbnail
Peak-flow frequency analysis is crucial in various water-resources management applications, including floodplain management and critical structure design. Federal guidelines for peak-flow frequency analyses, provided in Bulletin 17C, assume that the statistical properties of the hydrologic processes driving variability in peak flows do not change over time and so the frequency distribution of annual peak flows is stationary. Better understanding of long-term climatic persistence and further consideration of potential climate and land-use changes have caused the assumption of stationarity to be reexamined. This data release contains input data and results of a study investigating hydroclimatic trends in peak streamflow...
thumbnail
This model archive summary documents the suspended-sediment concentration (SSC) model developed to estimate 15-minute SSC at Anthracite Creek above Mouth near Somerset, U.S. Geological Survey (USGS) site number 09132095. The methods used follow USGS guidance as referenced in relevant Office of Surface Water Technical Memorandum (TM) 2016.07 and Office of Water Quality TM 2016.10, USGS Techniques and Methods, book 3, chap. C5 (Landers and others, 2016), and USGS Techniques and Methods, book 3, chap. C4 (Rasmussen and others, 2009). A total of 399 suspended-sediment samples were collected during the calibration period (43 cross-section and 356 single-station samples). Forty-one of these samples with associated streamflow...
An ordinary least squares (OLS) technique was used with at-site flow-duration exceedance probabilities, from SWToolbox (Kiang and others, 2018), and 14 basin characteristics (basin perimeter, drainage area, maximum basin elevation, mean total annual reference evapotranspiration, groundwater head, length of longest flow path, minimum basin elevation, runoff-curve number, relief, ruggedness, slope ratio, percentage of hydrologic soil type B, available water capacity, and total length of streams in a basin) for 28 selected streamflow gaging stations in Puerto Rico to calculate regional flow-duration regression equations for ungaged stream locations. The final flow-duration regression equations were developed in R (R...
thumbnail
These data are a species-level classification map of riparian vegetation in the Colorado River riparian corridor in Grand Canyon, Arizona, USA. The classification is derived from 0.2 m pixel resolution multispectral aerial imagery acquired in May 2013. The classification spans the riparian zone of the river corridor between Glen Canyon Dam near Page, Arizona, and Lake Mead at Pearce Ferry, Arizona. The classification is divided into 5 distinct reaches of the river: Glen Canyon, Marble Canyon, Eastern Grand Canyon, Western Grand Canyon upstream of Diamond Creek, and Western Grand Canyon downstream of Diamond Creek. The method used for classification was a combination of supervised Classification And Regression Tree...
thumbnail
This dataset is one of many used in the development of the manuscript 'Advancing Cave Detection using Terrain Analysis Techniques and Thermal Imagery' by Wynne et al. 2021. Manuscript Abstract: Since the initial experiments nearly 50 years ago, techniques for detecting caves using airborne and spacecraft acquired thermal imagery have improved markedly. These advances are largely due to a combination of higher instrument sensitivity, modern computing systems, and processor-intensive analytical techniques. Through applying these advancements, our goals were to: (1) determine the utility of methods designed for terrain analysis and applied to thermal imagery; (2) analyze the usefulness of predawn and midday imagery...
thumbnail
For more than 100 years, the Permian Basin has been an important source of oil and gas produced from conventional reservoirs; directional drilling combined with hydraulic fracturing has greatly increased production in the past 10 years to the extent that the Permian Basin is becoming one of the world’s largest continuous oil and gas (COG) producing fields (U.S. Energy Information Administration, 2020). These recent techniques extract oil and gas by directionally drilling and hydraulically fracturing the surrounding reservoir rock. The extraction of COG by using these techniques requires large volumes of water and estimates of the total water volume used in COG require a comprehensive assessment to determine the...


map background search result map search result map Regression models and associated data for describing variability of host specific bacteria fluxes in eight Great Lakes tributaries, 2011-2013 Archival data for regression models developed to estimate fecal coliform concentrations at five stream sites, Chester County, Pennsylvania (2017) Daily Survival Rates of Grassland Passerines and Associated Weather Variables (1978-2013) Riparian species vegetation classification data for the Colorado River within Grand Canyon derived from 2013 airborne imagery Trends in annual peak streamflow quantiles for 2,683 U.S. Geological Survey streamgages in the conterminous United States Modeled and observed trends at reference basins in the conterminous U.S. from October 1, 1983 through September 30, 2016 Surrogate regression models for computation of time series suspended-sediment concentrations at Muddy Creek above Paonia Reservoir, Anthracite Creek above mouth near Somerset, and North Fork Gunnison below Raven Gulch near Somerset, Colorado Model Archive Summary for Suspended-Sediment Concentration at U.S. Geological Survey Site 09132095; Anthracite Creek above Mouth near Somerset, Colorado Data for Field Evaluation of the Sequoia Scientific LISST-ABS Acoustic Backscatter Sediment Sensor R scripts and results of estimated water use associated with continuous oil and gas development, Permian Basin, United States, 2010–19 (ver. 2.0, April, 2022) Regions for regional regression equations Data for multiple linear regression models for estimating Escherichia coli (E. coli) concentrations or the probability of exceeding the bathing-water standard at recreational sites in Ohio and Pennsylvania as part of the Great Lakes NowCast, 2019 Puerto Rico N-day Low-Flow Regression Files Puerto Rico Flow-Duration Regression Files Catch and Effort Data for Cisco and Lake Whitefish Commercial Gill Net Fisheries in State of Michigan Waters of Lakes Superior, Michigan, and Huron During 1929–1970 Streamflow regressions and annual and semimonthly exceedance probability statistics for wild and scenic rivers, Owyhee Canyonlands Wilderness, Idaho Time series of seismic velocity changes during the 2018 collapse of Kīlauea volcano derived from coda wave interferometry of repeating earthquakes Aircraft-Borne Thermal Imagery and Derived Terrain Analysis Layers, Pisgah Lava Field, California (COPY) Peak Streamflow Data, Climate Data, and Results from Investigating Hydroclimatic Trends and Climate Change Effects on Peak Streamflow in the Central United States, 1921–2020 Data for Regression Models to Estimate Water Use in Providence, Rhode Island, 2014-2021 Aircraft-Borne Thermal Imagery and Derived Terrain Analysis Layers, Pisgah Lava Field, California (COPY) Time series of seismic velocity changes during the 2018 collapse of Kīlauea volcano derived from coda wave interferometry of repeating earthquakes Data for Regression Models to Estimate Water Use in Providence, Rhode Island, 2014-2021 Archival data for regression models developed to estimate fecal coliform concentrations at five stream sites, Chester County, Pennsylvania (2017) Regions for regional regression equations Puerto Rico N-day Low-Flow Regression Files Puerto Rico Flow-Duration Regression Files Streamflow regressions and annual and semimonthly exceedance probability statistics for wild and scenic rivers, Owyhee Canyonlands Wilderness, Idaho Riparian species vegetation classification data for the Colorado River within Grand Canyon derived from 2013 airborne imagery Data for multiple linear regression models for estimating Escherichia coli (E. coli) concentrations or the probability of exceeding the bathing-water standard at recreational sites in Ohio and Pennsylvania as part of the Great Lakes NowCast, 2019 Surrogate regression models for computation of time series suspended-sediment concentrations at Muddy Creek above Paonia Reservoir, Anthracite Creek above mouth near Somerset, and North Fork Gunnison below Raven Gulch near Somerset, Colorado Regression models and associated data for describing variability of host specific bacteria fluxes in eight Great Lakes tributaries, 2011-2013 R scripts and results of estimated water use associated with continuous oil and gas development, Permian Basin, United States, 2010–19 (ver. 2.0, April, 2022) Catch and Effort Data for Cisco and Lake Whitefish Commercial Gill Net Fisheries in State of Michigan Waters of Lakes Superior, Michigan, and Huron During 1929–1970 Peak Streamflow Data, Climate Data, and Results from Investigating Hydroclimatic Trends and Climate Change Effects on Peak Streamflow in the Central United States, 1921–2020 Data for Field Evaluation of the Sequoia Scientific LISST-ABS Acoustic Backscatter Sediment Sensor Trends in annual peak streamflow quantiles for 2,683 U.S. Geological Survey streamgages in the conterminous United States Modeled and observed trends at reference basins in the conterminous U.S. from October 1, 1983 through September 30, 2016 Daily Survival Rates of Grassland Passerines and Associated Weather Variables (1978-2013)