Skip to main content

An example data set for exploration of Multiple Linear Regression

Dates

Publication Date
Start Date
1956
End Date
2016

Citation

Farmer, W.H., 2019, An example data set for exploration of Multiple Linear Regression: U.S. Geological Survey data release, https://doi.org/10.5066/P9T5ZEXV.

Summary

This data set contains example data for exploration of the theory of regression based regionalization. The 90th percentile of annual maximum streamflow is provided as an example response variable for 293 streamgages in the conterminous United States. Several explanatory variables are drawn from the GAGES-II data base in order to demonstrate how multiple linear regression is applied. Example scripts demonstrate how to collect the original streamflow data provided and how to recreate the figures from the associated Techniques and Methods chapter.

Contacts

Point of Contact :
William H Farmer
Originator :
William H Farmer
Metadata Contact :
William H Farmer
Publisher :
U.S. Geological Survey
Distributor :
U.S. Geological Survey - ScienceBase
SDC Data Owner :
Office of Planning and Programming
USGS Mission Area :
Water Resources

Attached Files

Click on title to download individual files attached to this item.

getData.R 8.15 KB
makeFigures.R 14.11 KB
reg_data.csv 23.98 KB
streamflow_cfs.csv 263.46 MB
README.txt 5.49 KB

Purpose

The purpose of this data is to allow users to reproduce examples and figures from the Techniques and Methods chapter Regionalization of Surface-Water Statistics using Multiple Linear Regression.

Map

Communities

  • USGS Data Release Products

Tags

Provenance

Additional Information

Identifiers

Type Scheme Key
DOI https://www.sciencebase.gov/vocab/category/item/identifier doi:10.5066/P9T5ZEXV

Item Actions

View Item as ...

Save Item as ...

View Item...