Filters: Tags: multivariate statistical analysis (X)

9 results (58ms)   

This USGS data release documents species distribution models for 271 fluvial fish species in their native ranges of the conterminous United States. Source data, supporting code and model results are documented in this data package. Boosted Regression Tree (BRT) models were used to develop presence/absence predictions for each of the National Hydrography Dataset Plus Version 2.1 stream segments within a species' native range. The predictions provided can be linked to the NHDPlusV2.1 geospatial dataset through the COMID to create a spatial depiction of the models. The primary results are stored in the file "BRT Predictions" and are provided in comma separated value (CSV) and Parquet file formats. Parquet file format...
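The COMID link described above can be sketched as a simple keyed join. This is a minimal illustration only: the column names (`species`, `predicted_presence`), the sample rows, and the flowline attribute `stream_name` are hypothetical and are not taken from the data release; only the use of COMID as the join key comes from the description.

```python
import csv
import io

# Hypothetical excerpt of a predictions CSV; real column names may differ.
PREDICTIONS_CSV = """COMID,species,predicted_presence
101,species_a,1
102,species_a,0
103,species_a,1
999,species_a,1
"""

# Hypothetical NHDPlusV2.1 flowline attributes, keyed by COMID (as strings).
FLOWLINES = {
    "101": {"stream_name": "A Creek"},
    "102": {"stream_name": "B Creek"},
    "103": {"stream_name": "C River"},
}


def link_predictions(predictions_csv, flowlines):
    """Attach flowline attributes to each prediction row that shares a COMID."""
    linked = []
    for row in csv.DictReader(io.StringIO(predictions_csv)):
        segment = flowlines.get(row["COMID"])
        if segment is None:
            # No matching stream segment in this flowline subset; skip the row.
            continue
        linked.append({**row, **segment})
    return linked


linked = link_predictions(PREDICTIONS_CSV, FLOWLINES)
for record in linked:
    print(record["COMID"], record["stream_name"], record["predicted_presence"])
```

With real data, the same join would typically be done in a GIS or data-frame library against the NHDPlusV2.1 flowline layer, which then gives each prediction a stream geometry for mapping.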
This USGS data release contains products that resulted from aquatic species distribution modeling in the United States on the National Hydrography Dataset Plus Version 2.1. Source data, supporting code and model results are documented in this data release. The file species_model_list.csv provides a list of the most recent models for each combination of species, habitat, and region.
This data release contains the input-data files and R scripts associated with the analysis presented in Worland and others (2018). The spatial extent of the data is the contiguous U.S. The input-data files include one comma-separated value (csv) file of county-level data and one csv file of city-level data. The county-level csv (“county_data.csv”) contains data for 3,109 counties. These data include two measures of water use, descriptive information about each county, three grouping variables (climate region, urban class, and economic dependency), and 18 explanatory variables: proportion of population growth from 2000-2010, fraction of withdrawals from surface water, average daily water yield, mean annual...
Concentrations of particulate organic carbon (POC) and dissolved organic carbon (DOC), which together comprise total organic carbon, were measured in a reconnaissance study at sampling sites in the Upper Klamath River, Lost River, and Klamath Straits Drain in 2013–16. In addition, data for total nitrogen and chlorophyll a were collected. Optical absorbance and fluorescence properties of dissolved organic matter (DOM), which contains DOC, also were analyzed. Excitation-emission matrices (EEMs) and full absorbance spectra were produced for each sample. The EEMs were compiled, and key data points and regions of the spectra were extracted from each site. Parallel factor analysis was used to decompose the optical fluorescence...
These data describe the percent of cropland harvested as wheat, corn, and soybean within each basin (basins 1-8, see accompanying shapefiles). Data are available for other crops; however, these three were chosen because wheat is a traditional crop that has been grown for a long time in the Basin, and corn and soybeans have increased in recent times because of wetter conditions, the demand for biofuels, and advances in breeding short-season, drought-tolerant crops. The data come from the National Agricultural Statistics Service (NASS) Census of Agriculture (COA) and have estimates for 1974, 1978, 1982, 1986, 1992, 1997, 2002, 2007, and 2012. Years with missing data were estimated using multivariate imputation...
This habitat model was developed to delineate suitable habitat for coastal cactus wren (Campylorhynchus brunneicapillus) in southern California. A primary purpose of the model is to identify potential restoration sites that may not currently support cactus patches required by wrens, but which are otherwise highly suitable. These are areas that could be planted with cactus to increase wren populations, an important management objective for many land managers. We used the Partitioned Mahalanobis D2 modeling technique to construct alternative models with different combinations of environmental variables. Variables were calculated at each point in the center of a 150 m x 150 m cell in a grid of points across the landscape....
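For readers unfamiliar with the underlying distance measure, the sketch below computes an ordinary squared Mahalanobis distance, D² = (x − μ)ᵀ S⁻¹ (x − μ), for two environmental variables. It is not the partitioned variant named in the abstract, which additionally splits D² along principal components; the function name, the two-variable restriction, and the sample values are assumptions made for illustration only.

```python
def mahalanobis_d2(point, samples):
    """Squared Mahalanobis distance of a 2-variable point from a sample cloud.

    `samples` stands in for environmental values at occupied locations
    (hypothetical data); smaller D2 means conditions closer to the mean
    of those locations.
    """
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples to estimate covariance")
    mean_x = sum(s[0] for s in samples) / n
    mean_y = sum(s[1] for s in samples) / n

    # Sample variances and covariance (n - 1 denominator).
    sxx = sum((s[0] - mean_x) ** 2 for s in samples) / (n - 1)
    syy = sum((s[1] - mean_y) ** 2 for s in samples) / (n - 1)
    sxy = sum((s[0] - mean_x) * (s[1] - mean_y) for s in samples) / (n - 1)

    det = sxx * syy - sxy ** 2
    if det == 0:
        raise ValueError("covariance matrix is singular")

    dx = point[0] - mean_x
    dy = point[1] - mean_y
    # d^T S^-1 d, written out with the closed-form 2x2 inverse.
    return (syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det


# Hypothetical values of two environmental variables at occupied sites.
occupied = [(0.0, 0.0), (2.0, 0.0), (0.0, 2.0), (2.0, 2.0)]
d2_at_mean = mahalanobis_d2((1.0, 1.0), occupied)
d2_away = mahalanobis_d2((3.0, 1.0), occupied)
print(d2_at_mean, d2_away)
```

In a habitat-suitability setting, a function like this would be evaluated at every grid point, with low D² values flagged as environmentally similar to known occupied sites.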
This habitat model was developed to delineate a sampling frame for regional monitoring of coastal California gnatcatchers (Polioptila californica californica) to determine: 1) percent area occupied (PAO) in high and very high suitability habitat across conserved lands and participating military lands in the U.S. range in southern California; 2) changes in PAO over time; and 3) extinction and colonization rates. One purpose of the model is to identify areas recovering from disturbance, such as wildfire, that may not currently support coastal sage scrub vegetation used by coastal California gnatcatchers, but are otherwise highly suitable. In this way, we can monitor gnatcatcher occupancy associated with habitat changes...
A machine learning streamflow (MLFLOW) model was developed in R (model is in the Rscripts folder) for modeling monthly streamflow from 2012 to 2017 in three watersheds on the Wyoming Range in the upper Green River basin. Geospatial information for 125 site features (vector data are in the Sites.shp file) and discrete streamflow observation data and environmental predictor data were used in fitting the MLFLOW model and predicting with the fitted model. Tabular calibration and validation data are in the Model_Fitting_Site_Data.csv file, totaling 971 discrete observations and predictions of monthly streamflow. Geospatial information for 17,518 stream grid cells (raster data are in the Streams.tif file) and environmental...
This habitat model was developed to identify suitable habitat for the federally endangered least Bell’s vireo (Vireo bellii pusillus) across its current and historic range in California. The vireo disappeared from most of its range by the 1980s, remaining only in small populations in southern California. Habitat protection and management since the mid-1980s have led to an increase in southern California vireo populations, with small numbers of birds recently expanding into the historic range. Predictions from this model will be used to focus surveys in the historic range to determine where vireos are recolonizing and to track the status and distribution of populations over time. We used the Partitioned Mahalanobis...


Mapped results: Wheat Corn Soy Estimates Red River of the North Basin; Data from an Analysis of Dissolved Organic Matter in the Upper Klamath River, Lost River, and Klamath Straits Drain, Oregon and California, 2013–16; 2010 County and City-Level Water-Use Data and Associated Explanatory Variables; Least Bell's Vireo Habitat Suitability Model for California (2019); Coastal Cactus Wren Habitat Suitability Model for Southern California (2015); Coastal California Gnatcatcher Habitat Suitability Model for Southern California (2015); Fluvial Fish Native Distributions for the Conterminous United States using the NHDPlusV2.1 and Boosted Regression Tree (BRT) Models; Input data, model output, and R scripts for a machine learning streamflow model on the Wyoming Range, Wyoming, 2012–17