Overview
FIESTA’s Estimation Modules combine multiple functions from FIESTA or other packages to generate estimates across a user-defined population(s) using different estimation strategies. Each module has an associated mod*pop function for compiling the population data and calculations, including adjustments for nonresponse and standardizing auxiliary data. The output form the mod*pop functions are input directly into the mod* estimation modules.
All Estimation Modules require similar inputs, including a set of population data output from associated mod*pop functions and other parameters defining filters and row and/or column domains. Refer to the following vignettes for specific examples.
Module Parameters
The parameters for FIESTA modules are organized by different categories based on population data and resulting estimates.
- Plot/condition filters to subset table records
- Tree information for tree estimates - modGBtree()
- Tree information for ratio estimates - modGBratio()
- Output table information
- Output table specifications
- Output table title(s)
Estimates
Filters subset the area of the sample population for the desired output. Filters do no change the population data used for estimate (e.g., number of plots, strata weights). Some filters are defined internally, depending on the dataset, such as land area of interest (landarea) or inclusion of ACI (All Condition Inventory) data. Others are specified according to the input table, such as pfilter or cfilter. The nonsamp.filter is for removing nonsampled conditions and is applied, internally, unless otherwise stated. All filter expressions must be in R syntax.
- landarea - A predefined cond filter depending on desired sample land area {ALL, FOREST (COND_STATUS_CD=1), TIMBERLAND (SITECLCD in(1:6) & RESERVCD=0)}.
- pcfilter - Plot/Condition-level filter.
Tree information for tree estimates - modGBtree()
Tree information is used for calculating estimates for derived tree data, such as basal area (BA), volume (e.g., VOLCFNET), or number of trees (i.e., TPA_UNADJ). All variables except number of tree variables are extrapolated to the acre plot size by multiplying by TPA_*).
- estvar - Name of estimate variable in tree (e.g., VOLCFNET, TPA_UNADJ).
- estvar.filter - Tree-level filter (e.g., ‘STATUS_CD == 1 & DIA >= 5.0’).
- estvar.name - A name for aggregated estvar (Default = estvar’_SUM’).
Tree information for ratio estimates - modGBratio()
Tree information is used for calculating per-acre ratio estimates (numerator) and per-tree ratio estimates (numerator and denominator) from derived tree data, such as basal area (BA), volume (e.g., VOLCFNET), or number of trees (i.e., TPA_UNADJ). All variables except number of tree variables are extrapolated to the acre plot size by multiplying by TPA_*).
- ratiotype - If ratio estimate, the ratio type (“PERACRE”, “PERTREE”).
- estvar(n) - Numerator - Name of estimate variable (e.g., VOLCFNET, TPA_UNADJ)
- estvar(n).filter - Numerator - Tree-level filter(s).
- estvar(n).name - Numerator - A name for aggregated estvar (Default = estvar(n)’_SUM’).
- estvard - Denominator - Name of estimate variable (e.g., VOLCFNET, TPA_UNADJ)
- estvard.filter - Denominator - Tree-level filter(s).
- estvard.name - Denominator - A name for aggregated estvar (Default = estvard’_SUM’).
Table estimates
Estimates can be broken down by categorical domains for further detail about a population. These details are presented as rows or rows and columns in a matrix table, with a separate, associated sampling error for each cell in the table. If the area of interest (i.e., population) has more than one estimation unit (i.e., subpopulation), estimates may be presented by estimation unit or summed to the population.
- rowvar - Name of row variable (domain). If area estimate, rowvar must be from cond (e.g., FORTYPCD). If tree or ratio estimate, rowvar can be from cond or tree (e.g., SPCD).
- colvar - Name of column variable (domain). If area estimate, colvar must be from cond (e.g., FORTYPCD). If tree or ratio estimate, colvar can be from cond or tree (e.g, SPCD).
- sumunits - TRUE, sum estimates by estimation units (sub-populations) into one estimate (e.g., Counties to State).
Other
- returntitle - TRUE, return table titles (See title_opts for more title parameters)
- savedata - TRUE, save data to outfolder (See savedata_opts for more savedata parameters).
Output table options (table_opts)
The following parameters provide more detail about the presentation
and aesthetics of the final output table, including: using code names
instead of codes; applying specific ordering; adding categories not in
the data set with 0 values; displaying percent standard errors and
estimates together or separate; and rounding options. These parameters
are set by supplying a list to the table_opts
parameter.
The possible parameters that can be set within the
table_opts
parameter can be seen by running
help(table_options)
.
-
row.FIAname - TRUE, gets FIA reference name for
rowvar, if available (
FIESTA::getRefcodes(rowvar)
for availability). -
col.FIAname - TRUE, gets FIA reference name for
colvar, if available (
FIESTA::getRefcodes(rowvar)
for availability). - row.orderby - Name of variable to order values of rowvar (if row.FIAname=FALSE).
- col.orderby - Name of variable to order values of colvar (if col.FIAname=FALSE).
-
row.add0 - TRUE, add 0 value in table for row
values that occur in
FIESTA::getRefcodes(rowvar)
or in rowlut, but are not in final estimate. -
col.add0 - TRUE, add 0 value in table for column
values that occur in
FIESTA::getRefcodes(colvar)
or in collut, but are not in final estimate. - rowlut - A lookup table with variable codes and descriptions to include in rows of output table (See note following for more details).
- collut - A lookup table with specific variable codes and descriptions to include in columns of output table (See note following for more details).
- rawonly - If TRUE, only raw data are returned (this option is more efficient for estimates with many estimation units, rows, and/columns).
- raw.keep0 - If TRUE, keeps 0 values in raw data tables. If FALSE, removes 0 values for efficient output.
- allin1 - TRUE, table cells include: estimates (% sample error).
- metric - TRUE, output is returned in metric units.
- estround - Number of digits to round estimates to.
- pseround - Number of digits to round percent sample error (pse) to.
- estnull - Number or character to indicate ‘not sampled’ for estimate.
- psenull - Number or character to indicate ‘not sampled’ for percent sample error (pse).
- divideby - Conversion of output numbers (‘hundred’, ‘thousand’, ‘million’).
Note: rowlut/collut - There are several objectives for including rowlut/collut look-up tables: 1) to include descriptive names that match row/column codes in the input table; 2) to use number codes that match row/column names in the input table for ordering rows; 3) to add rows and/or columns with 0 values for consistency. No duplicate names are allowed.
Include 2 columns in the table: 1-the merging variable with same name as the variable in the input merge table; 2-the ordering or descriptive variable. If the ordering variable is the rowvar/colvar in the input table and the descriptive variable is in rowlut/collut, set row.orderby/col.orderby equal to rowvar/colvar. If the descriptive variable is the rowvar/colvar in the input table, and the ordering code variable is in rowlut/collut, set row.orderby/col.orderby equal to the variable name of the code variable in rowlut/collut.
If returntitle=TRUE, a default title is generated based on the
estimation parameters above. The following parameters allow user-defined
inputs for generating customized titles. These parameters are set by
supplying a list to the title_opts
parameter. The possible
parameters that can be set within the title_opts
parameter
can be seen by running help(title_options)
.
- title.main - Full title for table.
- title.ref - The ending text for table title. If not NULL, included with title.main.
- title.rowvar - Pretty name for rowvar for table title (If NULL, default = rowvar).
- title.colvar - Pretty name for colvar for table title (If NULL, default = colvar).
- title.unitvar - Pretty name for unitvar for table title (If NULL and sumunits=FALSE, default = unitvar).
- title.estvar - Pretty name for estvar for table title (If NULL, default = estvar). For ratio estimates, title.estvarn (default = estvarn) and title.estvard (default = estvard).
- title.filter - Pretty name for any filters for table title.
Saving data specifications (savedata_opts)
If savedata=TRUE, output is saved using default settings. The
following parameters specify different options for saving output. These
parameters are set by supplying a list to the savedata_opts
parameter. The possible parameters that can be set within the
savedata_opts
parameter can be seen by running
help(savedata_options)
.
- outfolder - Name of folder to output data to (Default = working directory or window to browse).
- out_fmt - Format of output data tables (‘sqlite’, ‘gpkg’, ‘csv’, ‘gdb’).
- out_dsn - TRUE, data source name of database to output data tables.
- outfn.pre - Add a prefix to output name (e.g., ’01_*’).
- outfn.date - Add date (YYYYMMDD) to output name (e.g., ’*_20200220’).
- addtitle - TRUE, add title to tables.
- raw_fmt - Format of raw data tables (‘sqlite’, ‘gpkg’, ‘csv’, ‘gdb’).
- raw_dsn - TRUE, data source name of database to output raw data tables.
- overwrite_dsn - TRUE, overwrites existing data source name database files (e.g., sqlite).
- overwrite_layer - TRUE, overwrites layer within a database or file in outfolder, if exists.
- append_layer - TRUE, appends output to existing layer in database or file in outfolder.
Output values from FIESTA
modules
Estimates with percent sampling error for the row domain (and column domain) specified by the input parameters. This can be in the form of one table or two separate tables, depending on the number of domains and on allin1 parameter.
A list object with one or more of the following components. If savedata=TRUE, all output data frames are written to outfolder.
- $est - Data frame with estimates by rowvar, colvar (and estimation unit). If sumunits=TRUE or one estimation unit and colvar=NULL, estimates and percent sampling error (confidence level 68%) are all in est. For 95% percent confidence level multiply percent sampling error by 1.96.
- $pse - Data frame with percent sampling errors corresponding to est.
- $raw - A list of raw data used in the estimation process.
- $titlelst - If returntitle=TRUE, a list with one or two titles for est and pse, depending on number of output data frames (see below for list components).
Raw data used for producing estimates (If rawdata=TRUE, raw$):
The raw data includes the domain-level data set used for estimation and separate data frames with calculated variables used in estimation process. The number of processing tables depends on the input parameters. The tables include:
- raw$domdat - Plot domain data used for estimation.
- raw$unit_totest - Total by estimation unit
- raw$unit_rowest - If rowvar != NULL, rowvar totals by estimation unit
- raw$unit_colvar - If colvar != NULL, colvar totals by estimation unit
- raw$unit_grpvar - If colvar != NULL, a combination of rowvar and colvar by estimation unit
- If sumunits=TRUE, the raw data for the summed estimation units are also included: (totest, rowest, colest, grpest, respectively). These tables do not included estimate proportions (nhat and nhat.var). See below for variable descriptions.
- raw$totest - Total by estimation unit, summed to population
- raw$rowest - If rowvar != NULL, rowvar totals by estimation unit, summed to population
- raw$colvar - If colvar != NULL, colvar totals by estimation unit, summed to population
- raw$grpvar - If colvar != NULL, a combination of rowvar and colvar by estimation unit, summed to population
output
Variable | Description |
---|---|
nhat | Estimated proportion |
nhat.var | Variance estimate of estimated proportion |
est | Estimated acres { nhat * ACRES } |
est.var | Variance estimate of estimated acres { nhat * ACRES^2 } |
output
Variable | Description |
---|---|
nhat | Estimated proportion of land, for numerator |
nhat.var | Variance estimate of estimated proportion of land, for numerator |
dhat | Estimated proportion of land, for denominator |
dhat.var | Variance estimate of estimated proportion of land, for denominator |
covar | Covariance of estimated proportion of numerator and denominator |
estn | Estimated acres, for numerator |
estd | Estimated acres, for denominator |
estn.var | Variance estimate of estimate acres, for numerator |
estn.se | Standard error estimated acres, for numerator |
estn.cv | Coefficient of variation of estimated acres, for numerator |
estn.pse | Percent sampling error of estimate, for numerator |
estd.var | Variance estimate of estimate acres, for denominator |
estd.se | Standard error estimated acres, for denominator |
estd.cv | Coefficient of variation of estimated acres, for denominator |
estd.pse | Percent sampling error of estimate, for denominator |
est.covar | Covariance of estimated acres of numerator and denominator |
rhat | Ratio of estimated proportions (numerator/denominator) |
rhat.var | Variance of ratio of estimated proportions |
rhat.se | Standard error of ratio of estimated proportions { rhat.se/rhat } |
rhat.cv | Coefficient of variation of ratio of estimated proportions { sqrt(rhat.var) } |
est | Estimated percent cover of land { rhat*100 } |
est.var | Variance of estimated percent cover of land { rhat.var*100^2 } |
output
Variable | Description |
---|---|
NBRPLT.gt0 | Number of nonzero plots used in estimate |
ACRES | Total acres for estimation unit |
est.se | Standard error of estimated acres { sqrt(est.var) } |
est.cv | Coefficient of variation of estimated acres { est.se/est } |
pse | Percent sampling error of estimate { est.cv * 100 } |
CI99left | Left tail of 99% confidence interval for estimate { est - (2.58*est.se) } |
CI99right | Right tail of 99% confidence interval for estimate { est + (2.58*est.se) } |
CI95left | Left tail of 95% confidence interval for estimate { est - (1.96*est.se) } |
CI95right | Right tail of 95% confidence interval for estimate { est + (1.96*est.se) } |
CI68left | Left tail of 68% confidence interval for estimate { est - (0.97*est.se) } |
CI68right | Right tail of 68% confidence interval for estimate { est + (0.97*est.se) } |
Reference Tables in FIESTA
There are several reference tables stored in FIESTA
,
including table variable descriptions, code definitions, and estimation
categories.
- Table variable descriptions - ref_plt, ref_cond, ref_tree
- Code definitions - ref_codes
- Domain variables - ref_domain
- Estimation variables - ref_estvar
getdomain() getestvar()