Assess whether recovery-simulation results are ready to use
Source:R/api-simulation.R
assess_mfrm_recovery.RdConverts the numerical output from evaluate_mfrm_recovery() into a
reviewer-facing adequacy checklist. The goal is not to impose one universal
pass/fail rule; it is to make the main user questions explicit: Did the runs
finish? Did the fitted models converge? Are uncertainty summaries available?
Are coverage and Monte Carlo precision plausible? If practical RMSE or bias
limits are supplied, which parameter groups need follow-up? For bounded
GPCM, which slope-regime generator condition frames the recovery evidence?
Usage
assess_mfrm_recovery(
x,
min_reps = 30,
min_success_rate = 0.95,
min_convergence_rate = 0.95,
min_se_available = 0.8,
coverage_target = 0.95,
coverage_tolerance = 0.05,
max_mcse_rmse_ratio = 0.25,
max_rmse = NULL,
max_abs_bias = NULL,
top_n = 6,
digits = 3,
...
)
# S3 method for class 'mfrm_recovery_assessment'
plot(
x,
y = NULL,
type = c("status", "metrics"),
metric = c("rmse", "bias", "coverage", "se_available", "mcse_rmse"),
draw = TRUE,
...
)Arguments
- x
For
assess_mfrm_recovery(), output fromevaluate_mfrm_recovery(). Forplot.mfrm_recovery_assessment(), output fromassess_mfrm_recovery().- min_reps
Minimum replication count expected before treating the simulation as more than a smoke check.
- min_success_rate
Minimum acceptable proportion of replications that generated data and produced a fitted model.
- min_convergence_rate
Minimum acceptable proportion of replications whose fitted model reported convergence.
- min_se_available
Minimum acceptable proportion of recovery rows with standard errors in each parameter group. Set to
NULLto skip this check.- coverage_target
Nominal coverage target, usually
0.95.- coverage_tolerance
Absolute tolerance around
coverage_target.- max_mcse_rmse_ratio
Maximum acceptable Monte Carlo SE of RMSE divided by RMSE. Set to
NULLto skip this precision check.- max_rmse
Optional practical RMSE limit. Use a scalar for all parameter groups or a named vector/list with names such as
"facet","step","slope","Rater", or"facet:Rater:logit".- max_abs_bias
Optional practical absolute-bias limit. Naming follows
max_rmse.- top_n
Number of next-action lines retained in the compact output.
- digits
Digits used by the print method.
- ...
Reserved for future extensions.
- y
Reserved for S3 generic compatibility.
- type
Assessment plot route.
"status"summarizes checklist status counts;"metrics"plots a parameter-group assessment metric colored by its status.- metric
Metric used when
type = "metrics". Supported values are"rmse","bias","coverage","se_available", and"mcse_rmse".- draw
If
TRUE, draw with base graphics. IfFALSE, return anmfrm_plot_dataobject with reusable plot tables and metadata.
Value
An object of class mfrm_recovery_assessment with:
overview: compact run-level status.checklist: reviewer-facing adequacy checks.condition_review: generator-condition metadata, including boundedGPCMslope-regime interpretation and generated score-category support when available.condition_reporting_notes: reporter-facing generator-condition caveats separated from recovery metrics and release-gate decisions.diagnostic_reporting_notes: reporter-facing fit/separation caveats retained as diagnostic context rather than recovery gates.diagnostic_review: optional fit/separation operating-characteristic context when retained byevaluate_mfrm_recovery().metric_review: parameter-group metric checks.uncertainty_review: compact coverage / SE availability interpretation.reading_order: recommended first-read order for the summary, condition, plot, and row-level recovery outputs.next_actions: short action list sorted by severity.thresholds: thresholds used for the assessment.
Details
RMSE and bias adequacy depends on the substantive scale and the use case, so
the function does not mark them as failed unless the user supplies
max_rmse or max_abs_bias. Without those limits, the corresponding rows are
marked not_assessed and the next action asks the user to set practical
thresholds when a decision depends on the metric.
The condition_review table is generator metadata for interpreting the
recovery run. For bounded GPCM, GPCMSlopeRegime, StressLevel, and
generated score-category support describe the data-generating condition;
they are not model-fit tests and they are not literature-derived adequacy
cut points. condition_reporting_notes turns those generator conditions
into reporter-facing caveats, such as high-dispersion slope stress or sparse
generated score support.
The optional diagnostic_review table is available when
evaluate_mfrm_recovery() was called with include_diagnostics = TRUE.
It summarizes fit and separation operating characteristics as diagnostic
context only. Its availability fields do not mean that fit or separation
values are adequate, and those rows do not enter the recovery adequacy
status. diagnostic_reporting_notes should be read first when drafting
fit/separation language because it separates zero separation/reliability,
absolute fit-ZSTD flags, and df-sensitive ZSTD flags from recovery gates.
plot.mfrm_recovery_assessment() is a user-facing review aid. Use
type = "status" first to see where checklist attention is needed, then
type = "metrics" to inspect the parameter groups behind RMSE, bias,
coverage, standard-error availability, or Monte Carlo precision statuses.
The intended reading order is summary(recovery_review), then
condition_reporting_notes and condition_review, then
diagnostic_reporting_notes and diagnostic_review, then the status plot,
then the metric plot, then the row-level recovery table for the parameter
groups that need follow-up. When draw = FALSE, the plot data also include
reading_order, guidance, condition/diagnostic handoff tables, and
user-facing plot tables such as section_status for status plots and
metric_review for metric plots.
Examples
if (FALSE) { # \dontrun{
rec <- evaluate_mfrm_recovery(
n_person = 12,
n_rater = 2,
n_criterion = 2,
reps = 1,
maxit = 30,
seed = 123
)
assess_mfrm_recovery(rec, min_reps = 1, max_rmse = 1)
# Read the bounded-GPCM generator condition separately from recovery adequacy.
gpcm_spec <- build_mfrm_sim_spec(
n_person = 14,
n_rater = 2,
n_criterion = 2,
raters_per_person = 2,
model = "GPCM",
step_facet = "Criterion",
slope_facet = "Criterion",
slopes = c(0.85, 1.15),
assignment = "crossed"
)
gpcm_rec <- suppressWarnings(evaluate_mfrm_recovery(
sim_spec = gpcm_spec,
reps = 1,
fit_method = "MML",
quad_points = 5,
maxit = 12,
include_diagnostics = TRUE,
include_person = FALSE,
seed = 456
))
gpcm_review <- assess_mfrm_recovery(
gpcm_rec,
min_reps = 1,
max_rmse = c(slope = 2),
max_abs_bias = c(slope = 1),
min_se_available = NULL,
max_mcse_rmse_ratio = NULL
)
gpcm_review$condition_reporting_notes[, c(
"ConditionArea", "ReportingAttention", "ConditionFinding"
)]
gpcm_review$condition_review[, c(
"Model", "GPCMSlopeRegime", "StressLevel", "ScoreSupportStatus"
)]
gpcm_review$diagnostic_reporting_notes[, c(
"Facet", "ReportingAttention", "DiagnosticFinding"
)]
summary(gpcm_review)$reading_order
} # }