Survival Model Validation Utilities • survC

survC provides lightweight utilities for validating survival models. The package focuses on Cox regression workflows, wrapping common validation tasks such as extracting linear predictors, computing Harrell’s concordance, and summarising time-dependent ROC curves. It also includes helpers for preparing cohort data and producing slide-based validation reports.

Installation

Install the development version of survC from GitHub with:

# install.packages("pak")
pak::pak("newjoseph/survC")

The examples below require the survival, timeROC, officer, and rvg packages. They are listed in Imports and will be pulled in automatically when survC is installed.

Key features

Convert fitted coxph models into linear predictors or risk scores with calc_risk_score() for downstream metrics.
Summarise time-dependent ROC curves across custom horizons via tdroc_calc() and inspect the underlying timeROC object.
Compute Harrell’s C-index (with a 95% confidence interval) for training or validation cohorts using cindex_calc().
Generate multi-slide PowerPoint reports that compare training and validation ROC curves at designated time points with validation_report(), specifying which columns contain survival times and event indicators.
Prepare Excel-based cohort extracts for downstream modelling with prepare_adpkd_dataset().

Quick start

The example below demonstrates a typical validation workflow using the survival::lung dataset.

library(survC)
library(survival)

set.seed(2024)
lung <- survival::lung
lung <- lung[complete.cases(lung[, c("time", "status", "age", "ph.ecog")]), ]

# Split into training and validation cohorts
split_ids <- sample(seq_len(nrow(lung)))
train_idx <- split_ids[1:110]
val_idx <- split_ids[111:200]

train_df <- lung[train_idx, ]
val_df <- lung[val_idx, ]

# Fit a simple Cox model on the training data
cox_fit <- survival::coxph(
  survival::Surv(time, status == 2) ~ age + ph.ecog,
  data = train_df,
  x = TRUE
)

# 1. Linear predictor / risk scores
train_lp <- calc_risk_score(cox_fit)
val_lp <- calc_risk_score(cox_fit, data = val_df)

# 2. Harrell's concordance on the validation cohort
c_index_val <- cindex_calc(cox_fit, newdata = val_df)
c_index_val
#> Cindex  Lower  Upper 
#>  0.561  0.481  0.641

# 3. Time-dependent ROC summary at selected horizons
horizons <- c(200, 400)
roc_tbl <- tdroc_calc(
  time = val_df$time,
  status = as.integer(val_df$status == 2),
  marker = val_lp,
  times = horizons
)
roc_tbl
#>       time       AUC
#> t=200  200 0.5981693
#> t=400  400 0.5054571

The returned AUC table mirrors timeROC output and carries the full ROC object as the roc_obj attribute for plotting or inspection:

roc_obj <- attr(roc_tbl, "roc_obj")
list(
  AUC = roc_obj$AUC,
  times = roc_obj$times
)
#> $AUC
#>     t=200     t=400 
#> 0.5981693 0.5054571 
#> 
#> $times
#> [1] 200 400

Validation report (PowerPoint)

Use validation_report() to export one slide per horizon with training and validation ROC curves side-by-side. Pass the column names that hold follow-up times and event indicators via time_col and status_col. The function relies on officer and rvg so plots remain editable.

validation_report(
  train_data = transform(train_df, time = time, status = as.integer(status == 2)),
  val_data = transform(val_df, time = time, status = as.integer(status == 2)),
  model = cox_fit,
  time_col = "time",
  status_col = "status",
  times = horizons,
  time_unit = "days",
  output = "validation_report.pptx"
)

Development

Re-knit the README after editing with:

devtools::build_readme()

This regenerates README.md so GitHub reflects the latest examples.