The modelsummary_rms function is designed to process output from models fitted using the rms package and generate a summarised dataframe of the results. The goal is to produce publication-ready summaries of the models.
This vignette will guide you through the basic usage of the function and then move on to more advanced examples.
Make sure you have the required packages installed from CRAN or GitHub. Note, if you plan to output the results into Microsoft Word, we recommend also installing flextable and officer.
Here is a simple example using a linear regression model (“ordinary least squares”; OLS). The example data being used here is the built-in survey dataset from the MASS package. The models are for demonstration purposes only.
The output dataframe contains the estimated coefficients, their 95% confidence intervals, and the associated p-values. These are in a publication ready format.
# Loading the built-in dataset from the MASS package:
data("survey", package = "MASS")
# Fit a linear regression model using the rms package:
fit_ols <- ols(Wr.Hnd ~ Age + Exer + Sex, data = survey)
# Generate a model summary, and assign it to rmsMD_summary
rmsMD_summary <- modelsummary_rms(fit_ols)
# displaying rmsMD dataframe output
rmsMD_summary##    variable                coef_95CI Pvalue
## 1       Age  0.011 (-0.020 to 0.042)  0.474
## 2 Exer=None -0.112 (-0.798 to 0.574)  0.749
## 3 Exer=Some -0.021 (-0.447 to 0.406)  0.924
## 4  Sex=Male   2.146 (1.743 to 2.550) <0.001| variable | coef_95CI | Pvalue | 
|---|---|---|
| Age | 0.011 (-0.020 to 0.042) | 0.474 | 
| Exer=None | -0.112 (-0.798 to 0.574) | 0.749 | 
| Exer=Some | -0.021 (-0.447 to 0.406) | 0.924 | 
| Sex=Male | 2.146 (1.743 to 2.550) | <0.001 | 
By default, the function uses the following stylistic settings:
You can modify these defaults to adjust the appearance of the output.
# Generate a model summary with custom styling options
summary_custom <- modelsummary_rms(fit_ols, 
                                   combine_ci = FALSE, 
                                   round_dp_coef = 2, 
                                   round_dp_p = 5)
# to display the dataframe as a table
knitr::kable(summary_custom)| variable | coef | coef_lower95 | coef_upper95 | Pvalue | 
|---|---|---|---|---|
| Age | 0.01 | -0.02 | 0.04 | 0.47400 | 
| Exer=None | -0.11 | -0.80 | 0.57 | 0.74892 | 
| Exer=Some | -0.02 | -0.45 | 0.41 | 0.92386 | 
| Sex=Male | 2.15 | 1.74 | 2.55 | <0.00001 | 
By default, modelsummary_rms returns only the final
formatted summary (i.e. fullmodel = FALSE). This does not
include the model intercept, or information such as standard errors.
This output is made to be concise and show the key results.
If all information is required, you can set
fullmodel = TRUE.
This option returns additional results.
# Generate a model summary with custom styling options
summary_fullmodel <- modelsummary_rms(fit_ols, 
                                   combine_ci = FALSE, 
                                   round_dp_coef = 2, 
                                   round_dp_p = 5,
                                   fullmodel = TRUE)
knitr::kable(summary_fullmodel)| variable | coef | SE | p_values_raw | coef_lower95 | coef_upper95 | Pvalue | 
|---|---|---|---|---|---|---|
| Intercept | 17.39 | 0.3721170 | 0.0000000 | 16.66 | 18.12 | <0.00001 | 
| Age | 0.01 | 0.0157258 | 0.4740033 | -0.02 | 0.04 | 0.47400 | 
| Exer=None | -0.11 | 0.3498069 | 0.7489214 | -0.80 | 0.57 | 0.74892 | 
| Exer=Some | -0.02 | 0.2176646 | 0.9238556 | -0.45 | 0.41 | 0.92386 | 
| Sex=Male | 2.15 | 0.2056453 | 0.0000000 | 1.74 | 2.55 | <0.00001 | 
Exponentiating the coefficients of certain models makes the
interpretation more intuitive (e.g. as odds ratios in logistic
regression and hazard ratios in Cox models). This is controlled using
the exp_coef argument.
The modelsummary_rms package automatically sets an
appropriate value for exp_coef for the core
rms models ols, lrm, and
cph. This ensures OR and HR are displayed for logistic
regression and Cox regression models respectively.Below is an example
using modelsummary_rms on an rms
logistic regression model. Note this automatically provides OR:
# Note: For demonstration, we create a binary outcome using the survey dataset.
survey$BinaryOutcome <- ifelse(survey$Wr.Hnd > median(survey$Wr.Hnd, na.rm = TRUE), 1, 0)
# fitting the model
fit_lrm <- lrm(BinaryOutcome ~ Age + Exer + Sex, data = survey)
# rmsMD summary
summary_lrm <- modelsummary_rms(fit_lrm)
# displaying as a table
knitr::kable(summary_lrm)| variable | OR_95CI | Pvalue | 
|---|---|---|
| Age | 1.012 (0.965 to 1.060) | 0.626 | 
| Exer=None | 0.776 (0.279 to 2.160) | 0.627 | 
| Exer=Some | 0.805 (0.424 to 1.528) | 0.507 | 
| Sex=Male | 9.784 (5.323 to 17.982) | <0.001 | 
The modelsummary_rms from rmsMD package is also capable of working with non-rms models, such as those fitted using base R functions like lm(). However, in these cases the package does not automatically determine the appropriate value for exp_coef, so it must be set manually.
For example, when using a linear model (where exponentiation of coefficients is not required), you should explicitly set exp_coef = FALSE.
# Fit a simple linear model using lm() from base R (an example model fit without using rms package)
fit_lm <- lm(Wr.Hnd ~ Age + Exer + Sex, data = survey)
# Generate a model summary for the non-RMS model by explicitly setting exp_coef = FALSE
summary_lm <- modelsummary_rms(fit_lm, 
                               exp_coef = FALSE)
# display rmsMD results as a table
knitr::kable(summary_lm)| variable | coef_95CI | Pvalue | 
|---|---|---|
| (Intercept) | 17.387 (16.657 to 18.116) | <0.001 | 
| Age | 0.011 (-0.020 to 0.042) | 0.474 | 
| ExerNone | -0.112 (-0.798 to 0.574) | 0.749 | 
| ExerSome | -0.021 (-0.447 to 0.406) | 0.924 | 
| SexMale | 2.146 (1.743 to 2.550) | <0.001 | 
Restricted Cubic Splines (RCS) are a flexible modelling tool used to capture non-linear relationships between predictors and outcomes. In medicine, for the majority of continuous variables (e.g. age, blood pressure, or biomarker levels) the assumption of linearity may not hold. A key highlight of the rms package is the ability to analyse variables using RCS.
The rmsMD package is designed to report and summarise models that include RCS terms. Individual coefficients for RCS terms are difficult to interpret in isolation. Instead, an overall p-value can be generated to assess whether the overall relationship between the RCS variable and outcome is significant. By default modelsummary_rms removes the individual RCS coefficients, replacing them with the overall p-value for that variable.
Display an overall p-value for the spline terms using the
rcs_overallp option.
When this option is set to TRUE (which is the default), the
function computes a single p-value that tests the overall significance
of the spline terms for each variable. This overall p-value provides
insight into whether the relationship between the predictor and the
dependent variable is significant.
Hide the individual spline coefficients using the
hide_rcs_coef option.
Hiding the individual spline coefficients can be advantageous because
these lack straightforward clinical interpretation. Instead, the focus
is on the overall association captured by all RCS terms for that
specific variable. This helps simplify the output. If the variable has a
signficant association with outcome, we recommend plotting this
relationship.
Example of a model with RCS terms for the continuous outcome Age. The default settings are applied, which hides the individual RCS terms, and provides an overall p-value for the association of Age with outcome.
# Using the built-in dataset from the MASS package
data("survey", package = "MASS")
# Fit an OLS model including a restricted cubic spline for Age (with 4 knots)
fit_spline <- ols(Wr.Hnd ~ rcs(Age, 4) + Exer + Sex, data = survey)
# Generate an rmsMD model summary using default settings
summary_spline <- modelsummary_rms(fit_spline)
# Outputting this as a table
knitr::kable(summary_spline)| variable | coef_95CI | Pvalue | 
|---|---|---|
| Exer=None | -0.152 (-0.841 to 0.537) | 0.665 | 
| Exer=Some | -0.032 (-0.461 to 0.396) | 0.882 | 
| Sex=Male | 2.094 (1.682 to 2.506) | <0.001 | 
| RCSoverallP: Age | RCS terms | 0.545 | 
If individual RCS coefficients are required, these can be added in by
setting hide_rcs_coef to FALSE:
# Fit an OLS model including a restricted cubic spline for Age (with 4 knots)
fit_spline_hide <- ols(Wr.Hnd ~ rcs(Age, 4) + Exer + Sex, data = survey)
# Generate a model summary with rcs_overallp set to TRUE and hide_rcs_coef set to TRUE
summary_spline_hide <- modelsummary_rms(fit_spline_hide, 
                                        hide_rcs_coef = FALSE)
# Outputting this as a table
knitr::kable(summary_spline_hide)| variable | coef_95CI | Pvalue | 
|---|---|---|
| Age | 0.455 (-0.232 to 1.142) | 0.194 | 
| Age’ | -14.850 (-38.835 to 9.136) | 0.225 | 
| Age’’ | 26.771 (-16.692 to 70.233) | 0.227 | 
| Exer=None | -0.152 (-0.841 to 0.537) | 0.665 | 
| Exer=Some | -0.032 (-0.461 to 0.396) | 0.882 | 
| Sex=Male | 2.094 (1.682 to 2.506) | <0.001 | 
| RCSoverallP: Age | RCS terms | 0.545 | 
If overall p-values for the variables modelled with RCS are not
wanted, rcs_overallp can be set to FALSE:
# Fit an OLS model including a restricted cubic spline for Age (with 4 knots)
fit_spline_hide <- ols(Wr.Hnd ~ rcs(Age, 4) + Exer + Sex, data = survey)
# Generate a model summary with rcs_overallp set to TRUE and hide_rcs_coef set to TRUE
summary_spline_hide <- modelsummary_rms(fit_spline_hide, 
                                        rcs_overallp = FALSE, 
                                        hide_rcs_coef = FALSE)
knitr::kable(summary_spline_hide)| variable | coef_95CI | Pvalue | 
|---|---|---|
| Age | 0.455 (-0.232 to 1.142) | 0.194 | 
| Age’ | -14.850 (-38.835 to 9.136) | 0.225 | 
| Age’’ | 26.771 (-16.692 to 70.233) | 0.227 | 
| Exer=None | -0.152 (-0.841 to 0.537) | 0.665 | 
| Exer=Some | -0.032 (-0.461 to 0.396) | 0.882 | 
| Sex=Male | 2.094 (1.682 to 2.506) | <0.001 | 
In medical research, interactions can be critical, as the impact of a treatment or risk factor might differ across subgroups (for example, by age or sex). These interaction terms are handled by modelsummary_rms.
Here is a simple example:
# Using the built-in dataset from the MASS package
data("survey", package = "MASS")
# Fit an OLS model using the rms package with an interaction between Age and Exer.
fit_interact <- ols(Wr.Hnd ~ Age * Exer + Sex, data = survey)
# Generate a model summary that includes the interaction term
summary_interact <- modelsummary_rms(fit_interact)
knitr::kable(summary_interact)| variable | coef_95CI | Pvalue | 
|---|---|---|
| Age | 0.025 (-0.021 to 0.071) | 0.291 | 
| Exer=None | -0.078 (-2.341 to 2.185) | 0.946 | 
| Exer=Some | 0.598 (-0.795 to 1.992) | 0.400 | 
| Sex=Male | 2.142 (1.737 to 2.546) | <0.001 | 
| Age * Exer=None | -0.002 (-0.104 to 0.099) | 0.964 | 
| Age * Exer=Some | -0.031 (-0.097 to 0.035) | 0.360 | 
The rms package allows interactions with variables
modelled using restricted cubic splines. In this setting, the individual
coefficients for RCS terms and their interactions are difficult to
interpret. modelsummary_rms handles this situation by
providing overall p-values for RCS variables (which give the overall
p-value taking into account all spline terms and all of their
interaction terms), and overall p-values for the interactions (takes
into account linear and non-linear terms), instead of the individual
coefficients. As above, this can be altered by changing
rcs_overallp and hide_rcs_coef.
# Using the built-in dataset from the MASS package
data("survey", package = "MASS")
# Fit an OLS model with a restricted cubic spline for Age and an interaction between Age and Exer.
fit_spline_interact <- ols(Wr.Hnd ~ rcs(Age, 4) * Exer + Sex, data = survey)
# Generate a model summary with default RCS output
summary_spline_interact <- modelsummary_rms(fit_spline_interact)
# Format the output as a nice table
knitr::kable(summary_spline_interact)| variable | coef_95CI | Pvalue | 
|---|---|---|
| Exer=None | -3.885 (-50.286 to 42.515) | 0.870 | 
| Exer=Some | 23.159 (-1.786 to 48.104) | 0.069 | 
| Sex=Male | 2.120 (1.705 to 2.536) | <0.001 | 
| RCSoverallP: Age | RCS terms | 0.352 | 
| RCSoverallP: Age * Exer | RCS terms | 0.251 | 
The output of modelsummary_rms is a dataframe, as this is easy to work with and further process if required. This dataframe output can easily be exported to a word document using flextable and officer packages.
library(officer)
library(flextable)
library(dplyr)
# converting modelsummary_rms dataframe generated above into a flextable
rmsMD_as_table <- flextable(rmsMD_summary)
# use officer to create a 
doc <- read_docx() %>% 
  body_add_flextable(rmsMD_as_table) %>%
  body_add_par("Model summary from rmsMD", style = "heading 2")
# generating a temporary output path for demonstration. This would be replaced by the file path where the word document will be generated
output_path <- file.path(tempdir(), "example_output.docx")
# generating the word document
print(doc, target = output_path)
print(doc, target = "temp.docx")