KScens.Rd
Function KScens
computes the Kolmogorov-Smirnov statistic and p-value for
right-censored data against eight possible predefined or user-specified distributions
using either bootstrapping or a modified test. This function also accounts for
complete data.
# Default S3 method
KScens(times, cens = rep(1, length(times)),
distr = c("exponential", "gumbel", "weibull", "normal",
"lognormal", "logistic", "loglogistic", "beta"),
betaLimits = c(0, 1), igumb = c(10, 10), BS = 999,
params0 = list(shape = NULL, shape2 = NULL, location = NULL,
scale = NULL, theta = NULL),
tol = 1e-04, boot = TRUE, start = NULL, ...)
# S3 method for class 'formula'
KScens(formula, data, ...)
Numeric vector of times until the event of interest.
Status indicator (1, exact time; 0, right-censored time). If not provided, all times are assumed to be exact.
A string specifying the name of the distribution to be studied.
The possible distributions are the exponential ("exponential"
),
the Weibull ("weibull"
), the Gumbel ("gumbel"
),
the normal ("normal"
), the lognormal ("lognormal"
),
the logistic ("logistic"
), the loglogistic ("loglogistic"
),
and the beta ("beta"
) distribution. In addition, if the
character string used is "name"
, every distribution for which the
corresponiding density (dname
), probability (pname
)
and random generator (rname
) functions are defined, can be used.
Two-components vector with the lower and upper bounds of the Beta distribution. This argument is only required, if the beta distribution is considered.
Two-components vector with the initial values for the estimation of the Gumbel distribution parameters.
Number of bootstrap samples.
List specifying the parameters of the theoretical distribution.
By default, parameters are set to NULL
and estimated with
the maximum likelihood method. This argument is only considered,
if all parameters of the studied distribution are specified.
Precision of survival times.
A formula with a numeric vector as response (which assumes no censoring) or Surv
object.
Data frame for variables in formula
.
Logical to indicate if the p-value is computed using bootstrapping or using the
the modified Kolmogorov-Smirnov test (see details). Default is TRUE
.
A named list giving the initial values of parameters of the named distribution or a function of data computing initial values and returning a named list. This argument may be omitted (default) for the eight prespecified distributions. See more details in mledist.
Additional arguments for the boot
function of the
boot package.
By default the p-value is computed via bootstrapping methods.
The parameter estimation is acomplished with the fitdistcens
function of the fitdistrplus package.
To avoid long computation times due to bootstrapping, an alternative
with complete data is the function ks.test
of the stats package.
The precision of the survival times is important mainly in the data generation step of the bootstrap samples.
If boot = FALSE
a modified test is used to compute the p-value.
Fleming et al. (1980) proposed a modified Kolmogorov-Smirnov test to use
with right-censored data. This function reproduces this test for a
given survival data and a theorical distribution. The approximation for
the p-value is acceptable when it is smaller than 0.8 and excellent when
it is smaller than 0.2. The output of the function follows the notation
of Fleming et al. (1980).
In presence of ties, different authors provide slightly different definitions of \(\widehat{F}_n(t)\), with which other values of the test statistic might be obtained.
KScens
returns an object of class "KScens"
.
An object of class "KScens"
is a list containing the following components:
Distribution
Null distribution.
Hypothesis
Parameters under the null hypothesis (if params0
is provided).
Test
Vector containing the value of the modified Kolmogorov-Smirnov statistic (A
), the
estimated p-value (p-value
), the estimation of the image of the last recorded time (F(y_m)
) and
the last recorded time (y_m
).
Estimates
Vector with the maximum likelihood estimates of the parameters of the distribution under study.
StdErrors
Vector containing the estimated standard errors.
aic
The Akaike information criterion.
bic
The so-called BIC or SBC (Schwarz Bayesian criterion).
BS
The number of bootstrap samples used. If the modified test is used, a 0 is returned.
T. R. Fleming et al. Modified Kolmogorov-Smirnov test procedure with application to arbitrarily right-censored data. In: Biometrics 36 (1980), 607-625. URL: https://doi.org/10.2307/2556114
# Complete data with bootstrapping
set.seed(123)
KScens(times = rweibull(100, 12, scale = 4), distr = "weibull", BS = 99)
#> Null hypothesis: the data follows a weibull distribution
#>
#> KS Test results:
#> A p-value
#> 0.528 0.670
#>
# Censored data with bootstrapping
KScens(Surv(time, status) ~ 1, colon, distr = "norm", BS = 99)
#> Null hypothesis: the data follows a norm distribution
#>
#> KS Test results:
#> A p-value
#> 8.707 0.010
#>
# Censored data using the modified test
KScens(Surv(time, status) ~ 1, colon, distr = "norm", boot = FALSE)
#> Null hypothesis: the data follows a norm distribution
#>
#> KS Test results:
#> A p-value
#> 8.707 0.000
#>
data(nba)
summary(KScens(Surv(survtime, cens) ~ 1, nba, "logis", boot = FALSE), degs = 2)
#> Distribution: logis
#>
#> KS Test results:
#> A p-value
#> 2.099 0.000
#>
#> Parameter estimates (se):
#> theta1 theta2
#> 53.077 (0.457) 9.013 (0.218)
#>
#> AIC: 8460.297
#> BIC: 8472.866
#>
KScens(Surv(survtime, cens) ~ 1, nba, "beta", betaLimits = c(0, 80),
boot = FALSE)
#> Null hypothesis: the data follows a beta distribution
#>
#> KS Test results:
#> A p-value
#> 3.739 0.000
#>