analyses
The analyses
argument is the name of the file specifying the analyses to
perform. See section ANALYSES for details.
Simrisc was originally designed around 2010 by Marcel Greuter at the University Medical Center Groningen, and thereafter modified in 2015 by Chris de Jonge.
Short options are provided between parentheses, immediately following
their long option equivalents. Several parameters specify the path-names of
files produced by simrisc. If a path-name starts with a tilde character (~) then
the tilde is replaced by the user's home directory. An initial + is replaced
by the program's base directory (see option base
). When an analysis uses
multiple iterations then `$' characters in filename specifications are
replaced by the analysis' interation index.
All single-letter options referring to filesystem entries (directories, filenames) are capitalized, all other single-letter options are lowercase.
basedir
(-B)./
. If basedir
doesn't exist it is created by the program. If
the directory cannot be created and exception is thrown, terminating
the program. The basedir
specifications may specify relative or
absolute directory locations;
path
(-C)~/.config/simrisc'
is used.
path
(-D)!
(mnemonic: the logical not operator, i.e.,
--data !
). See section OUTPUT for a description of the
generated data;
age
(-a)tumor-age
, and is mutually
exclusive with the case
option;
nCases
(-l)nCases
cases have been analyzed and only
write the data for the final case to the data file. The rounds
and
sensitivity
files contain the summarized results of all nCases
analyzed cases;
label
specifications. See section ANALYSES for details;
path
(-P)
path
(-R)!
(i.e., --rounds !
). See section
OUTPUT for a description of the generated summary info;
path
(-S)!
(i.e.,
--sensitivity !
). See section OUTPUT for a description of the
produced sensitivity summary;
age
(-t)death-age
, and is mutually
exclusive with the case
option;
Unless the one-analysis
option is used the program's first and only
required argument is the name of a file providing the details of the analyses
to perform. Those files are called analysis files and must be standard ascii
text files. I.e., they only contain 7-bit ascii printable and white-space
characters. The identifiers used in analysis files and in configuration files
are interpreted case sensitively.
Parameter specifications starting with uppercase letters (like Analysis:
and Scenario:
) specify (sub)sections and contain no additional
specifications. Specifications starting with lowercase letters (like
ageGroup:
) are followed by actual parameter values. For a complete
overview refer to the simriscparams(7) man-page.
Analysis files may define multiple analyses. Each analysis specification must begin with a line containing
Analysis:At each
Analysis:
specification the program's initial configuration is
reset: the default option values are used unless redefined by explicitly
provided command-line options. Explicitly provided command-line options
cannot be altered by specifications in Analysis:
sections and remain
active while simrisc is running.
Following Analysis:
lines the characteristics of the analysis are
specified. These specifications may, in the following order:
Analysis:
lines). The text following label:
lines is
written at the top of the output files;
All specifications in Analysis:
sections are optional. An Analysis:
specification merely containing the line Analysis:
defines an analysis
using the explicitly specified command-line options or the default program
options and using the parameter specifications provided in the configuration
file.
Empty lines, trailing white-space, and all characters on lines starting at the
hash-mark (#
) are ignored and may be used anywhere in analysis files.
Lines not conforming to the above description result in error messages, causing simrisc to end.
When parameters of configuration file sections (cf. simriscparams(7)) are
omitted from Analysis:
sections then the parameters as specified in the
configuration file are used.
When program options are specified their long option names must be used. E.g.:
base: /tmp/ last-case: 20
Command-line options always overrule options specified in analysis files.
Multiple analysis sections should not use identically named output files, as the output files are (re)written for each separate analysis.
Analysis sections are commonly used to alter the default specifications of the configuration file. E.g., the default number of iterations equals 1. By specifying
iterations: 3the analysis performs 3 iterations.
Parameters are either read from the configuration file or they are redefined
in Analysis:
sections. E.g., in de provided configuration file screening
rounds use two-year intervals between the ages of 50 and 74. To use screening
rounds using 5-year intervals, between ages 50 and 65, then an
Analysis:
specification could be, e.g.,
Screening: round: 50 Mammo MRI round: 55 Mammo MRI round: 60 Mammo MRI round: 65 Mammo MRI
When the --one-analysis
option is used parameters may be altered by
providing comma-separated parameter specifications as program command-line
arguments. E.g., to perform one analysis, writing the data file to
/tmp/data
, simulating 1000 cases, and using 20 as seed for the random
number generator the command
simrisc -D /tmp/data -o Scenario:, cases: 1000, seed: 20can be used. Note that when using the
one-analysis
option parameter
section names must precede parameter specifications. E.g., since the
parameters cases
and seed
are defined in the `Scenario' section
(cf. simriscparams(7)) they must be preceded by the Scenario:
specification.
Subsequent Analysis:
specifications in analysis files use the program
options as specified on the command line (or, if not specified, the default
program options) and uses the configuration file's parameter
specifications. So when an Analysis:
specification modifies parameters,
then subsequent Analysis:
sections start from the unmodified option and
parameter specifications.
When Analysis:
sections specify base:
directory locations then file
specifications using an initial `+
' character to represent the base
directory (like --data +/data.txt
) use the base directory specified at
the Analysis:
's base:
specification.
The first lines of the generated files contain time stamps showing the date and time when the files were written. Here is an example of the time stamp, following the RFC 2822 format:
Wed, 27 Jan 2021 09:42:49 +0100If
label:
lines are used then the time stamp is followed by the label
specifications, which is then followed by an empty line. After this header the
file's specific data are shown.
The data in all files (except for the file listing the actually used
parameters (option --parameters
(P
))) are written using the standard
comma-separated format (cf. RFC 4180). The initial lines contain table
headings and column labels documenting the meanings of the various
columns. Likewise there is a final line ending the tables.
Data of simulated cases
For each simulated case the values of the following variables are written to file (one line of comma-separated values per simulated case):
case:
the (0-based) case-index;
cause of death:
either Natural
or Tumor
;
death age:
the case's age of death;
natural death age:
the case's natural age of death (if no tumor
occurs);
death status:
a numeric index specifying how and at what stage the
case died:tumor present:
Yes
if the simulation resulted in a tumor, No
if no tumor occurred;
tumor detected:
Yes
if the tumor was detected, No
if not;
interval tumor:
Yes
if the tumor was an interval tumor, No
if
not;
tumor diameter:
the tumor's diameter in mm when it was detected. 0.00
is shown if no tumor occurred. In the exceptional case where the
simulation produced a tumor whose diameter exceeded 1000 mm the value
1001 is shown.
tumor doubling days:
the time (in days) it takes for the tumor to
double its size;
tumor preclinical period:
the age at which the tumor is potentially
detectible by Mammographic screening;
tumor onset age:
the age at which the tumor first occurred;
tumor self-detect age:
the age at which the tumor was
self-detected. This age is the result of the simulation, and may
exceed the case's actual death age (if so, the case's data report that
no tumor is present);
tumor death age:
the age at which the tumor caused or would have
caused he case's death. The simulation process uses ages ranging from
0 through 100. If the age at which the tumor causes the case's death
exceeds 100, then 100.00 is reported;
costs:
the case's screening and (if appliccable) treatment costs;
self-detection indicator
: 1 if the tumor was self-detected, 0 if not
(also if there's no tumor);
detection round
: 0-based round index at which the tumor was detected
(or 1) if the tumor was self-detected, 0 if not (also if there's no
tumor).
~/.config/simrisc
: the default location of the program's
configuration file;
simrisc-VERSION/stdconfig/simrisc
, where VERSION
is
replaced by simrisc's actual release version;
.deb
files) the default configuration file is commonly available
as /usr/shared/doc/simrisc/simrisc.gz
simriscparams(7)