1.
Introduction
The widespread distribution of
Hospital Information Systems (HIS) in healthcare institutions requires
professional evaluation to assess the practical usefulness of these
applications. So far, evaluations of HIS have been undertaken
focussing mainly on financial aspects or considering the patients
interests. A major aspect has been neglected: The user! Nurses, physicians and other healthcare
employees, working
with the software, spend a lot of time each day by
filling in forms, reviewing medical inspection
results and handling an amount of information for
administration needs.
The
usability of a product is considered as a precondition of the
usefulness of an application (Nielsen, 1993). It is defined with
respect to “ the extent to which the product can be used by specific
users to achieve specific goals with effectiveness, efficiency, and
satisfaction in a specific context of use.” (ISO 9241 Part 11, 1998).
Unfortunately today not many applications fulfill this demand, and
thus cause errors, trouble and stress as well as high costs on the
part of the users and the organisation (Landauer, 1995).
Usability
evaluation aims at identifying strengths and weaknesses of an
application and gives hints for improving its usability. There is a
multitude of methods for the purpose of software evaluation (Gediga,
Hamborg & Düntsch, 2002). Questionnaires are well suited for the
summative evaluation of software applications, especially in larger
organisations like hospitals, public administrations etc. They are
economic evaluation techniques which can be applied to a larger number
of users at the same time with comparatively small financial effort.
In this paper the IsoMetrics
Inventory (Gediga, Hamborg & Düntsch, 1999) for summative and
formative evaluation of software usability will be presented. Its
application in an evaluation study concerned with the usability of a
HIS is demonstrated. In this study, we established an online version
of the questionnaire, aiming at reducing efforts and at speeding up
recurrent surveys and consecutive data evaluation. The equivalence of
the paper-and-pencil and the online format is examined as well as the
reliability of the questionnaire in the application area HIS.
The IsoMetrics questionnaire
will be presented in the context of an evaluation study which was
conducted at the University Hospital of Heidelberg, Department of
Internal Medicine.
2.1
Material and
methods
2.1.1
The
IsoMetrics questionnaire
The IsoMetrics usability
inventory (Gediga, Hamborg & Düntsch, 1999) provides a user-oriented,
summative as well as formative approach to software evaluation on the
basis of ISO 9241 Part 10. While summative evaluation is typically
quantitative and located at the end of a development process, using
numeric scores to assess the usability of an application, formative
evaluation provides (often qualitative) information about weaknesses
useful in improving the usability of a software system during the
engineering life cycle or before further development. Accordingly
there are two versions of IsoMetrics, both based on the same items:
IsoMetricsS (short) supports summative evaluation, whereas
IsoMetricsL (long) is best suited for formative evaluation
purposes. The inventory is available as English and German language
version and can be administered by either paper and pencil or an
online (inter-/intranet) version. The current version of IsoMetrics
(2.04 german/2.01 english) comprises 75 items operationalising the
seven design principles of the international standard ISO 9241 Part
10. ISO 9241 formulates „Ergonomic requirements for office work with
visual display terminals (VDTs)” and provides guidance for the
ergonomic design of interactive software. It comprises 17 different
parts, whereas Part 10 covers seven principles for dialog design (s.
Table 1).
Table 1: Dialogue
Principles according to ISO 9241 Part 10 (translated from the german
version by the authors).
|
Suitability for the
task |
A dialogue is
suitable, if it supports the user to realise his tasks
effectively and efficiently. Only those parts of the software
are presented, which are necessary to fulfil the task. |
|
Self-descriptiveness |
A dialogue is
self-descriptive, if every step is understandable in an
intuitive way, or, in case of mistakes supported by immediate
feedback. Further, an adequate support should be offered on
demand. |
|
Controllability |
A dialogue is
controllable, if the user is able to start the sequence and
influence its direction as well as speed till he reached his
aim. |
|
Conformity with user
expectations |
A dialogue is
conform with the user expectations, if it is consistent,
complying with the characteristics of the user, e.g. taking into
account the knowledge of the user in that special working area,
accounting education and experience as well as general
acknowledged conventions. |
|
Error tolerance |
A dialogue is error
tolerant, if the intended deliverable is reached with no or just
minimal additional effort despite of obvious faulty steering or
wrong input. |
|
Suitability for
individualisation |
A dialogue is
suitable for individualisation, if the system allows customising
according to the task as well as regarding the individual
capabilities and preferences of a user. |
|
Suitability for
learning |
A dialogue supports
the suitability of learning, if the user is accompanied through
different states of his learning process and the effort for
learning is as low as possible. |
The statement of each item of
the IsoMetricsS Questionnaire is assessed on a five point
rating scale starting from 1 ("predominantly disagree") to 5
("predominantly agree"). A further category ("no opinion") is offered
to reduce arbitrary answers.
IsoMetrics consists of the same
items as IsoMetricsS and uses the same rating procedure.
Additionally, each user is asked to give a second rating, based upon
the request “Please rate the importance of the above item in terms of
supporting your general impression of the software.” This rating
ranges from 1 (“unimportant”) to 5 (“important”). A further “no
opinion” category may also be selected. In this way, each item is
supplied with a weighting index. To evoke information about
malfunctions and weak points of the system under study, the question
“Can you give a concrete example where you can (not) agree with the
above statement?” is posed. This gives users the opportunity to report
problems with the software, which they attribute to the actual
usability item.
IsoMetrics has proved its
practicability in software development projects and field studies.
Given ten evaluating users, IsoMetricsL evokes
approximately one hundred remarks addressing weak-points of a given
software. Its reliability was examined and confirmed for each of the
seven design principles (Gediga, Hamborg & Düntsch, 1999, Gruber,
2000). In order to validate the IsoMetrics inventory, the scale means
of five different software systems were analysed and compared. It
could be shown that programs with different ergonomic qualities were
discriminated by the corresponding scales (Gediga, Hamborg & Düntsch,
1999).
The software examined,
“IS-H*MED” release 4.63B by T-Systems, Austria is based on the IS-H
solution by SAP, Germany. It is mainly table-oriented software with a
broad range of functions:
§
Creation of discharge letter:
A discharge letter is most often dictated on tape by a physician and
afterwards typed by a secretary. Proof-reading and corrections are
realised online, using a MS Word plug-in.
§
View of laboratory and diagnostic
findings: For each
patient, an overview of existing laboratory and diagnostic findings is
available. A list of the findings permits the physician a detailed
look.
§
Documentation of diagnostic finding:
In-patients can be selected by a physician from a listing of the
patients to feed in diagnostic findings. The ICD10-Code of the
diagnoses might be entered directly – or with the help of a plug-in
called KODIP. This plug-in covers the complete ICD-10 via a thesaurus
and offers additional information about the grouping accounting rules
etc.
§
Diagnose related grouping:
After the individual diagnostic findings and resulting medical
procedures (e.g. a heart catheter) are entered into the computer, a
calculation of the Diagnose Related Group’s (DRG) may be accomplished.
§
Order of medical examinations
supports the electronical ordering of medical examinations for a
patient.
§
Documentation of physical
examinations: This
function allows to document the results of an inspection, e.g. an
ultrasound sonic examination, or a radiology report. The reports are
mainly written with help of a MS Word plug-in.
§
Nursing category:
A staffing calculation methodology derived from the traditional
nursing hour per patient day (HPPD), taking into account a systematic
approach estimating effort for a patient with a specified disease.
§
Meal order:
The meal order starts with a listing of the in-patients on a ward.
Detailed orders according to the needs of the patients can be entered.
2.2
Preliminary
enquiry
Before the evaluation study
started, a preliminary enquiry was conducted to collect personal data
of the potential participants (nurses, doctors, secretaries and other
staff of the department). For that purpose a questionnaire was applied
addressing computer-experience in general as well as experience with
IS-H*MED, area of work, used IS-H*med functions, age and sex. 182
persons completed the questionnaire and were willing to take part in
the subsequent evaluation study. Results of the survey were treated
confidentially.
By means of a cluster analysis
six “user-types” according to the used IS-H*med functions were
discriminated (see table 2). Moreover three user categories were
distinguished due to the general as well as the IS-H*med specific
experience: Novices, intermediate and expert users.
2.3.1
Participants
and procedure
The evaluation study was
conducted in January and February 2003. Participation was voluntary,
no financial incentives were offered. We received 132 responses
(online as well as paper-and-pencil Questionnaires) from the 182
participants who took part in the preliminary study and from
additional spontaneous participants.
After the
exclusion of questionnaires with too much missing data (s. chapter
2.3.2 Data analysis) 106 responses remained. Mean age of these
participants was 38 years (SD = 8,81; range: 24-61 years). 55 persons
(51,9 %) were female, 36 (34 %) male; 15 participants (14,1 %) did not
answer the question about their gender. According to
computer-experience, the sample included 22 novice, 27 intermediate
and 30 expert users. 27 persons did not give information about their
general computer experience or their experience with the IS-H*MED
system.
2.3.2
Data
analysis
Questionnaires with more than
20% missing data (more than 15 items not answered) were excluded from
further analysis (s. Gediga, Hamborg & Willumeit, 1998). In case of
less or equal than 15 omissions, missing values were replaced by the
mean scale value (‘3’ ) of the items. The same procedure was applied
if the answer was 'no opinion'. This procedure was controlled by
comparing reliabilities based on the records without missing data with
the reliabilities based on the records with replaced missing data. The
procedure showed no differences of the reliabilities. Some items of
the questionnaire (A1, A8, T12, E8, F1, F7, F14, L1, and L7) are
formulated negative. The values of these items were inverted by the
transformation ri’ = 6 - ri for further
analysis.
To analyse the equivalence of
the paper-and-pencil and the online version of the IsoMetrics
questionnaire two matched groups (à N = 29) from a sub sample of all
participants were established with regard to the user-type, computer
experience, age and sex. The equivalence of both formats was assessed
with respect to the scale mean values and reliabilities.(For a
detailed description of this analysis, see Hamborg, Vehse, Ollermann &
Bludau, 2004).
The reliability of the scales
was computed in a next step. After that, the mean values for both
questionnaire formats were calculated to assess the ergonomic quality
of the application. Moreover the IS-H*med profile was compared with
the profiles of two reference systems.
For the ergonomic quality of
software systems should be assessed with respect to the context of use
(user, task, equipment and environment, see ISO 9241 Part 11, 1998) an
analysis of variance with user-type and computer experience (user
group, experience, etc.) as independent and the 7 IsoMetrics-Scales as
dependent variable was calculated. To identify special differences
between the identified user-types, post-hoc tests have been
calculated. To get more detailed information about ergonomic
shortcomings of the software, ratings of the single IsoMetrics items
were analysed at least.
2.4.1
Equivalence
of the online and paper-based questionnaire
Analysis of the scale means
revealed no marked differences between the two matched samples using
the online respectively the paper-and-pencil version of IsoMetricsS
(table 3).
Table 3:
Means of the online- and paper-pencil version
|
IsoMetrics Scale |
Format of the
Questionnaire |
N |
Mean |
SD |
T |
df |
sig.
(2-sides) |
|
Suitability for the
task (15) |
Online |
29 |
2.54 |
.793 |
-.771 |
56 |
.444 |
|
Paper |
|
29 |
2.70 |
.841 |
|
Self-descriptiveness (12) |
|
Online |
29 |
2.33 |
.764 |
-1.559 |
56 |
.125 |
|
Paper |
|
29 |
2.66 |
.835 |
|
Controllability (11) |
|
Online |
29 |
2.72 |
.795 |
-1.423 |
56 |
.160 |
|
Paper |
|
29 |
3.03 |
.864 |
|
Conformity with user expectations
(8) |
|
Online |
29 |
2.87 |
.634 |
-1,534 |
56 |
.131 |
|
Paper |
|
29 |
3.13 |
.692 |
|
Error tolerance (15) |
|
Online |
29 |
2.61 |
.586 |
-.683 |
56 |
.497 |
|
Paper |
|
29 |
2.72 |
.618 |
|
Suitability for individualisation
(6) |
|
online |
29 |
1.94 |
.763 |
-.245 |
48.39 |
.807 |
|
Paper |
|
29 |
2.00 |
1.161 |
|
Suitability for learning (8) |
|
online |
29 |
2.52 |
.708 |
-.784 |
56 |
.437 |
|
Paper |
|
29 |
2.70 |
1.058 |