Thyroid function tests – time for a reassessment*
D StJ O'Reilly
HK Pract 2001;23:212-216
Introduction
In 1999, 890000 measurements of thyroid stimulating hormone were performed by Scottish
hospital laboratories –– approximately one test for every six of Scotland's 5.1
million people.1 This number does not include tests performed in the
non-NHS laboratories or as part of the screening programme for congenital hypothyroidism.
Although laboratory statistics are not collected nationally in England and Wales,
the market in the United Kingdom (population 59 million) for thyroid stimulating
hormone diagnostic tests is currently estimated at 9-10 million each year.
A remarkable downgrading of the clinical aspects of hypothyroidism and hyperthyroidism
has paralleled the inexorable increase in the number of thyroid function tests performed
over the past 20 years. This has led to chaos in the diagnosis of hypothyroidism.
It has been stated that a diagnosis of clinical hypothyroidism can be made on the
basis of biochemical measurements alone and that signs and symptoms are unnecessary.2
Other authors protest, and maintain that biochemical tests can be misleading and
that the diagnosis can be made on clinical grounds alone.3 In hyperthyroidism,
a suppressed thyroid stimulating hormone concentration is currently the cornerstone
of biochemical diagnosis. No numerical value has been assigned to the serum concentration
of thyroid stimulating hormone below which suppression is considered to occur. This
value varies from centre to centre depending on the sensitivity of the local assay.
Thus, to many nonspecialists the diagnosis of hyperthyroidism is also confusing.
Methods
This review is based on my 20 years' postgraduate experience in providing biochemical
thyroid function tests and treating patients with thyroid disorders. I have selected
and highlighted some of the publications that have influenced my practice and call
into question the increasing reliance on biochemical thyroid function tests in making
a diagnosis.
Historical setting
The treatments currently used for hyperthyroidism and hypothyroidism were established
by the beginning of the 1970s. Though the symptoms and signs of these disorders
had been analysed and clinical scoring indices had been developed and validated
in the 1960s, clinical diagnosis remained problematic.4-8 The clinical
diagnostic schemes for hypothyroidism were similar,4-6 but there were
considerable differences between diagnostic schemes for hyperthyroidism. For example,
atrial fibrillation was considered by Wayne and Crooks to be one of the most powerful
discriminating signs,6,7 but it was not included by Gurney et al.8
Age, on the other hand, was a major diagnostic factor according to Gurney et al,8
but was not mentioned by Wayne or Crooks.6,7 From knowledge of the pathophysiology
of the hypothalamic-pituitary-thyroid axis available at that time, it was believed
that measuring the concentration of serum thyroid stimulating hormone would simplify
the diagnosis.
Hypothyroidism
The publication of a reliable and practical assay for thyroid stimulating hormone
was a landmark.9 A normal range of < 0.5-4.2 mU/l was established, based
on measurements from 29 control subjects. One of the first applications of the assay
was in patients who had undergone subtotal thyroidectomy for Graves' disease.10
In 28 "unequivocally euthyroid" patients followed for three to 21 years, the mean
concentration was 8.2 mU/l (range 1.3-34.0 mU/l). In four patients followed up for
four to 12 years and in whom a therapeutic trial of thyroxine had shown no benefit,
the thyroid stimulating hormone concentration range was 10.5-21.5 mU/l. These patients
were considered to be unequivocally euthyroid by a group who had validated clinical
indices for the diagnosis of hypoparathyroidism and hyperthyroidism.5,7
They were used to show the superiority of thyroid stimulating hormone measurements
in detecting hypothyroidism, and no suggestion was made that the normal range could
be widened.
In 1973, the data on which the concept of subclinical hypothyroidism was based were
published.11 The reference range for thyroid stimulating hormone, established
from measurement in 29 subjects,10 was used to classify 22 euthyroid
subjects as having subclinical hypothyroidism. In six of the 22 subjects given a
therapeutic trial of thyroxine, treatment showed no benefit, and 10 had originally
been recruited as normal controls.
Whickham survey
The Whickham survey was a further landmark.12 All Whickham residents
with a serum thyroid hormone concentration > 6 mU/l were diagnosed as being hypothyroid,
irrespective of their clinical status. This reinforced the view that the serum thyroid
stimulating hormone concentration defined hypothyroidism.
The 20 year follow up study of the Whickham survey has yielded invaluable data on
the natural history of thyroid disorders.13 A main conclusion of the
study, disseminated to most non-specialists in a review published in the BMJ, was
that "thyroid stimulating hormone concentrations above 2 mU/l are associated with
an increased risk of hypothyroidism". 2 Half of the population (male
and female) fall into this category.12 This conclusion was based on the
change in the slope of the line obtained when the log of the serum thyroid stimulating
hormone concentration was related to the logit probability of developing hypothyroidism
over a 20 year period in women (see box).13 The probability of a 40- year-old woman
with a thyroid stimulating hormone of 2.1 mU/l developing hypothyroidism is low
–– at 1 in 50 over 20 years. In men, the probability is so low that an equivalent
equation could not be derived.13
Clinical features ignored
The review also highlighted the fact that in making a diagnosis of clinical or overt
hypothyroidism "symptoms are not considered a criterion by some authorities".2 The
review claimed great authority. It was pointed out that some of the data on which
it was based had been collected for the consensus statement for good practice and
audit measures in the management of hypothyroidism and hyperthyroidism published
on behalf of the Royal College of Physicians of London and the Society for Endocrinology.14
This publication makes no reference to the clinical manifestations or clinical diagnosis
of hypothyroidism. Thus, the clinical features of hypothyroidism seem to have been
relegated to the status of historical curiosities.
Hyperthyroidism
Assays capable of defining the lower end of the statistically derived reference
range became available in the early 1980s. One evaluation of such an assay reported
that all of 110 hyperthyroid patients studied had a thyroid stimulating hormone
concentration < 0.07 mU/l, and all 62 euthyroid control subjects had concentrations
> 0.07 mU/l.15 However, some clinically euthyroid subjects with abnormally low thyroid
stimulating hormone concentrations were classified as having subclinical hyperthyroidism.15
Assays can now detect thyroid stimulating hormone in serum at concentrations of
0.005m U/l.16 At this low concentration, hyperthyroid patients were not distinguished
from some euthyroid, though ill, patients.16 The range of thyroid stimulating hormone
concentrations in patients whose condition stabilised on thyroxine replacement treatment
was < 0.005 to > 10.00 mU/l.16 It is therefore clear that measurement of the thyroid
stimulating hormone concentration has failed to deliver what was expected of it.
Clinical aspects
During this period the clincial aspects of hyperthyroidism have also been downgraded.
Most current undergraduate textbooks treat the clinical diagnosis of thyroid dysfunction
by referring the student to lists. In the current edition of the Oxford Textbook
of Medicine, this matter is dismissed in less than a line, and the reader is referred
to unweighted lists of the sympotoms and signs.17 In the popular postgraduate textbook
of Clinical Endocrinology, the biochemical diagnosis and assessment of hyperthyroidism
are given before the clinical features.18 Medical journals are now effectively devoid
of references to the clinical features of hyperthyroidism. Though a symptom rating
scale for the diagnosis of hyperthyroidism was described in 1988,19 the clinical
scoring systems for assessing hypothyroidism and hyperthyroidism are now rarely
cited (Table 1).
Non-thyroidal illness syndrome
We have recently become aware of the complexity of the effects of non-thyroidal
illness on the hypothalamicpituitary- thyroid axis and thyroid hormone metabolism.
Figures like the one shown (taken from a recent review20) are frequently used to
illustrate the nature of the changes that occur in serum thyroid hormone concentrations
in the non-thyroidal illness syndrome. These figures have never been published with
a numerical scale or error bars. The problem of interpreting free thyroxine was
summarised by the author: "It is common to find that a sample obtained from a patient
with non-thyroidal illness syndrome may have a raised free thyroxine by one method
but a normal or low free thyroxine by another."20 The equilibrium dialysis reference
method used to profile free thyroxine in the figure is technically demanding and
currently not established in the United Kingdom. As the original legend to the figure
explains:
The profile for free thyroxine is that obtained using equilibrium dialysis and low
sample dilution. The level of free thyroxine found using commercial methods will
be heavily method dependent. A profile of free triiodothyronine is not included
as some ultrafiltration methods suggest that normal or raised free triiodothyronine
may be found in illness whilst equilibrium dialysis methods usually show diminished
or normal concentrations.20
What free thyroxine and free triiodothyronine assays actually measure is controversial.21
However, what is clear is that we cannot interpret thyroid function tests in systemically
ill patients.
Current status of thyroid function tests
Our understanding of the complexity of the cerebralhypothalamic- pituitary-thyroid
axis and the mechanism of thyroid hormone action has grown enormously. Current knowledge
indicates that the cardiac effects of thyroid hormones, which are clinically very
important, are mediated via the α1 thyroid hormone receptor independent of the β
receptors, which are the dominant regulators of thyroid stimulating hormone secretion.22
False positive and negative results
Overlap between the statistically derived normal and abnormal ranges is accepted
in diagnostic tests, giving rise to false positive and false negative results. These
concepts have not been applied to measurements of thyroid stimulating hormone. Rather
than accepting that the test can be fallible, we transfer the problem to the patient.
In patients with systemic disease, the nonthyroidal illness syndrome is invoked
to explain the anomalous results, and healthy subjects are diagnosed as having subclinical
hypothyroidism or hyperthyroidism.11,15 The distribution of the serum thyroid stimulating
hormone concentration in the population is logarithmic.13 Thus, minor deviations
from the statistically derived reference range are unlikely to be clinically meaningful.11
Conclusion
Studies in 1580 inpatients23 and in 630 patients admitted as medical emergencies24
found that thyroid function tests performed as screening tests yielded abnormal
results in 33% and 20% of patients respectively. In both studies, the biochemical
tests suggested thyroid disease incorrectly (that is, they gave false positive results)
in nine cases out of 10. Thus, indiscriminate use of thyroid function tests is more
likely to confuse than to help.
We do not know how important the thyroid function tests are for making a diagnosis
of thyroid dysfunction. It is a matter of personal judgement. Experience has shown
that thyroid function tests, like all the signs and symptoms associated with hypothyroidism
and hyperthyroidism, are not totally reliable. As it becomes clear that biochemical
assessments cannot deliver the diagnostic accuracy expected of them, the fact that
the clinical aspects of assessing thyroid dysfunction are being sidelined is a cause
for concern. Doing more biochemical tests will lead to further confusion, not the
hoped for clarity. The information obtained from thyroid function tests, despite
its quantitative numerical appearances, is "soft". How soft has yet to be established.
I thank Dr David Lyon for mathematical help, Dr Ann Wales for obtaining the citation
data given in the table, and Drs G H Beastall and H G Gray for constructive comments
and discussion.
Competing interests: None declared.
Key messages
- There are no data on the relative importance of biochemical thyroid function tests
and clinical symptoms and signs in assessing thyroid dysfunction.
- Secretion of thyroid stimulating hormone is influenced by many factors other than
the negative feedback inhibition by thyroxine or triiodothyronine.
- Changes in thyroid stimulating hormone, thyroxine, and triiodothyronine concentrations
during systemic illness are poorly understood.
- Thyroid function tests cannot be interpreted in patients with systemic illness.
- Since thyroid stimulating hormone concentrations are distributed logarithmically
in the population, minor changes are unlikely to be clinically important.
- The possibility of false positive and false negative results should be considered
in interpreting thyroid stimulating hormone concentrations.
D StJ O’Reilly,
Consultant Clinical Biochemist,
Department of Clinical Biochemistry and Clinic for Thyroid Diseases, Royal Infirmary.
Correspondence to : Dr D StJ O’Reilly, Department of Clinical Biochemistry
and Clinic for Thyroid Diseases, Royal Infirmary, Glasgow G4 OSF, U.K.
References
- The NHS in Scotland. Laboratory statistics 1999. Edinburgh: Information and Statistics
Division, The NHS in Scotland, 1999.
- Weetman AP. Hypothyroidism: screening and subclinical disease. BMJ 1997;314:1175-1178.
- Skinner GRB, Thomas R, Taylor M, et al. Thyroxine should be tried in clinically
hyothyroid but biochemically euthyroid patients. BMJ 1997;314: 1764.
- Murray IPC. The clinical diagnosis of thyroid disease. Med J Aust 1964; 1:827-831.
- Billewicz WZ, Chapman RS, Crooks J, et al. Statistical methods applied to the diagnosis
of hypothyroidism. Q J Med 1969;38:255-266.
- Wayne EJ. Clinical and metabolic studies in thyroid disease. BMJ 1960;i: 78-90.
- Crooks J, Murray IPC, Wayne EJ. Statistical methods applied to the clinical diagnosis
of thyrotoxicosis. Q J Med 1959;28:211-234.
- Gurney C, Owen SG, Hall R, et al. Newcastle thyrotoxicosis index. Lancet 1970;ii:1275-1278.
- Hall R, Amos J, Ormston BJ. Radioimmunoassay of human serum thyrotrophin. BMJ 1971;i:582-585.
- Hedley AJ, Hall R, Amos J, et al. Serum thyrotropin levels after subtotal thyroidectomy
for Graves disease. Lancet 1971;i:455-458.
- Evered DC, Ormston BJ, Smith PA, et al. Grades of hypothyroidism. BMJ 1973;i:657-662.
- Tunbridge WMG, Evered DC, Hall R, et al. The spectrum of thyroid disease in a community:
the Whickham survey. Clin Endocrinol 1977;7:481-493.
- Vanderpump MPJ, Tunbridge WMG, French JM, et al. The incidence of thyroid disorders
in the community: a twenty-year follow-up of the Whickham survey. Clin Endocrinol
1995;43:55-68.
- Vanderpump MPJ, Ahlquist JAO, Franklyn JA, et al. Consensus statement for good practice
and audit measures in the management of hypothyroidism and hyperthyroidism. BMJ
1996;313:539-544.
- Seth J, Kellett HA, Caldwell G, et al. A sensitive immunoradiometric assay for serum
thyroid stimulating hormone: a replacement for the thyrotropin releasing hormone
test? BMJ 1984;289:1334-1336.
- Wilkinson E, Rae PWH, Thomson KJT, et al. Chemiluminescent third generation assay
(Amerlite TSH30) of thyroid stimulating hormone in serum or plasma assessed. Clin
Chem 1993;39:2166-2173.
- McGregor AM. The thyroid gland and disorders of thyroid function. In: Weatherall
DJ, Ledingham JGG, Warrell DA, eds. Oxford textbook of medicine. 3rd ed. Oxford:
Oxford University Press, 1996;1603-1621.
- Hall R. Hyperthyroidism and Graves disease. In: Besser GM, Thorner MO, eds. Clinical
endocrinology. 2nd ed. London: Mosby Wolfe, 1994;17:1- 24.
- Klein I, Trzepacz PT, Roberts M, et al. Symptom rating scale for assessing hyperthyroidism.
Arch Intern Med 1988;148:387-390.
- Beckett GJ, Wilkinson E. Thyroid hormone metabolism and thyroid function tests in
non-thyroidal illness. CPD Bull Clin Biochem 1998;1:9-14.
- Ekins R. Measurement of free hormones in blood. Endocrin Rev 1990;11: 5-46.
- Weiss RE, Murata Y, Cua K, et al. Thyroid hormone action on liver, heart, and energy
expenditure in thyroid hormone receptor β-deficient mice. Endocrinology 1998;139:4945-4952.
- Spencer C, Eigen A, Shen D, et al. Specificity of sensitive assays of thyrotropin
(TSH) used to screen for thyroid disease in hospitalised patients. Clin Chem 1987;33:1391-1396.
- Small M, Buchanan L, Evans R. Value of screening thyroid functions in acute medical
admissions to hospital. Clin Endocrinol 1990;32:185-191.
|