Thyroid function tests – time for a reassessment*
                            
                                D StJ O'Reilly 
                            
                                HK Pract 2001;23:212-216 
                            
                                Introduction 
                            
                                In 1999, 890000 measurements of thyroid stimulating hormone were performed by Scottish
                                hospital laboratories –– approximately one test for every six of Scotland's 5.1
                                million people.1 This number does not include tests performed in the
                                non-NHS laboratories or as part of the screening programme for congenital hypothyroidism.
                                Although laboratory statistics are not collected nationally in England and Wales,
                                the market in the United Kingdom (population 59 million) for thyroid stimulating
                                hormone diagnostic tests is currently estimated at 9-10 million each year. 
                            
                                A remarkable downgrading of the clinical aspects of hypothyroidism and hyperthyroidism
                                has paralleled the inexorable increase in the number of thyroid function tests performed
                                over the past 20 years. This has led to chaos in the diagnosis of hypothyroidism.
                                It has been stated that a diagnosis of clinical hypothyroidism can be made on the
                                basis of biochemical measurements alone and that signs and symptoms are unnecessary.2
                                Other authors protest, and maintain that biochemical tests can be misleading and
                                that the diagnosis can be made on clinical grounds alone.3 In hyperthyroidism,
                                a suppressed thyroid stimulating hormone concentration is currently the cornerstone
                                of biochemical diagnosis. No numerical value has been assigned to the serum concentration
                                of thyroid stimulating hormone below which suppression is considered to occur. This
                                value varies from centre to centre depending on the sensitivity of the local assay.
                                Thus, to many nonspecialists the diagnosis of hyperthyroidism is also confusing. 
                            
                                Methods 
                            
                                This review is based on my 20 years' postgraduate experience in providing biochemical
                                thyroid function tests and treating patients with thyroid disorders. I have selected
                                and highlighted some of the publications that have influenced my practice and call
                                into question the increasing reliance on biochemical thyroid function tests in making
                                a diagnosis. 
                            
                                Historical setting 
                            
                                The treatments currently used for hyperthyroidism and hypothyroidism were established
                                by the beginning of the 1970s. Though the symptoms and signs of these disorders
                                had been analysed and clinical scoring indices had been developed and validated
                                in the 1960s, clinical diagnosis remained problematic.4-8 The clinical
                                diagnostic schemes for hypothyroidism were similar,4-6 but there were
                                considerable differences between diagnostic schemes for hyperthyroidism. For example,
                                atrial fibrillation was considered by Wayne and Crooks to be one of the most powerful
                                discriminating signs,6,7 but it was not included by Gurney et al.8
                                Age, on the other hand, was a major diagnostic factor according to Gurney et al,8
                                but was not mentioned by Wayne or Crooks.6,7 From knowledge of the pathophysiology
                                of the hypothalamic-pituitary-thyroid axis available at that time, it was believed
                                that measuring the concentration of serum thyroid stimulating hormone would simplify
                                the diagnosis. 
                            
                                Hypothyroidism 
                            
                                The publication of a reliable and practical assay for thyroid stimulating hormone
                                was a landmark.9 A normal range of < 0.5-4.2 mU/l was established, based
                                on measurements from 29 control subjects. One of the first applications of the assay
                                was in patients who had undergone subtotal thyroidectomy for Graves' disease.10
                                In 28 "unequivocally euthyroid" patients followed for three to 21 years, the mean
                                concentration was 8.2 mU/l (range 1.3-34.0 mU/l). In four patients followed up for
                                four to 12 years and in whom a therapeutic trial of thyroxine had shown no benefit,
                                the thyroid stimulating hormone concentration range was 10.5-21.5 mU/l. These patients
                                were considered to be unequivocally euthyroid by a group who had validated clinical
                                indices for the diagnosis of hypoparathyroidism and hyperthyroidism.5,7
                                They were used to show the superiority of thyroid stimulating hormone measurements
                                in detecting hypothyroidism, and no suggestion was made that the normal range could
                                be widened. 
                            
                                In 1973, the data on which the concept of subclinical hypothyroidism was based were
                                published.11 The reference range for thyroid stimulating hormone, established
                                from measurement in 29 subjects,10 was used to classify 22 euthyroid
                                subjects as having subclinical hypothyroidism. In six of the 22 subjects given a
                                therapeutic trial of thyroxine, treatment showed no benefit, and 10 had originally
                                been recruited as normal controls. 
                            
                                Whickham survey 
                            
                                The Whickham survey was a further landmark.12 All Whickham residents
                                with a serum thyroid hormone concentration > 6 mU/l were diagnosed as being hypothyroid,
                                irrespective of their clinical status. This reinforced the view that the serum thyroid
                                stimulating hormone concentration defined hypothyroidism. 
                            
                                The 20 year follow up study of the Whickham survey has yielded invaluable data on
                                the natural history of thyroid disorders.13 A main conclusion of the
                                study, disseminated to most non-specialists in a review published in the BMJ, was
                                that "thyroid stimulating hormone concentrations above 2 mU/l are associated with
                                an increased risk of hypothyroidism". 2 Half of the population (male
                                and female) fall into this category.12 This conclusion was based on the
                                change in the slope of the line obtained when the log of the serum thyroid stimulating
                                hormone concentration was related to the logit probability of developing hypothyroidism
                                over a 20 year period in women (see box).13 The probability of a 40- year-old woman
                                with a thyroid stimulating hormone of 2.1 mU/l developing hypothyroidism is low
                                –– at 1 in 50 over 20 years. In men, the probability is so low that an equivalent
                                equation could not be derived.13 
                            
                                  
                            
                                Clinical features ignored 
                            
                                The review also highlighted the fact that in making a diagnosis of clinical or overt
                                hypothyroidism "symptoms are not considered a criterion by some authorities".2 The
                                review claimed great authority. It was pointed out that some of the data on which
                                it was based had been collected for the consensus statement for good practice and
                                audit measures in the management of hypothyroidism and hyperthyroidism published
                                on behalf of the Royal College of Physicians of London and the Society for Endocrinology.14
                                This publication makes no reference to the clinical manifestations or clinical diagnosis
                                of hypothyroidism. Thus, the clinical features of hypothyroidism seem to have been
                                relegated to the status of historical curiosities. 
                            
                                Hyperthyroidism 
                            
                                Assays capable of defining the lower end of the statistically derived reference
                                range became available in the early 1980s. One evaluation of such an assay reported
                                that all of 110 hyperthyroid patients studied had a thyroid stimulating hormone
                                concentration < 0.07 mU/l, and all 62 euthyroid control subjects had concentrations
                                > 0.07 mU/l.15 However, some clinically euthyroid subjects with abnormally low thyroid
                                stimulating hormone concentrations were classified as having subclinical hyperthyroidism.15
                                Assays can now detect thyroid stimulating hormone in serum at concentrations of
                                0.005m U/l.16 At this low concentration, hyperthyroid patients were not distinguished
                                from some euthyroid, though ill, patients.16 The range of thyroid stimulating hormone
                                concentrations in patients whose condition stabilised on thyroxine replacement treatment
                                was < 0.005 to > 10.00 mU/l.16 It is therefore clear that measurement of the thyroid
                                stimulating hormone concentration has failed to deliver what was expected of it. 
                            
                                Clinical aspects 
                            
                                During this period the clincial aspects of hyperthyroidism have also been downgraded.
                                Most current undergraduate textbooks treat the clinical diagnosis of thyroid dysfunction
                                by referring the student to lists. In the current edition of the Oxford Textbook
                                of Medicine, this matter is dismissed in less than a line, and the reader is referred
                                to unweighted lists of the sympotoms and signs.17 In the popular postgraduate textbook
                                of Clinical Endocrinology, the biochemical diagnosis and assessment of hyperthyroidism
                                are given before the clinical features.18 Medical journals are now effectively devoid
                                of references to the clinical features of hyperthyroidism. Though a symptom rating
                                scale for the diagnosis of hyperthyroidism was described in 1988,19 the clinical
                                scoring systems for assessing hypothyroidism and hyperthyroidism are now rarely
                                cited (Table 1). 
                            
                                  
                            
                                Non-thyroidal illness syndrome 
                            
                                We have recently become aware of the complexity of the effects of non-thyroidal
                                illness on the hypothalamicpituitary- thyroid axis and thyroid hormone metabolism.
                                Figures like the one shown (taken from a recent review20) are frequently used to
                                illustrate the nature of the changes that occur in serum thyroid hormone concentrations
                                in the non-thyroidal illness syndrome. These figures have never been published with
                                a numerical scale or error bars. The problem of interpreting free thyroxine was
                                summarised by the author: "It is common to find that a sample obtained from a patient
                                with non-thyroidal illness syndrome may have a raised free thyroxine by one method
                                but a normal or low free thyroxine by another."20 The equilibrium dialysis reference
                                method used to profile free thyroxine in the figure is technically demanding and
                                currently not established in the United Kingdom. As the original legend to the figure
                                explains: 
                            
                                  
                            
                                The profile for free thyroxine is that obtained using equilibrium dialysis and low
                                sample dilution. The level of free thyroxine found using commercial methods will
                                be heavily method dependent. A profile of free triiodothyronine is not included
                                as some ultrafiltration methods suggest that normal or raised free triiodothyronine
                                may be found in illness whilst equilibrium dialysis methods usually show diminished
                                or normal concentrations.20 
                            
                                What free thyroxine and free triiodothyronine assays actually measure is controversial.21
                                However, what is clear is that we cannot interpret thyroid function tests in systemically
                                ill patients. 
                            
                                Current status of thyroid function tests 
                            
                                Our understanding of the complexity of the cerebralhypothalamic- pituitary-thyroid
                                axis and the mechanism of thyroid hormone action has grown enormously. Current knowledge
                                indicates that the cardiac effects of thyroid hormones, which are clinically very
                                important, are mediated via the α1 thyroid hormone receptor independent of the β
                                receptors, which are the dominant regulators of thyroid stimulating hormone secretion.22 
                            
                                False positive and negative results 
                            
                                Overlap between the statistically derived normal and abnormal ranges is accepted
                                in diagnostic tests, giving rise to false positive and false negative results. These
                                concepts have not been applied to measurements of thyroid stimulating hormone. Rather
                                than accepting that the test can be fallible, we transfer the problem to the patient.
                                In patients with systemic disease, the nonthyroidal illness syndrome is invoked
                                to explain the anomalous results, and healthy subjects are diagnosed as having subclinical
                                hypothyroidism or hyperthyroidism.11,15 The distribution of the serum thyroid stimulating
                                hormone concentration in the population is logarithmic.13 Thus, minor deviations
                                from the statistically derived reference range are unlikely to be clinically meaningful.11 
                            
                                Conclusion 
                            
                                Studies in 1580 inpatients23 and in 630 patients admitted as medical emergencies24
                                found that thyroid function tests performed as screening tests yielded abnormal
                                results in 33% and 20% of patients respectively. In both studies, the biochemical
                                tests suggested thyroid disease incorrectly (that is, they gave false positive results)
                                in nine cases out of 10. Thus, indiscriminate use of thyroid function tests is more
                                likely to confuse than to help. 
                            
                                We do not know how important the thyroid function tests are for making a diagnosis
                                of thyroid dysfunction. It is a matter of personal judgement. Experience has shown
                                that thyroid function tests, like all the signs and symptoms associated with hypothyroidism
                                and hyperthyroidism, are not totally reliable. As it becomes clear that biochemical
                                assessments cannot deliver the diagnostic accuracy expected of them, the fact that
                                the clinical aspects of assessing thyroid dysfunction are being sidelined is a cause
                                for concern. Doing more biochemical tests will lead to further confusion, not the
                                hoped for clarity. The information obtained from thyroid function tests, despite
                                its quantitative numerical appearances, is "soft". How soft has yet to be established. 
                            
                                I thank Dr David Lyon for mathematical help, Dr Ann Wales for obtaining the citation
                                data given in the table, and Drs G H Beastall and H G Gray for constructive comments
                                and discussion. 
                            
                                Competing interests: None declared. 
                            
                                Key messages 
                            
                                 
                                    - There are no data on the relative importance of biochemical thyroid function tests
                                        and clinical symptoms and signs in assessing thyroid dysfunction.
 
                                    - Secretion of thyroid stimulating hormone is influenced by many factors other than
                                        the negative feedback inhibition by thyroxine or triiodothyronine.
 
                                    - Changes in thyroid stimulating hormone, thyroxine, and triiodothyronine concentrations
                                        during systemic illness are poorly understood.
 
                                    - Thyroid function tests cannot be interpreted in patients with systemic illness.
 
                                    - Since thyroid stimulating hormone concentrations are distributed logarithmically
                                        in the population, minor changes are unlikely to be clinically important.
 
                                    - The possibility of false positive and false negative results should be considered
                                        in interpreting thyroid stimulating hormone concentrations.
 
                                 
                            
                             
                            
                                D StJ O’Reilly,
                                Consultant Clinical Biochemist,
                                Department of Clinical Biochemistry and Clinic for Thyroid Diseases, Royal Infirmary.
                                 
                                    Correspondence to : Dr D StJ O’Reilly, Department of Clinical Biochemistry
                                    and Clinic for Thyroid Diseases, Royal Infirmary, Glasgow G4 OSF, U.K. 
                             
                             
                            
                                References
                                
                                    - The NHS in Scotland. Laboratory statistics 1999. Edinburgh: Information and Statistics
                                        Division, The NHS in Scotland, 1999. 
 
                                    - Weetman AP. Hypothyroidism: screening and subclinical disease. BMJ 1997;314:1175-1178.
                                    
 
                                    - Skinner GRB, Thomas R, Taylor M, et al. Thyroxine should be tried in clinically
                                        hyothyroid but biochemically euthyroid patients. BMJ 1997;314: 1764. 
 
                                    - Murray IPC. The clinical diagnosis of thyroid disease. Med J Aust 1964; 1:827-831.
                                    
 
                                    - Billewicz WZ, Chapman RS, Crooks J, et al. Statistical methods applied to the diagnosis
                                        of hypothyroidism. Q J Med 1969;38:255-266. 
 
                                    - Wayne EJ. Clinical and metabolic studies in thyroid disease. BMJ 1960;i: 78-90.
                                    
 
                                    - Crooks J, Murray IPC, Wayne EJ. Statistical methods applied to the clinical diagnosis
                                        of thyrotoxicosis. Q J Med 1959;28:211-234. 
 
                                    - Gurney C, Owen SG, Hall R, et al. Newcastle thyrotoxicosis index. Lancet 1970;ii:1275-1278.
                                    
 
                                    - Hall R, Amos J, Ormston BJ. Radioimmunoassay of human serum thyrotrophin. BMJ 1971;i:582-585.
                                    
 
                                    - Hedley AJ, Hall R, Amos J, et al. Serum thyrotropin levels after subtotal thyroidectomy
                                        for Graves disease. Lancet 1971;i:455-458. 
 
                                    - Evered DC, Ormston BJ, Smith PA, et al. Grades of hypothyroidism. BMJ 1973;i:657-662.
                                    
 
                                    - Tunbridge WMG, Evered DC, Hall R, et al. The spectrum of thyroid disease in a community:
                                        the Whickham survey. Clin Endocrinol 1977;7:481-493. 
 
                                    - Vanderpump MPJ, Tunbridge WMG, French JM, et al. The incidence of thyroid disorders
                                        in the community: a twenty-year follow-up of the Whickham survey. Clin Endocrinol
                                        1995;43:55-68. 
 
                                    - Vanderpump MPJ, Ahlquist JAO, Franklyn JA, et al. Consensus statement for good practice
                                        and audit measures in the management of hypothyroidism and hyperthyroidism. BMJ
                                        1996;313:539-544. 
 
                                    - Seth J, Kellett HA, Caldwell G, et al. A sensitive immunoradiometric assay for serum
                                        thyroid stimulating hormone: a replacement for the thyrotropin releasing hormone
                                        test? BMJ 1984;289:1334-1336. 
 
                                    - Wilkinson E, Rae PWH, Thomson KJT, et al. Chemiluminescent third generation assay
                                        (Amerlite TSH30) of thyroid stimulating hormone in serum or plasma assessed. Clin
                                        Chem 1993;39:2166-2173. 
 
                                    - McGregor AM. The thyroid gland and disorders of thyroid function. In: Weatherall
                                        DJ, Ledingham JGG, Warrell DA, eds. Oxford textbook of medicine. 3rd ed. Oxford:
                                        Oxford University Press, 1996;1603-1621. 
 
                                    - Hall R. Hyperthyroidism and Graves disease. In: Besser GM, Thorner MO, eds. Clinical
                                        endocrinology. 2nd ed. London: Mosby Wolfe, 1994;17:1- 24. 
 
                                    - Klein I, Trzepacz PT, Roberts M, et al. Symptom rating scale for assessing hyperthyroidism.
                                        Arch Intern Med 1988;148:387-390. 
 
                                    - Beckett GJ, Wilkinson E. Thyroid hormone metabolism and thyroid function tests in
                                        non-thyroidal illness. CPD Bull Clin Biochem 1998;1:9-14. 
 
                                    - Ekins R. Measurement of free hormones in blood. Endocrin Rev 1990;11: 5-46.
                                    
 
                                    - Weiss RE, Murata Y, Cua K, et al. Thyroid hormone action on liver, heart, and energy
                                        expenditure in thyroid hormone receptor β-deficient mice. Endocrinology 1998;139:4945-4952.
                                    
 
                                    - Spencer C, Eigen A, Shen D, et al. Specificity of sensitive assays of thyrotropin
                                        (TSH) used to screen for thyroid disease in hospitalised patients. Clin Chem 1987;33:1391-1396.
                                    
 
                                    - Small M, Buchanan L, Evans R. Value of screening thyroid functions in acute medical
                                        admissions to hospital. Clin Endocrinol 1990;32:185-191.
 
                                 
                             
                         |