Antithyroid drug use during pregnancy and the risk of birth defects in offspring: systematic review and meta-analysis of observational studies with methodological considerations
Funding information: Wellcome Trust, Grant/Award Number: 214588/Z/18/Z; European Research Council, Grant/Award Number: 639377
Abstract
Aims
Maternal antithyroid drug (ATD) use during pregnancy has been associated with an increased risk of birth defects in offspring. Uncertainty remains on the size of this risk and how it compares to untreated hyperthyroidism due to methodological limitations of previous studies.
Methods
Systematic review of MEDLINE and EMBASE identifying observational studies examining ATD use during pregnancy and risk of birth defects by 28 August 2020. Data were extracted on study characteristics, effect estimates and comparator groups. Adjusted effect estimates were pooled using a random-effects generic inverse variance method and absolute risk calculated.
Results
Seven cohort studies and 1 case–control study involving 6 212 322 pregnancies and 388 976 birth defects were identified reporting regression effect estimates. Compared to an unexposed population comparison, the association between ATD use during pregnancy and birth defects in offspring was: adjusted risk ratio (aRR) 1.16 95% confidence interval (CI) 1.08–1.25 for propylthiouracil (PTU); aRR 1.28 95%CI 1.06–1.54 for methimazole/carbimazole (MMI/CMZ); aRR 1.51, 95%CI 1.16–1.97 for both MMI/CMZ and PTU; and aRR 1.15 95%CI 1.02–1.29 for untreated hyperthyroidism. The excess risk of any and major birth defects per 1000, respectively, was: 10.2 and 1.3 for PTU; 17.8 and 2.3 for MMI/CMZ; 32.5 and 4.1 for both MMI/CMZ and PTU; and 9.6 and 1.2 for untreated hyperthyroidism.
Conclusions
When appropriately analysed the risk of birth defects associated with ATD use in pregnancy is attenuated. Although still elevated, the risk of birth defects is smallest with PTU compared to MMI/CMZ and may be similar to that of untreated hyperthyroidism.
What is already known about this subject
- Maternal antithyroid drug use during pregnancy has been associated with an increased risk of birth defects in offspring.
- Uncertainty remains on the size of this risk and how it compares to untreated hyperthyroidism due to methodological limitations of previous studies, including meta-analyses.
What this study adds
- Previously conducted meta-analyses have over-estimated the risk of birth defects associated with antithyroid drug use during pregnancy due to bias.
- Propylthiouracil use during pregnancy is associated with a smaller risk of birth defects compared to methimazole/carbimazole and may be similar in size as that observed with untreated hyperthyroidism.
1 INTRODUCTION
Hyperthyroidism affects between 0.1 and 0.4% of pregnancies and left untreated, may have detrimental maternal and foetal consequences including increasing the risk of preeclampsia, intrauterine growth restriction, preterm birth and maternal heart failure.1-3 The most common cause of hyperthyroidism in young women is Grave's disease, which results from stimulation of the thyroid by thyrotrophin receptor stimulating antibodies. This results in elevated free thyroxine (T4) and/or triiodothyronine (T3) with low thyroid-stimulating hormone (TSH), and typically requires treatment.4 In contrast, gestational transient hyperthyroidism resulting from thyroid gland stimulation by the human chorionic gonadotrophin hormone often requires no treatment.
Clinical guidelines recommending treatment of hyperthyroidism during pregnancy parallels that of nonpregnant women with use of antithyroid drugs (ATDs).5, 6 Commonly prescribed drugs for the management of hyperthyroidism in pregnancy are methimazole/carbimazole (MMI/CMZ) and propylthiouracil (PTU) that are considered equally effective. Both ATDs can cross the human placenta and has resulted in uncertainty about the risk of birth defects. PTU is the preferred ATD during the first trimester of pregnancy due to rare safety concerns surrounding the possible teratogenic effects of MMI/CMZ.7 The reported MMI/CMZ embryofoetopathy include aplasia cutis congenita (absence of a portion of skin, often localised on the head), craniofacial birth defects (choanal atresia; facial dysmorphism), defects of the abdominal wall and gastrointestinal tract (exomphalos, oesophageal atresia, omphalomesenteric duct anomaly), and ventricular septal defect.8 Consequently, in the first trimester of pregnancy clinical guidelines recommend switching from MMI/CMZ to PTU therapy in women with unplanned pregnancies and then using MMI/CMZ thereafter due to concerns of PTU hepatotoxicity.9 The role of PTU during pregnancy has been re-examined following reports of birth defects. These safety concerns recently led to updated warnings from the European Medicines Agency on the use of PTU and MMI/CMZ during pregnancy.10, 11
Two meta-analyses were recently published reporting that use of MMI/CMZ and PTU during pregnancy were associated with an increased risk of birth defects.12, 13 These meta-analyses used the number of cases and the number of people in the sampled population from observational studies to calculate pooled effect estimates for exposure to MMI/CMZ and exposure to PTU during pregnancy. By analysing data from observational studies as if they were trials, such estimates represent crude associations that are likely to be confounded.14 Heterogeneity between studies may also occur through the use of different study designs and further information from other types of comparator or reference groups may aid decision making. The aim of this study was to examine the association between the use of MMI/CMZ or PTU during pregnancy and the risk of birth defects in offspring, through systematic review and meta-analysis of data using more appropriate methods to support causal inference.
2 METHODS
A review of MEDLINE and EMBASE was conducted using a prespecified search strategy to identify all observational studies (cohort, case–control and sibling studies) assessing the risk of birth defects associated with the use of PTU or MMI/CMZ during pregnancy, published on or before 28 August 2020. The search strategy is reported in the online supplementary material. Titles and abstracts were screened, and full texts of relevant articles assessed for eligibility by 2 authors. Only English language publications and published data were included as we had no resources to translate articles. The systematic review was registered on the EU Register of Post-Authorisation Studies (EUPAS30990) and reported according to PRISMA (Preferred Reporting Items for Systematic Reviews).15
2.1 Inclusion and exclusion criteria
First, studies were included if they reported regression effect estimates, and the type of confounding adjustment and study design were described. Of these, meta-analysis was performed using only those studies reporting adjusted effect estimates for the association between ATDs and birth defects. Second, for estimating absolute risk difference (excess number of birth defects per 1000 pregnancies) studies were included if they reported birth defect rates from ATD exposed and from an untreated general population comparison group. Studies reporting birth defect rates among other selected populations of women were excluded.
2.2 Risk of bias
Methodological quality and risk of bias were evaluated for each study using the ROBINS-I tool.16 This tool is designed to assess the strengths and weaknesses of nonrandomised studies on the effects of interventions in relation to their risk of bias. The ROBINS-I tool covers 7 distinct domains, through which bias could be introduced: confounding, selection of participants into the study, classification of interventions, deviations due to intended interventions, missing data, measurement of outcome measures and selection of the reported result.
2.3 Data extraction
We extracted data from eligible studies for the following characteristics: study design; study population; sample size; exposure definition; type of comparator or reference group reported; and the accuracy and completeness of information on confounders. For each comparison, crude and adjusted regression effect estimates (odds ratios, hazard ratios or rate ratios) were identified with corresponding 95% confidence intervals (CIs). The outcome of interest were the risk of birth defects in children following MMI/CMZ exposure, PTU exposure, and both MMI/CMZ and PTU exposure during pregnancy.
2.4 Comparator groups and alternative study designs
Confounding by indication may occur when the underlying indication is associated with the outcome being studied. Using different comparator or reference groups may help to circumvent or minimise this potential confounding.17, 18 To examine the effect of different comparator groups on such confounding, regression effect estimates for the following prespecified comparator groups were sought:
- maternal exposure during pregnancy vs. all unexposed women (also called the unexposed population comparison group, and referred to as the classical comparison);
- maternal exposure during the prepregnancy period (discontinuers) vs. all unexposed women (this exposure may act as a negative control and should theoretically be noncausal);
- maternal exposure during pregnancy vs. an unexposed disease comparison group that may reduce confounding by indication or severity (in this context the direct effect of hyperthyroidism);
- siblings analyses with discordant prenatal medication exposure (this study design accounts for all time-fixed within-family confounding);
- paternal exposure during pregnancy vs. all unexposed women (this exposure may act as a negative control and should theoretically be noncausal); and
- an unexposed disease comparison group vs. all unexposed women (testing for confounding by indication).17, 18
2.5 Analysis
The characteristics of each study and method of confounding adjustment for studies reporting regression effect estimates were first described. The number of cases and total population from each study were then used to replicate the meta-analytical approach used by Li et al., which used a Mantel–Haenzel fixed-effect model.14 Adjusted effect estimates from all identified studies were then extracted, transformed on the natural log scale and pooled using the generic inverse variance method of analysis. Random-effects models were generated for each type of exposure and comparator group separately. When >1 study used the same data source, the largest study was initially selected with sensitivity analysis substituting this with other studies from the same data source. We also substituted effect estimates for studies that reported data for both any birth defect and for a subgroup related to CMZ/MMI. Furthermore, we performed a leave-1-out comparison with 2 studies that reported exposure as during pregnancy as opposed to all other studies that specifically measured first trimester exposure. Odds ratios from case–control studies and hazard ratios from cohort studies were combined because they closely approximate each other.19, 20 For reporting, pooled effect estimates are subsequently referred to as risk ratios (RR) throughout. Publication bias was assessed by testing for funnel-plot asymmetry using the Egger test. Analyses were conducted in Review Manager 5.3 (Copenhagen: The Nordic Cochrane Centre, The Cochrane Collaboration, 2014).
To estimate absolute risk differences, the rate of birth defects (any, major) from general unexposed population estimates was first extracted from each study and presented as the rate per 1000 live births. The pooled adjusted effect estimates from the meta-analysis were then used to calculate the absolute risk of birth defects (any, major) per 1000 live births for each exposure group of interest. The absolute risk difference was then calculated by subtracting the birth defect rate (any, major) in the unexposed general population from each exposure group.
3 RESULTS
The systematic review identified 15 potentially relevant studies. We included 8 studies reporting regression estimates to describe the types of confounding adjustment and study design, and excluded 7 studies without regression estimates (Figure 1). The characteristics of these studies are presented in Table 1 (included studies) and Table S1 (excluded studies).21-35 Of the included studies reporting effect estimates, 7 were cohort studies (involving 6 195 342 pregnancies and 388 864 birth defects)21-26, 28 and 1 was a case–control study (involving 16 980 pregnancies and 112 birth defects).27 Two of these studies were conducted using the same overlapping populations from Denmark.21, 24 Of the comparator groups reported, 5 studies reported adjusted effect estimates for the risk of birth defects associated with use of MMI/CMZ during pregnancy vs. an unexposed population comparison group (the classical comparison), and 1 study compared MMI/CMZ use to an unexposed disease comparison group (Table S2). The corresponding number of studies for PTU were 5 using a classical comparison and 2 using a disease comparison group respectively. For use of both MMI/CMZ and PTU during pregnancy (i.e. switching), 3 studies reported adjusted effect estimates for the classical comparison. No studies were identified that undertook a sibling study design, examined prepregnancy exposure or paternal exposure. However, 1 study used a comparator consisting of either prepregnancy and postpregnancy exposure.
Author, publication (Reference) | Type of study design and data source | Country | Study population | Type of birth defectb | Timing of exposure | Duration of follow-up in children | ATD exposurec (cases/total) | Unexposed women with hyperthyroidism (cases/total) | Unexposed population comparison group (cases/total) |
---|---|---|---|---|---|---|---|---|---|
Andersen 201921 | Cohort study, National registries and birth cohorts | Denmark | Live births | All birth defects and 8 birth defect subgroups diagnosed before age 2 y | 6 mo before pregnancy to gestational wk 10 | Up to 2 y |
PTU (74/889) MMI (151/1574) |
Unexposed to ATD (174/951) | Unexposed to ATD (77 791/1 159 181) |
Seo 201822 | Cohort study, Claims database | Korea | Live births | All birth defects and 10 birth defect subgroups | During the first trimester | Up to 1 y |
PTU (699/9930) MMI (91/1120) MMI/PTU (147/1841) |
- | Unexposed to ATD (170 716/2 872 109) |
Andersen 201723 | Cohort study, National registries | Sweden | Live births | All birth defects and 13 birth defect subgroups | 6 mo before pregnancy to the end of gestational wk 10 | Up to 2 y |
PTU (14/218) MMI (11/162) MMI/PTU (4/66) |
ATD prior pregnancy and after pregnancy (136/1551) | Unexposed to ATD (54 827/682 343) |
Andersen 201324 | Cohort study, National registries | Denmark | Live births and stillbirths | All birth defects and 8 birth defect subgroups | 6 mo before pregnancy to the end of gestational wk 10 | Up to 2 y |
PTU (45/564) MMI/CMZ (100/1097) MMI/PTU (16/159) |
ATD prior pregnancy and after pregnancy (190/3543) | Unexposed to ATD and without hyperthyroidism (45 982/811 730) |
Korelitz 201325 | Cohort study, Claims database | USA | Live births | Any birth defect | 6 mo before pregnancy to the end of pregnancy | Up to 1 y |
PTU (66/915) MMI (6/108) MMI/PTU (14/126) |
Thyrotoxicosis before/during/after pregnancy (390/5932) | Unexposed to ATD and without hyperthyroidism (37 351/634 858) |
Yoshihara 201226 | Cohort study, Hospital database | Japan | Live births and perinatal losses of women with Graves' disease | Major birth defects | During the first trimester | 1st visit postdelivery |
PTU (26/1578) MMI (50/1426) |
Graves' disease (40/2065) | - |
Chen 201127 |
Case–control Study, Claims database |
Taiwan |
Live births of women with hyperthyroidism and age matched controls without |
Major birth defects | During pregnancy | Not reported |
PTU (5/630) MMI (0/73) |
Women with hyperthyroidism (15/2127) | Unexposed to ATD and euthyroid (92/14 150) |
Rosenfeld 200928 |
Prospective cohort study by Teratology Information Service |
Israel |
Live births and elective terminationsa Controls exposed to nonteratogens |
Major birth defects | Gestational wk 4–13 | 0–6 y | PTU (1/80) | - | Exposed to nonteratogenic drugs (34/1066) |
- ATD = antithyroid drug; CMZ = carbimazole; MMI = methimazole; PTU = propylthiouracil.
- All studies report adjusted effect estimates except Rosenfeld 200928 that only reports unadjusted effect estimates.
- a Elective terminations of pregnancy due to prenatally diagnosed anomalies.
- b Birth defects due to genetic disorders included.
- c Andersen 2013: MMI/CMZ followed by PTU (n = 149) or PTU followed by MMI/CMZ (n = 10). Andersen 2017: MMI/CMZ followed by PTU (n = 64) or PTU followed by MMI/CMZ (n = 2). Korelitz 2013. Seo 2018: switching order not specified. Chen 2011: 516/630 and 65/73 received PTU and MMI in the first trimester, respectively.
3.1 Meta-analysis
The meta-analytical approach used by Li et al. was first replicated using the number of reported cases and total population with a Mantel–Haenzel fixed-effect model. When analysed in this way, maternal MMI/CMZ exposure and PTU exposure during pregnancy was associated with a significantly increased risk of birth defects in offspring compared to the classical reference group of all unexposed women (RR 1.20 95%CI 1.02–1.42 for PTU; RR 1.64 95%CI 1.39–1.92 for MMI/CMZ; RR 1.83 95%CI 1.30–2.56 for MMI/CMZ and PTU use, Figure S1).
When only adjusted effect estimates were pooled using the inverse variance method of analysis, maternal use of MMI/CMZ during pregnancy was associated with a smaller but still significantly increased risk of birth defects compared to all unexposed women (random-effects adjusted RR [RaRR] 1.28 95%CI 1.06–1.54, Figure 2). This was similarly the case for use of PTU (RaRR 1.16 95%CI 1.08–1.25) and exposure to both MMI/CMZ and PTU during pregnancy (RaRR 1.51 95%CI 1.16–1.97).
The risk of birth defects among unexposed women with hyperthyroidism compared to unexposed women without hyperthyroidism was also significantly elevated (RaRR 1.15 95%CI 1.02–1.29, Figure 3). When PTU exposure was compared to unexposed women with hyperthyroidism reported in 2 studies, PTU exposure during pregnancy was not associated with an increased risk of birth defects (RaRR 0.72 95%CI 0.47–1.11, Figure 3). In contrast, only 1 study reported this comparison for MMI/CMZ that was significantly elevated (RR 2.28, 95%CI 1.54–3.33).
3.2 Sensitivity analyses
Two studies were conducted using the same data source from Denmark but covering different time periods.21, 24 When Andersen et al. (2019)21 was substituted for Andersen et al. (2013),24 pooled effect estimates for MMI/CMZ exposure and PTU exposure were RaRR 1.28 (95%CI 0.96–1.71) and RaRR 1.17 (95%CI 1.09–1.26) respectively. Two studies also reported effect estimates for subgroups of birth defects and exposure to CMZ/MMI and 1 study for PTU (Table S3).21, 23 When effect estimates for subgroups of birth defects were substituted, pooled effect estimates for exposure to MMI/CMZ was RaRR 1.37 (95%CI 0.98–1.91) and RaRR 1.17 (95%CI 1.09–1.26) for PTU (Figure S2). The results of the leave-1-out analysis for 2 studies measuring ATD exposure during pregnancy rather than specifically as first trimester exposure were similar to the main results (Table S4).
3.3 Absolute risk
For calculating overall absolute risk differences, we included 4 studies for any birth defects and 3 studies for major birth defects (Table 2). The overall rate of birth defects in unexposed women was 63.7 per 1000 live births for any birth defect and 8.2 per 1000 for major birth defects (Table 2). The absolute risk difference for any birth defect compared to the unexposed population comparison group per 1000 live births was estimated at: 9.6 in women with unexposed hyperthyroidism; 10.2 in women treated with PTU; 17.8 in women treated with MMI/CMZ; and 32.5 in women treated with both MMI/CMZ and PTU during pregnancy (Table 2). Corresponding numbers for major birth defects per 1000 live births were 1.2, 1.3, 2.3 and 4.1.
Study (reference) | Country | Total number of pregnancies in the unexposed population comparison group | Birth defect rate in unexposed women in the unexposed population comparison group (per 1000 live births) | Estimated excess number of birth defects (per 1000 live births) | |||
---|---|---|---|---|---|---|---|
Unexposed hyperthyroidism | PTU exposure | MMI/CMZ exposure | MMI/CMZ and PTU exposure | ||||
Any birth defect | |||||||
Andersen 201921 | Denmark | 1 159 181 | 67.1 | 10.1 | 10.7 | 18.8 | 34.2 |
Seo 201822 | Korea | 2 872 109 | 59.4 | 8.9 | 9.5 | 16.8 | 30.3 |
Andersen 201723 | Sweden | 682 343 | 80.4 | 12.1 | 12.9 | 22.5 | 41.0 |
Andersen 201324 | Denmark | 811 730 | 56.7 | 8.5 | 9.1 | 15.9 | 28.9 |
Korelitz 201325 | USA | 634 858 | 58.8 | 8.8 | 9.4 | 16.5 | 30.0 |
Overall* | 5 001 040 | 63.7 | 9.6 | 10.2 | 17.8 | 32.5 | |
Major birth defect | |||||||
Chen 201127 | Taiwan | 14 150 | 6.5 | 1.0 | 1.0 | 1.8 | 3.3 |
Lian 200532 | China | 22 765 | 9.3 | 1.4 | 1.5 | 2.6 | 4.7 |
Momotani 198435 | Japan | 350 | 2.9 | 0.4 | 0.5 | 0.8 | 1.5 |
Overall | 40 095 | 8.2 | 1.2 | 1.3 | 2.3 | 4.1 |
- * Overall rate for any birth defect calculated using Andersen 201921 and excluding Andersen 201324 due to the overlapping population.
- ** Yoshihara 2012,26 Rosenfeld 2009,28 Hawken 2016,29 Lo 2015,30 Clementi 2010,31 Gianantonio 200133 and Wing 199434 were excluded from the calculation of overall rate estimates and absolute risk differences as they did not report baseline birth defect rates from an unexposed general population comparison group. Exposure not only restricted to the first trimester (Korelitz 2013, Chen 2011).
3.4 Assessment of confounding factors and risk of bias
Of the 8 studies, 6 adjusted for maternal age, and 3 for each of the following: the infant's sex; year of birth; parity; and pregnancy type (Table S5). Adjustment for maternal physical history and smoking status occurred in 3 and 2 studies respectively. Adjustment of maternal physical history consisted mainly of metabolic conditions including diabetes and hyperlipidaemia. No studies adjusted for alcohol use during pregnancy and only 1 study adjusted for any other type of medication use during pregnancy, namely antiepileptic use.
For the classical comparison using all unexposed women, the risk of bias varied according to the domain studied and was influenced by variation in the type of confounding adjustment undertaken and uncertainty around exposure and outcome ascertainment bias (Table S6). The Egger test for MMI/CMZ exposed vs. all unexposed women (P = .020), PTU exposed vs. all unexposed women (P = .005) and PTU and MMI/CMZ exposed vs. all unexposed women (P = .027) were statistically significant for funnel plot asymmetry, although the number of included studies evaluating this were limited.
4 DISCUSSION
Meta-analysis is a powerful tool that combines estimates from multiple studies to improve power and precision, whilst allowing questions to be answered that are limited by individual studies. Meta-analysing data from observational studies examining the risk of ATD exposure during pregnancy as if they were trials may overestimate risk, which attenuates when more appropriate methods are used. However, observed associations between birth defects and exposure to PTU and/or MMI/CMZ were still elevated. Pooled effect estimates for women using PTU and unexposed women with hyperthyroidism were similar in size, whilst the highest risk estimates were observed in the groups switching between MMI/CMZ and PTU.
PTU is typically considered the safer option when hyperthyroidism requires treatment during the first trimester of pregnancy. However, safety concerns surrounding PTU exposure and the risk of hepatic toxicity and birth defects have emerged leading to regulatory updates to the product information in 2011 and 2019 respectively.9-11 Our findings for PTU exposure during pregnancy are reassuring as they suggest that it does not increase the risk of birth defects beyond that of the underlying maternal disease, but potentially has the advantage of reducing maternal morbidity associated with untreated hyperthyroidism according to previous research.3
The highest risk of birth defects was associated with use of both MMI/CMZ and PTU during pregnancy that corresponds to women who switch from 1 product to the other. The effects of switching can be difficult to interpret with these data as the order of switching was not always specified. However, in the majority of instances, switches appeared to be appropriate as most changed from MMI/CMZ to PTU during the first trimester of pregnancy. Given current guideline recommendations, this could represent a group more likely to include women with unplanned pregnancies that may differ in terms of lifestyle and early folic acid use for example. Whilst we cannot exclude the possibility that confounding by the severity of hyperthyroidism plays a role we would expect treated women to become euthyroid as their disease is controlled with ATD medication.
Our analysis demonstrates that using adjusted effect estimates is important to appropriately account for measured confounding factors and that risk estimates from meta-analyses based upon crude numbers may be of limited value for clinical and regulatory decision making. We recommend avoiding this method of meta-analysis for nonrandomised studies when adjusted effect estimates from observational studies are available. This is particularly important when such estimates are used to calculate absolute risk. In this regard, individual observational studies should be properly designed and conducted to provide the appropriate primary data, including being adequately powered to reduce the risk of a type-II error.
This study has several potential limitations. First, although accounting for confounding attenuated the observed associations, the included studies were heterogeneous in their approach to confounding adjustment suggesting that residual confounding remains possible. First trimester exposure was specifically evaluated in most but not all studies. However, the leave-1-out analysis produced similar results suggesting that these estimates are relevant to first trimester exposure. Studies were also heterogeneous in the outcome definition with some studies measuring any birth defects (also minor) whilst other focused on major birth defects. The baseline birth defect rate was as high as 8% in 1 study, probably due to inclusion of neonatal anomalies that according to the EuroCat classification system are minor anomalies for exclusion.23, 36 Heterogeneity may have also resulted from studies using different types of data source. Other features that could have affected the risk estimates include not excluding offspring with birth defects due to genetic disorders. This may introduce bias if women with hyperthyroidism/Graves disease have an increased risk of having children with birth defects of genetic origin.37 Moreover, ascertainment bias related to the outcome is expected as clinicians may scrutinize children of women with hyperthyroidism to a greater extent than children of healthy women, especially after the signals of ATD teratogenicity. Although this could have a larger effect on studies with shorter follow-up to detect outcomes we observed no obvious impact of this with the included studies. For these reasons, it is important to consider using an unexposed disease comparison group as an alternative comparator to better infer causality.17, 18 Notably no sibling study designs have been reported despite the potential advantage of controlling for shared familial confounding by design.
The majority of included studies assessed only birth defects in live births, which will miss defects due to elective termination, miscarriages or stillbirths. Conditioning on live births may give rise to collider bias if both exposure and outcome are associated with survival.38 Despite the advantages of having an unexposed disease comparison group, this approach may not account for differences in the severity or type of hyperthyroidism. It is likely that transient gestational hyperthyroidism may represent a significant proportion of untreated cases whereas Graves disease will probably be treated. Although we demonstrated comparable risk from PTU exposure and untreated hyperthyroidism, the overall number of studies was small and further observational studies may be useful, including those that explore the relationships between different comparator groups and study designs. We also cannot exclude the possibility of publication bias. Finally, this study examines birth defects as a composite outcome and data may not be generalizable to subtypes of birth defects such as MMI embryofoetopathy. However, whilst MMI/CMZ may be associated with a larger risk of birth defects not all studies consistently reported an elevated risk demonstrating that some uncertainty about this association remains. For example, in a large study from Sweden there were no cases of aplasia cutis and choanal or oesophageal atresia that have described as being the MMI embryopathy.23 The only study that was sufficiently powered to assess this birth defect pattern, however, found 5 additional cases of MMI embryopathy per 10 000 after first trimester exposures to ATDs compared to nonexposed (0.12 vs. 0.07%).22 This was also the only study to report a dose–response relationship and observed larger risk estimates for congenital anomalies with high cumulative doses of CBZ/MMI but not with PTU.
Future studies should evaluate the association between ATD and birth defects using comparable definitions of exposure and outcomes aligned with current recommendations and standards.36, 39 Estimates should be reported separately for CMZ/MMI and PTU rather than by ATD group that is still evaluated.40 Major groups of birth defects should also be investigated separately rather than any birth defect, and minor birth defects should preferably be studied as part of malformation patterns. Future studies should also consider incorporating additional types of comparators or reference groups in their designs (such as untreated hyperthyroidism or prepregnancy ATD exposure), ensure data sources have reasonable power to study the birth defects of interest, and consider the use of sibling study designs to address familial confounding. Techniques to quantify residual confounding for example by severity of maternal hyperthyroidism and unmeasured confounders could also be useful.18
Assuming MMI/CMZ and PTU are equally effective, our findings suggest that PTU use is the safer option in terms of the risk of birth defects in offspring, and that untreated hyperthyroidism may also carry a risk. This suggests that the net benefit of managing hyperthyroidism during the first trimester of pregnancy is in favour of using PTU when clinically indicated. However, such use must also be balanced against other potential risks with PTU such as hepatoxicity and the limitation of current observational studies.
5 CONCLUSION
Previous meta-analyses examining the risk of maternal ATD use during pregnancy overestimate the risk of birth defects in offspring. When appropriately analysed this risk is attenuated. Although still elevated, the risk of birth defects is smallest with use of PTU during pregnancy.
ACKNOWLEDGEMENTS
We thank the reviewers of the British Journal of Clinical Pharmacology for the valuable input and thorough review of our manuscript. No funding was obtained to conduct this study. H.N. is supported by the European Research Council Starting Grant “DrugsInPregnancy” grant number 639377. D.M. is supported by a Wellcome Trust Clinical Research Development Fellowship (Grant 214588/Z/18/Z).
COMPETING INTERESTS
The authors have no conflicts of interest in connection with this article. D.M. and H.N. are current independent scientific expert members of the European Medicines Agency Pharmacovigilance Risk Assessment Committee.
CONTRIBUTORS
D.M. and L.F. collected the data and performed the analysis with guidance from H.N. D.M. is the guarantor for the study. All authors contributed to the study design, interpretation of results, writing the manuscript and approved the final draft.
Open Research
DATA AVAILABILITY STATEMENT
All data are either contained within the manuscript or can be accessed via the referenced publications.