Cite this asTargovnik HM, Citterio CE, Siffo S, Rivolta CM (2016) Advances and Perspectives in Genetics of Congenital Thyroid Disorders Associated with Thyroglobulin Gene Mutations. Open J Biol Sci 1(1): 062-070. DOI: 10.17352/ojbs.000006
Dyshormonogenesis due to thyroglobulin (TG) gene mutations is a rare cause of congenital hypothyroidism with an estimated incidence of approximately 1 in 100,000 newborns. The TG gene is organized in 48 exons, spanning over 270 kb on human chromosome 8q24. The human TG mRNA is 8.5 Kb long and the preprotein monomer is composed of a 19 amino acids signal peptide followed by a 2749 residues polypeptide. Until now, one hundred seventeen deleterious mutations in the human TG gene have been identified and characterized, originating structural changes in the protein that alter the normal protein folding, assembly and biosynthesis of thyroid hormones: 19 splice site mutations, 23 nonsense mutations, 57 missense mutations, 13 deletions (9 single nucleotide deletions, 2 multiple nucleotide deletions and 2 involving a large number of nucleotides), 4 single nucleotide insertions or duplication and 1 imperfect DNA inversion. The p.R277*, p.R1511*, p.A2215D, p.R2223H and p.R2317* mutations are the most frequently identified TG mutations in Caucasian population, whereas c.274+2T>G, p.C1058R, p.C1245R and p.C1977S are the most common mutations in Asian population.
TG mutations are inherited in an autosomal recessive manner and affected individuals are either homozygous or compound heterozygous for gene mutations and the parents should be carriers of one the TG mutation.
New approaches including the use of new sequencing technology, will eclipse traditional methods of detecting mutations and will allow the quick identification of mutations in remote regions as well as the detection of coexistence of multiple mutations in the same gene or in different thyroid genes.
Thyroglobulin (TG) is a structural, ancestral and secretory protein with high specificity for the biosynthesis of thyroid hormones. Its main function is to provide the precursor for synthesis and storage of thyroid hormones [1-4]. It is also an important storage of iodine when external iodine availability is limited. Biosynthesis of thyroid hormones requires the integrity of a complex protein system, several sequential steps and is critically dependent upon the native three-dimensional structure of TG [1-4]. The central steps in thyroid hormone synthesis take place at the cell–colloid interface of follicular thyroid cells [1-4]. Iodine is covalently bound to Tyr amino acids within TG. Correctly folded TG homodimers are secreted into the follicular lumen where coupling between either two diiodotyrosine (DIT) residues, or between a DIT and a monoiodotyrosine (MIT) residues, results in the formation of 3,5,3´-triiodothyronine (T3) and 3,5,3´,5´-tetraiodothyronine (T4) [1-4]. The iodination and coupling reactions are mediated by thyroid peroxidase (TPO) with a source of hydrogen peroxide [1-4]. The H2O2 generation system of the thyroid involves a metabolic pathway which includes the dual oxidase 1 and 2 (DUOX1, DUOX2), and DUOX maturation factor 1 and 2 (DUOXA1, DUOXA2) proteins. The mature TG molecules remain in the lumen of thyroid follicles. Afterwards, TG is subjected to proteolysis, MIT and DIT are subsequently deiodinated by the iodotyrosine dehalogenase [1-4]. Thyroid gland produces predominantly T4 together with a small amount of the T3. The peripheral metabolism of thyroid hormone is determined by the action of the D1, D2 and D3 selenodeiodinases that catalyze the interconversion of T4 in T3 .
Thyroid hormones are essential for normal central nervous system development [6,7]. Untreated congenital hypothyroidism result in irreversible mental delay and short stature [6,7]. For over three decades, mutations in the human TG gene have been identified associated with congenital goiter [8-53] and also endemic and nonendemic goiter [54-56]. The clinical spectrum ranges from euthyroidism to permanent severe hypothyroidism [1,2,6,7]. Phenotypic variations among patients with the same mutations have been observed. In addition to iodine deficiency, other environmental and genetic factors may contribute to clinical variability.
In this review we summarize the most recent data related to thyroid disorders caused by mutations in the TG gene and provide data that have an impact on the disease management as well counseling benefits for the patients and their families.
Congenital hypothyroidism (CH) is the most frequent endocrine disease in infants, with prevalence of 1:2000 - 1:3000 newborns and is characterized by high levels of thyroid-stimulating hormone (TSH) as a consequence of reduced thyroid function [1,2,6,7]. It is also one of the most common preventable causes of cognitive and motor deficits. Prevention of CH is based on carrier identification, genetic counseling and prenatal diagnosis. In neonates a complete diagnosis of CH should include clinical examination, biochemical thyroid tests, thyroid ultrasound, radioiodine or technetium scintigraphy and perchlorate discharge test (PDT) [6,7]. In the last three decades, considerable progress has been made in identifying the genetic and molecular causes of CH. Knowing the prevalence of the mutations present in each population will facilitate greatly the molecular genetic testing. The classification based on the genetic alterations divides CH into two main categories caused: (a) by disorders of thyroid gland development (dysembriogenesis or thyroid dysgenesis group) or (b) by defects in any of the steps of thyroid hormone synthesis (dyshormonogenesis group) [1,2,6,7]. The dysembryogenesis or thyroid dysgenesis group, which accounts for the 80-85 % of the cases, results from a thyroid gland that is completely absent in orthotopic or ectopic location (agenesis or athyreosis), severely reduced in size but in the proper position in the neck (orthotopic hypoplasia) or located in an unusual position (thyroid ectopy) at the base of the tongue or along the thyroglossal duct [1,2,6,7]. In only 5% of the patients, CH is associated with mutations in genes responsible for the development or growth of thyroid cells: NKX2.1 (also known as TTF1 or T/EBP), FOXE1 (also known as TTF2 or FKHL15), paired box transcription factor 8 (PAX-8), NKX2.5, and TSHR genes [1,2,6,7]. Epigenetic mechanisms leading to stochastic variations in the expression of multiple loci could be responsible for the sporadic characteristic of thyroid dysgenesis. Consequently, the genetic mechanisms underlying the defects in thyroid organogenesis in the majority of the cases remain to be elucidated.
Dyshormonogenesis, which accounts for the remaining 15-20% of the cases, has been linked to mutations in the SLC5A (Na+/I_ symporter, NIS) , SLC26A4 (Pendrine, PDS) , TPO , dual oxidasa2 (DUOX2), DUOX maturation factor 1 and 2 (DUOXA1 and DUOXA2) [60,61], iodotyrosine dehalogenase 1 (DEHAL1)  and thyroglobulin (TG) [1-4,8-56] genes. These mutations produce a heterogeneous spectrum of congenital hypothyroidism, with an autosomal recessive inheritance. Thereafter, the patients are typically homozygous or compound heterozygous for the gene mutations and the parents, carriers of one mutation.
Dyshormonogenesis due to TG gene mutations is a rare cause of CH with an estimated incidence of approximately 1 in 100,000 newborns [1,2]. The patients with TG synthesis defects presents a congenital goiter or goiter appearing shortly after birth, clinical spectrum ranges from euthyroid to mild or severe permanent hypothyroidism, high iodide uptake, normal organification of iodide (negative PDT), elevated serum TSH with simultaneous low or normal serum T4 and T3 levels, and low serum TG concentration [1-2,8-53]. The presence of very low TG level and also negative PDT in a goitrous individual are the basis for the selection of patients for molecular studies in the TG gene . Patients with iodotyrosine dehalogenase deficiency will also develop goiter with hypothyroidism, when dietary iodide is limiting. In these patients the PDT does not show increased release of radioiodine after administration of the competitor, indicating that the organification process itself is not affected, whereas the serum TG levels are frequently elevated [2,62]. Patients with an iodide transport defect by mutations in SLC5A gene have a normal-sized or somewhat enlarged thyroid gland, elevated plasma TG levels and no radio-iodide uptake [2,57]. Iodide organification defects are associated with mutations in the TPO, DUOX2, DUOXA2 or SLC26A4 genes and characterized by a positive PDT [2,59-61]. Mutations in SLC26A4 gene cause Pendred syndrome characterized by congenital sensorineural hearing loss and goiter without or with hypothyroidism [2,58].
Human TG gene is a single copy gene of 270 kb long that maps on chromosome 8q24 and contains an 8,459-8,468 nucleotides coding sequence (GenBank Accesion Number: NM_003235.4) divided into 48 exons (Figure 1a) [63-71]. TG gene expression is stimulated by TSH through the modulation of the intracellular level of cyclic adenosine monophosphate (cAMP) [1-3]. TSH exerts its function via a G protein-coupled receptor, the TSH receptor (TSHR), which relies on the associated G protein to transmit and amplify the signal inside the cell [1-3]. Transcription of the TG gene is under control of the coordinated action of a master set of transcription factors that includes the NKX2.1, FOXE1 and PAX8, by their binding to the TG promoter on their respective consensus sequences [1-3].
TG is a large homodimeric secretory protein (660 kDa) with a high degree of glycosylation [1-4]. The human TG mRNA codes for a polypeptide chain of 2767 amino acids [63-65]. The monomeric TG preprotein has a leader peptide of 19 amino acids followed by 2748-amino-acid polypeptide (Figure 1b) [63-65]. Four hormonogenic acceptor tyrosine residues have been identified and localized at positions 5, 1291, 2554 and 2747 in human TG (Figure 1b) [1-4,72,73]. Each TG monomer contains 67 Tyr and 122 Cys residues, representing 2.44% and 4.44% of the total amino acids, respectively [63-65].
The internal protein organization makes TG an example of gene evolution by intragenic duplication events and gene fusions. The TG protein is composed of four structural and functional regions (Figure 1b). The N-terminal and the central part of the monomer includes three types of repetitive motifs, called TG type-1, TG type-2, and TG type-3, organized in three regions (I, II and III) (Figure 1b) [1-4,74,75], are Cys-rich repeat domains covalently bound by disulfide bonds. TG monomer contains eleven elements of type-1 repetitive motif located between positions 12 - 1191 and between 1492 - 1546; three elements of type-2 repetitive motif located between amino acids 1437 and 1484, and five elements of type-3 repetitive motif between residues 1584 and 2168 [1-4,74,75]. Interestingly, type-1 repeats could function as binder and reversible inhibitors of protease and have been found as parts of six architecturally distinct protein groups: testicans, secreted modular calcium binding protein (SMOCs), trops, splice variant of the major histocompatibility complex class II–associated invariant chain, insulin-like growth factor–binding protein (IGFBP) and nidogen . Each repeat TG type-1 is composed of approximately 60 amino acids, in which the positions of Cys, Pro and Gly residues are highly conserved. Some insertions of variable length are found in fixed positions. Region I comprises 10 of the 11 TG type-1 repeats, a linker, between repeat 1–4 and 1–5, and hinge segments (Figure 1b). Region II contains the 3 TG type-2 repeats and the 11th TG type-1 repeat (Figure 1b), whereas region III contains the five TG type-3 repeats (Figure 1b) [1-4,74,75]. The fourth region located in the carboxy-terminal, between residues 2192 to 2716, is a nonrepetitive domain that shows significant homology with the acetylcholinesterase (ACHE), named the ACHE-like or CheL domain (Figure 1b) [1-4,77-80]. ACHE-like domain is required for protein dimerization and consequently plays a critical structural and functional role in the TG protein that is essential for intracellular transport of TG to the site of its hormonogenesis . This region functions as an intramolecular chaperone and as a molecular escort for TG regions I, II, and III .
After translation, intensive postranslational processes take place in the ER, Golgi aparatus, apical membrane and folicular lumen which include homodimers assembly, glycosylation, sialylation, sulfation, phosphorylation, iodination, multimerization and the formation of intrachain disulfide bonds [1-4]. Several ER enzymes and molecular chaperones, such as calnexin (CNX), calreticulin (CRT), GRP94, BiP, Protein Disulfide Isomerase (PDI), ERp57, ERp29, and ERp72 interact, both concurrently and sequentially, with TG during its folding and assembly and may serve to prevent export of improperly folded TG proteins [1-4].
Molecular diagnosis of TG deficiency has been traditionally established by PCR-based approaches followed by systematic sequence analysis. Although these classical screenings are capable of detecting more than 90% of all point mutations, they do not detect large noncoding intragenic rearrangements or heterozygous deletions. The first-described human mutation causing a TG defect associated with CH was the mutation g.IVS3-3C>G . Subcloning and sequencing of the cDNA fragments revealed that exon 4 is missing from the major TG transcript in the goiter . Removal of exon 4 does not modify the reading frame of TG mRNA. However, exon 4 encodes tyrosine 130 which has been proposed as an important donor tyrosine involved in the synthesis of thyroxine, after coupling with the major acceptor tyrosine at position 5 . The loss of the tyrosine 130 provides a coherent explanation to the hypothyroid status of the patient . To date, one hundred seventeen deleterious mutations in the human TG gene have been identified and characterized: 19 splice site mutations, 23 nonsense mutations, 57 missense mutations, 13 deletions (9 single nucleotide deletions, 2 multiple nucleotide deletions and 2 involving a large number of nucleotides), 4 single nucleotide insertions or duplication and 1 imperfect DNA inversion (Table 1) [8-56]. The p.R277* [12,16,18,22,23,28,29,30,33,34,40,42,53], p.R1511* [9,16,23,32], p.A2215D [22,26,28,29,42], p.R2223H [14,29,31], p.R2317* [29,38,42,52] mutations are the most frequently identified TG mutations in Caucasian population, whereas c.274+2T>G [27,48,49], p.C1058R [17,20], p.C1245R [11,15,17,20,24,36] and p.C1977S [11,15,17,20] are the most common mutations in Asian population.
Exon skipping in the TG gene can be caused by nucleotide substitutions or deletion in acceptor or donor splice sites involving the -3/-2/-1 (c.275-3C>G, c.6563-2A>G, c.2762-1G>A, c.6200-1G>C, c.7998-1G>A) or +1/+2/+3/+4/+5/+6 position (c.638+1G>A, c.745+1G>A, c.4932+1G>C, c.5686+1G>T, c.5686+1G>A, c.5686+1G>C, c.6262+1delG, c.6876+1delG, c.274+2T>G, c.7036+2T>A, c.7862+2T>A, c.4159+3_+4delAT; c.3433+3_+6delGAGT, c.638+5G>A), respectively (Table 1) [8,10,13,16,19,20,26-28,30,36,39,40,44,46,48,49,53]. Recently, two exonic cryptic 5’splicing sites in exons 6 (c.745+1G>A)  and 19 (c.4159 + 3_+4delAT)  of the TG gene have been identified. The elimination of exons in the TG gene by aberrant splicing results in an altered ability to transfer an iodophenoxyl group from the donor site to the acceptor iodotyrosine.
The 23 inactivating mutations that generates truncated proteins have been localized in exons 4 (p.Y107*, p.R140*) 7 (p.R277*), 9 (p.R432*, p.S509*, p.Q611*, p.W618*, p.Q636*, p.Q692*), 10 (p.Q717*, p.Q752*, p.R768*, p.Q810*), 13 (p.C1032*) 20 (p.W1418*), 22 (p.R1511*), 27 (p.Q1765*, p.Q1777*), 31 (p.Y1903*) 37 (p.Q2142*), 40 (p.R2317*), 46 (p.Q2638*) and 47 p.R2688*) of the TG gene (Table 1) [9,12,16,18,20,22,23,27-29,30,32-34,36-38,40-42,45,47,49,51-53]. The p.R277X by p.R277* mutation is the most frequently reported mutation in the TG gene and affected individuals have either homozygous or compound heterozygous mutations. This mutation has been found in families from Brazil, Argentina, Galicia and France (Table 1) [12,16,18,22,23,28-30,33,34,40,42,53]. The functional consequences of p.R277X by p.R277* truncated protein are a complete loss of the central and carboxy-terminal hormonogenic domains and consequently, limited ability to generate thyroid hormone. However, p.R277X by p.R277* TG peptide retains its ability for T4 synthesis because it still harbors both the acceptor Tyr 5 and the donor Tyr 130. The p.R277X by p.R277* mutation was identified in members of unrelated families with history of CH from Brazil, Argentina and France (Table 1) [9,16,23,32]. The p.R277X by p.R277* mutation is removed from the transcripts by exon skipping using an alternative splicing . The elimination of mutated exon 22 in the pre-mRNA restores the reading frame allowing translation to reach the normal stop codon and results in an in-frame deletion of 57 amino acid residues . The c.886C>T and c.4588C>T nonsense mutations occur in a CpG dinucleotide sequence and could be caused by deamination of a methylated cytosine resulting in a thymine. The CGA arginine codon is considered a hot spot for mutations in mammalian DNA. Truncated protein can be also caused by nucleotide deletions (Table 1) [14,20,27,35,36,42,49,53,56] and insertions or duplication (Table 1) [22,33,36,39] in the TG gene. Recently, genetic analysis using an Inverse-PCR (I-PCR) -based approach in three brothers of Turkish origin born from consanguineus parents and affected by CH, goiter and low levels of serum TG, showed a DNA inversion of 16,962 bp in the TG gene associated with two deleted regions at both sides of the inversion limits . The inversion region includes the first 9 bp of exon 48, 1015 bp of intron 47, 191 bp of exon 47, 1523 bp of intron 46, 135 bp of exon 46 and the last 14,089 bp of intron 45 . The proximal deletion corresponds to 27 bp of TG intron 45, while the distal deletion spans the last 230 bp of TG exon 48 and the first 588 bp of intergenic region downstream TG end . The parents were heterozygous carriers of the complex rearrangement.
Sequencing analysis of the TG gene revealed sixteen missense mutations that involved wild-type Cys residue: p.C141S, p.C164Y, p.C175G, p.C707Y, p.C1058R, p.C1245R, p.C1262Y, p.C1474Y, p.C1491F, p.C1588F, p.C1878Y, p.C1885G, p.C1977S, p.C1981W, p.C1987Y and p.C2135Y (Table 1) [11,15,17,20-22,24,36,42,49,53]. The loss of Cys residues can eliminate disulfide bonds and alter the normal conformational structure of the TG, possibly preventing the interaction of hormonogenic acceptor and donor sites.
The first report of a missense mutation in the ACHE-homology domain of TG in humans was observed in a French family with two affected siblings with congenital goitrous hypothyroidism . A fetal goiter was diagnosed in both patients by ultrasound at the sixth month of gestation . Percutaneous umbilical vein blood sampling was carried out under ultrasound guidance showing severe fetal hypothyroidism. The sequencing analysis showed a new compound heterozygous mutations, the mutation p.R2223H located in the ACHE-homology domain is associated with a mutation at nucleotide position 1143 in exon 9 (p.362fsX382) . Later 8 new missense mutations were reported in the ACHE homology domain: p.A2215D, p.G2300D, p.R2317Q, p.G2355V, p.G2356R, p.L2528Q, p.R2566W and p.W2666L (Table 1) [17,20-22,24,26,28,29,42,49,50,53]. Functional analysis suggests that the p.A2215D mutation results in retention of the TG protein inside the ER and degradation via the proteasome system , as already observed in the cog/cog congenital goiter mouse and the WIC-rdw non-goitrous CH rat [81-83]. ER quality control system prevents misfolded TG protein export from ER to Golgi and consequently fails to be transported to the site of thyroid hormone synthesis . These gives rise to a distention of ER, abnormality called as ER-Storage Disease (ERSD) . Consequently, misfolded TG proteins are degraded by the ER-associated degradation (ERAD) pathway.
TG mutation have been described in Afrikander cattle , Dutch goats , cog/cog mouse , WIC-rdw rats [82,83] and Wistar Hannover GALAS rats .
The congenital goiter of Afrikander cattle is an autosomal recessive disease characterized by a TG synthesis defect . The inactive mutation is a c.2146C>T in exon 9 that generates a stop codon at amino acid position 697 (p.R697X by p.R697*) . The nonsense mutation is thus removed from the transcripts by exon skipping, and there is a preferential accumulation in the goiter of a TG mRNA lacking exon 9 . The original reading frame is maintained in the alternative spliced mRNA, which, as it lacks the mutated exon, is translatable into a potentially functional protein and preexisting as a minor mRNA species in normal animals .
An inbred Dutch goat strain with congenital hypothyroidism and goiter was extensively studied by de Vijlder et al . The hereditary TG synthesis defect in Dutch goats is caused by a c.945C>G that changes a triplet TAC coding for Tyr in exon 8 into a triplet TAG giving a stop codon (p.Y296X by p.Y296*) .
The cog/cog trait originally appeared as a spontaneous autosomal recessive phenotype in the inbred AKR/J strain of mouse . Congenital hypothyroidism with goiter was observed in the cog/cog mouse, suggesting a defect of the TG synthesis . Kim et al. identified a missense mutation, contained within the ACHE-like domain of the TG coding sequence, as the molecular basis for congenital hypothyroid goiter in the cog/cog mouse . Direct sequencing of the cog TG cDNA showed a c.6848T>C mutation), generating the p.L2263P mutation in the mature cog TG protein . Transient expression of the proteins indicated that cog TG exhibits a severe defect in the exit from the ER, whether the correction of this missense mutation restores the normal TG secretion .
The WIC-rdw rat is a hereditary hypothyroid variant derived from the Wistar-Imamichi strain [82,83]. In contrast to human patients and others animal models of congenital hypothyroidism, the WIC-rdw rat presents a hypoplastic thyroid gland, despite elevated circulating levels of TSH and reduced serum T3 and T4 [82,83]. The possibility that the mutant proteins may be cyto-toxic for thyroid growth and proliferation may be hypothesized . Additional experiments are clearly needed to determine the mechanism by which WIC-rdw rat presents a hypoplastic thyroid gland. The sequencing of the WIC-rdw rat TG cDNA revealed a c.6958G>C mutation [82,83]. The corresponding amino acid substitution in the ACHE-like domain of the WIC-rdw TG was p.G2300R [82,83]. As in the cog/cog mouse models, the WIC-rdw TG was retained inside the ER in cells. [82,83]. Experimental studies showed that when mutant TG was co-expressed with wild-type TG, the two proteins cross-dimerized, and secretion of WIC-rdw TG was partially restored.
Finally, Wistar Hannover Galas rat showed two distinct spontaneous abnormalities: goiter and dwarfism . The responsible mutation is a guanine to thymine transversion at the acceptor site of intron 6 of the TG gene (749-1G>T) that induces a complete missing of exon 7 from the TG transcript . Interestingly, homozygotes manifested both dwarfism and goiter, while heterozygotes had only a goiter, suggesting that the mutant phenotype is inherited as an autosomal semi-dominant trait .
Recent technological advances in instrumentation, computer hardware and software for next-generation sequencing (NGS) platforms  have led to the identification of new mutations in the TG gene [47-51,53]. The new technologies allow also the identification in the same patient with CH the coexistence of multiple mutations in different thyroid genes; for instances mutations in TG associated with mutations in DUOX2 [47,50,53] or TPO  or TSHR .
In the present paper, we discussed remarkable advances in the understanding of the pathophysiology of the CH associated with TG defects as well as in the identification of the mutations responsible for the disease. However, the impact of several mutations on the development of this disorder remains to be elucidated. The introduction of NGS approaches, characterized by a marked increase in the yield of DNA sequencing and the ability to analyze large populations will probably allow a change in the traditional understanding of the molecular and genetic bases of CH and the genotype-phenotype correlation. The identification of the coexistence of multiple mutations in the same gene or in different thyroid specific genes could contribute to the accurate diagnosis and classification of the defects. Moreover, the massive identification of mutations could be of greatly importance in the near future, since preimplantation of genetic diagnosis will be available for families in which the genetic defects responsible for the CH have been previously identified.
H.M. Targovnik and C.M. Rivolta are established investigators of the Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET).
C.E. Citterio is a posdoctoral research fellow of CONICET.
S. Siffo is research fellow of the Fondo para la Investigación Científica y Tecnológica (FONCyT-ANPCyT-MINCyT).
This study was supported by grants from the FONCyT-ANPCyT-MINCyT (PICT 2014-1193 to CMR, PICT 2012-1090 and PICT 2015-1811 to HMT), CONICET (PIP 2015-11220150100499 to CMR) and Universidad de Buenos Aires (UBACyT 2016-20020150100099BA to CMR).
Subscribe to our articles alerts and stay tuned.