Type I collagen is the most abundant structural protein in vertebrates. It is a heterotrimeric molecule composed of two α1 chains and one α2 chain, forming a long uninterrupted triple helical structure with short non-triple helical telopeptides at both the N- and C-termini. During biosynthesis, collagen acquires a number of post-translational modifications, including lysine modifications, that are critical to the structure and biological functions of this protein. Lysine modifications of collagen are highly complicated sequential processes catalysed by several groups of enzymes leading to the final step of biosynthesis, covalent intermolecular cross-linking. In the cell, specific lysine residues are hydroxylated to form hydroxylysine. Then specific hydroxylysine residues located in the helical domain of the molecule are glycosylated by the addition of galactose or glucose-galactose. Outside the cell, lysine and hydroxylysine residues in the N- and C-telopeptides can be oxidatively deaminated to produce reactive aldehydes that undergo a series of non-enzymatic condensation reactions to form covalent intra- and inter-molecular cross-links. Owing to the recent advances in molecular and cellular biology, and analytical technologies, the biological significance and molecular mechanisms of these modifications have been gradually elucidated. This chapter provides an overview on these enzymatic lysine modifications and subsequent cross-linking.
Collagens comprise a large family of triple helical proteins and are the most abundant protein in vertebrates, representing ∼30% of the total proteins. There are now at least 29 genetically distinct types of collagen identified that are encoded by at least 44 genes . Depending on the molecular structure and assembly mode, they can be divided into several subgroups . Fibril-forming collagens comprise the largest subgroup, including types I, II, III, V, XI, XXIV and XXVII. Of all types of collagens, fibrillar type I collagen is the most abundant type providing most tissues and organs with form, stability and connectivity. In addition to the structural functions, type I and other collagens also function as ligands for specific cell receptors, such as integrins, discoidin domain receptors, glycoprotein VI and the mannose receptor family etc., to control cellular activities and extracellular matrix remodelling . Studies have indicated that glycosylation of hydroxylysine residues of collagen plays a key role in the interaction with some of these receptors [4–6].
One of the critical factors for the structural and biomechanical functions of type I collagen fibrils are the PTMs (post-translational modifications) of peptidyl lysine residues. Lysine modifications of collagen are highly regulated sequential processes that take place inside and outside the cell. In the cell, specific peptidyl lysine residues both in the helical and non-helical (telopeptide) domains of the molecule (for the structure of type I collagen, see below) can be hydroxylated forming 5-hydroxylysine. Specific hydroxylysine residues in the helical domain can then be glycosylated with the addition of galactose, some of which can be further glycosylated with the addition of glucose. Specific enzymes catalyse each of these sequential and domain-specific lysine modifications in the cell. Outside the cell, an enzymatic oxidative deamination occurs to the telopeptidyl lysine and hydroxylysine residues producing the reactive aldehydic residues. The aldehydes produced then initiate a series of non-enzymatic condensation reactions to form extensive covalent intra- and inter-molecular cross-links which are critical for the biomechanical functions of the collagen fibrils . Since type I collagen is the most well investigated collagen for its lysine modifications and they are probably shared by other fibrillar collagens, this chapter focuses on those modifications of type I collagen. Owing to the limited space, non-enzymatic glycosylation, known as glycation, of collagen is not dealt with in this chapter (for a detailed review of this, see ).
The biosynthesis of collagen is a long complicated process involving a number of PTMs, chain association and folding, secretion, procollagen processing, self-assembly and progressive cross-linking (Figure 1). Type I collagen is a long (∼300 nm long, ∼1.5 nm thick) heterotrimeric molecule composed of two α1 chains and one α2 chain. An α1 homotrimeric form exists as a minor form. The molecule consists of three domains: the N-terminal non-triple helical domain (N-telopeptide), the central triple helical domain and the C-terminal non-triple helical domain (C-telopeptide). The single (uninterrupted) triple helical domain represents more than 95% of the molecule (for the molecular structure, see Figure 2).
After synthesis in the ER (endoplasmic reticulum), three proα chains (two proα1 and one proα2) associate with one another near the C-terminus of the chains, and the triple helix is propagated towards the N-terminus in a zipper-like fashion to form a procollagen molecule. Several specific enzymes and chaperones are involved in chain association, stabilization and correct folding [2,9]. Before the formation of a triple helix procollagen molecule, however, the nascent proα chains acquire several PTMs. They include the hydroxylation of specific proline residues, which are catalysed by prolyl-4-hydroxylases and prolyl-3-hydroxylase. The former reacts on proline with the minimum sequence X-Pro-Gly and the latter appears to require a Pro-4-Hyp-Gly (Hyp is hydroxyproline) sequence. The proline hydroxylation is critical for the stabilization of the triple helix conformation. The hydroxylation of specific lysine residues and glycosylation of hydroxylysine residues also occur enzymatically (see below) before the formation of a triple helical procollagen molecule. The procollagen molecules are then transported from the ER to the Golgi stacks, and are secreted to the extracellular space. The N- and C-propeptides of procollagen are then cleaved off by a group of metalloproteinases belonging to the ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) and BMP1 (bone morphogenetic protein 1)/Tolloid-like proteinases respectively, releasing a mature type I collagen molecule.
After both N- and C-propeptides are cleaved, the collagen molecules are then spontaneously packed in parallel and longitudinally staggered with respect to one another by some multiple of axial repeat distance, called a D period (∼67 nm). This packing arrangement creates repeated zones of high and low packing density, i.e. overlap and hole zones respectively, showing a characteristic banding pattern of a collagen fibril seen at the ultrastructural level (Figure 1). The molecules in the fibril are then stabilized by extensive covalent intermolecular cross-linking, which is critical for the formation of mechanically functional collagen fibrils. For the details of fibrillar collagen biosynthesis, see .
Intracellular lysine modifications of collagen
Hydroxylation of lysine
In human type I collagen, there are 38 lysine residues in an α1 chain (36 in the helical domain, one in the C-telopeptide domain and one in the N-telopeptide domain) and 31 in an α2 chain (30 in the helical domain, one in the N-telopeptide domain, and none in the C-telopeptide domain) (UniProt numbers P02452, P02464 and P08123) (Figure 3A). The numbers and sites of lysine residues in the α1 chain are conserved among different species, e.g. human, mouse, rat and bovine (UniProt numbers P02452, P11087, P02454 and P02453 respectively); however, there are slight differences in those in the α2 chain. There is one extra lysine residue in the helical region in mouse and rat (total lysine residues 32, 31 in the helical domain and one in N-telopeptide domain) and there are three sites that vary among these four species (UniProt numbers P08123, Q01149, P02466 and P02465 respectively). The lysine residues in both α1 and α2 chains that are involved in the formation of covalent cross-linking (telopeptide and helical domains, see below) are all conserved among species. Biochemical analysis has indicated that the extent of lysine hydroxylation of collagen is highly variable in comparison with proline hydroxylation. Approximately 50% of proline residues in collagen are hydroxylated in various genetic types of collagen in different tissues, and it does not significantly change under physiological conditions. The extent of lysine hydroxylation, however, can vary from 15 to 90% depending on the collagen types  and, even within type I collagen, it varies significantly from tissue to tissue and under the physiological/pathological condition of the tissue . Furthermore, a difference in the extent of lysine hydroxylation exists between the helical and telopeptide domains of a type I collagen molecule [11–14]. These observations led investigators to speculate that there is more than one mechanism for lysine hydroxylation of collagen [12,15].
LH (lysyl hydroxylase; EC22.214.171.124) catalyses the formation of hydroxylysine in -X-Lys-Gly- sequences in a reaction that requires Fe2+, 2-oxoglutarate, O2 and ascorbate . In addition to the -X-Lys-Gly- sequence, -X-Lys-Ala- and -X-Lys-Ser- sequences present in the telopeptides (both N- and C-termini) of fibrillar collagen can be hydroxylated. However, for a long time, it was not clear whether one LH hydroxylates lysine residues in both helical and telopeptide domains of the molecule. In the late 1990s, three genes encoding isoforms of LH (procollagen-lysine, 2-oxoglutarate and 5-dioxygenase, PLOD1–PLOD3/LH1–3) were identified and partially characterized [17,18]. The substrate specificities of these isoforms in vivo are still not clearly established; however, evidence indicates that LH1 primarily hydroxylates lysine residues in the helical domains of fibrillar/non-fibrillar collagens . For LH2, two alternatively spliced isoforms were identified, termed LH2a or LH2(short) and LH2b or LH2(long) respectively. The latter includes an additional 21 amino acids encoded by the 63 nucleotide exon 13A and the former does not. The LH2b appears to be ubiquitously expressed and is the major form of LH2 in most tissues . Our initial observation that LH2 gene expression was closely associated with lysine hydroxylation in the telopeptide domains of a type I collagen molecule led us to propose LH2 (LH2b) as a telopeptidyl LH . Several subsequent reports confirmed that LH2b is indeed the telopeptidyl LH and that forced overexpression of the LH2b gene in cells results in an increase in the Hylald (δ-hydroxy, α-aminoadipic acid-δ-semialdehyde)-derived collagen cross-links [20–23] (for collagen cross-linking, see below). LH3 is a multifunctional enzyme possessing LH, GT (hydroxylysyl galactosyltransferase) and GGT (galactosylhydroxylysyl glucosyltransferase) activities . However, its LH activity may be significant for collagen types IV and V, but not type I collagen .
The biological significance of lysine hydroxylation/subsequent modifications cannot be overemphasized. In humans, mutations in the gene encoding LH1 cause Ehlers–Danlos syndrome type VIA and the clinically similar Nevo syndrome, characterized by hypermobile joints, hyperextensible skin and kyphoscoliosis etc. [26,27]. Mutations in LH2 are known to cause Bruck syndrome type 2, a rare autosomal-recessive disorder characterized by bone fragility and congenital joint contractures [21,28]. Most recently, mutations in the LH3 gene and reduced LH3 protein have also been shown to cause several congenital connective tissue malformations and epidermolysis bullosa simplex [29,30]. In an osteoblastic cell-culture system, altered levels of LH2b cause defective cross-linking patterns, collagen fibrillogenesis and matrix mineralization , which also indicates the critical importance of telopeptidyl lysine hydroxylation of collagen in determining bone quality .
Glycosylation of hydroxylysine
In collagens and collagen-like proteins, some of the hydroxylysine residues in the helical domain are further modified by the sequential steps of O-linked glycosylation producing G-Hyl (galactosylhydroxylysine) and GG-Hyl (glucosylgalactosylhydroxylysine). These reactions are catalysed by GT (EC 126.96.36.199) and GGT (EC 188.8.131.52) respectively (Figure 3B) [10,33]. Structural analysis of GG-Hyl shows that galactose is attached to the hydroxy group of hydroxylysine by a β-glycosidic bond, whereas glucose is linked by an α-glycosidic bond to C-2 of the galactose unit forming the complex structure of 2-O-α-D-glucopyranosyl-O-β-D-galactopyranosylhydroxylysine (Figure 3B). Both reactions require the presence of a free ε-amino group of the hydroxylysine residue, UDP-galactose or UDP-glucose as carbohydrate donors, and the most preferred bivalent cation cofactor Mn2+ . The complete triple helical structure of a collagen molecule inhibits further glycosylation, indicating that the reactions take place on single α chains prior to triple helix formation of procollagen ( and reviewed in ). The enzymatic activities of GT and GGT have previously been extensively studied and characterized from various tissue extracts (reviewed in ), but it was only approximately a decade ago, that it was found that they were modulated by the multifunctional enzyme LH3 [35,36]. By analysing LH3-deficient mice it was shown that impairment in type IV and VI collagen biosynthesis and secretion resulted in embryonic lethal phenotypes, in which the severity of the phenotype showed a significant correlation only to the level of GGT activity, not LH or GT activity [25,37,38]. It has also been reported that decreased GGT activity from mutations discovered in the human LH3 gene results in congenital disorders affecting connective tissues of various organs [29,30]. For bone type I collagen, most recently, we have shown that the major function of LH3 is GGT activity, but not LH or GT activity . This is consistent with previous studies demonstrating that the GT activity of LH3, when compared with GGT, was significantly lower [35,36] or undetectable [37,39]. As for the collagen GT enzyme, a novel family of collagen galactosyltransferase enzymes, GLT25D1 and GLT25D2 (glycosyltransferase 25 domain 1 and glycosyltransferase 25 domain 2) which are encoded by the GLT25D1 and GLT25D2 genes, has been identified. Their tissue distribution and enzymatic activity, as GT, were reported, but the substrate specificity of each isoform is not clearly defined. Of the two isoforms, GLT25D1 exhibited broad expression in several fetal and adult human tissues, whereas GLT25D2 expression was detected only in a few cell types, suggesting the former to be the main GT enzyme . We have also shown that Glt25d1 is the major isoform expressed in the mouse osteoblast cell line . Taken together, these recent studies indicate that, in type I collagen, galactosylation of hydroxylysine is catalysed by GLT25D1 and subsequent glucosylation by LH3 (Figure 3).
The type (mono- or di-) and extent of collagen glycosylation varies among different collagen types, showing a higher abundance in the less-organized collagen type IV of the basement membrane and lower levels in the highly organized fibrillar collagens, types I, II or III [10,33]. Furthermore, even those in type I collagen vary depending on the functional regions within the tissue , maturation [41,42] and pathological conditions [29,30,43–46]. In bone, for instance, alterations in the levels of collagen glycosylation have been reported in several bone/skeletal disorders, such as osteogenesis imperfecta [44, 47–49], post-menopausal osteoporosis [43,50,51], osteosarcoma, osteofibrous dysplasia  and Kashin–Beck disease , suggesting a role of collagen glycosylation in bone mineralization. Considering the bulky structure of the sugars and the glycosylation sites, structural as well as biological functions have been proposed for collagen glycosylation, i.e. control of collagen fibrillogenesis [34,53–57], cross-linking [58–62], remodelling , collagen–cell interaction [4,6] and induction of vessel-like structures . Despite all of the findings with regard to collagen glycosylation reported to date, the specific biological functions of the glycosylated hydroxylysine residues in collagen, in relation to the extent and type of glycosylation at their molecular loci, are still not clearly defined.
Extracellular lysine modifications
Oxidative deamination of lysine and hydroxylysine by LOX (lysyl oxidase)
After procollagen molecules were secreted into the extracellular space and both N- and C-propeptide extensions are cleaved off, collagen molecules then self-assemble into fibrils. Recent studies have suggested that procollagen processing and fibril assembly may begin within the confines of the cell membrane . These cleavages, especially that of C-propeptide by BMP1/Tolloid-like proteinases, is necessary for the initiation of fibrillogenesis . During fibrillogenesis, specific lysine and hydroxylysine residues in the N- and C-telopeptides are oxidatively deaminated by LOX to form the respective aldehydic forms, Lysald (α-aminoadipic acid-δ-semialdehyde) and Hylald (Figure 4), which initiates a series of condensation reactions to form covalent intra- and inter-molecular cross-links (see below). The reaction catalysed by LOX is RCH2NH2 + H2O + O2 → RCH = O + NH3 + H2O2, where the amine group is derived from telopeptidyl lysine and hydroxylysine. In type I collagen, there are two residues of lysine or hydroxylysine in the C-telopeptides (both from α1 chains, i.e. there is no lysine residue in the α2 C-telopeptide) and three in the N-telopeptides (a total of five residues/molecule) that can be oxidized by LOX (Figure 4). LOX contains two cofactors essential to its catalytic function. This enzyme contains one tightly bound copper ion (Cu2+) and a unique covalently integrated organic cofactor, LTQ (lysine tyrosylquinone). LTQ is probably the product of an autocatalytic reaction involving Cu2+-mediated/assisted oxidation of a specific peptidyl tyrosine residue (residue 345 in rat LOX) followed by covalent cross-linking with the ε-amino group of a conserved peptidyl lysine residue (residue 314 in rat LOX) . It is interesting that this cross-linking enzyme requires its own covalent intramolecular cross-link, LTQ, to function. Apparently LOX cannot catalyse this reaction on lysine and hydroxylysine residues located in the rigid helical domain of the molecule.
LOX is a copper-dependent amine oxidase (EC 184.108.40.206, protein-lysine 6-oxidases) and it is synthesized as a catalytically quiescent N-glycosylated∼50 kDa proLOX  by various cell types. ProLOX is processed extracellularly by BMP1/Tolloid-like proteinases, the same proteinases that cleave the C-propeptide of type I procollagen, to produce an active mature ∼30 kDa LOX. Recently, an additional processing of proLOX has been reported, resulting in a truncated form of active LOX, although its biological function is not clear at present . The N-terminal propeptide of LOX has been implicated in regulating the localization of the enzyme, suppressing tumours and inhibiting cell proliferation [69–71]. It has been reported that type I collagen molecules need to be pre-assembled into quarter-staggered fibrillar arrays prior to oxidation of the telopeptidyl lysine and hydroxylysine by LOX [66,72]. Thus it is interesting to note that the same proteinases produce an active LOX enzyme and, at the same time, prepare for its collagen substrate. They also process other types of collagen, a number of extracellular matrix proteins and some of the growth factors, demonstrating their broad and key role in regulating and orchestrating the formation of the extracellular matrix .
The critical importance of LOX-mediated cross-linking in tissue stability is well exemplified by ‘lathyrism’. This is the condition where deleterious effects in connective tissues are caused by lathyrogens such as BAPN (β-aminopropionitrile) , a potent and irreversible inhibitor of LOX, thus preventing collagen and elastin cross-linking. It has been proposed that the inhibition occurs by the formation of a ‘dead-end’ complex between BAPN and the carbonyl cofactor of LOX, LTQ (Figure 4) . In addition to this classic example of specific inhibition of a biochemical pathway, gene-deletion studies have also provided significant insights into the critical role of LOX in connective tissue formation and maintenance. Mice lacking the lox gene die at the end of gestation or as neonates exhibiting the rupture of the aorta and diaphragm, and other connective tissues defects [76,77]. However, collagen cross-links, although diminished, were still formed in these mice, indicating some compensatory mechanisms by other LOX family members.
During the last two decades, four LOXL (LOX-like) proteins (LOXL1–4) have been identified, thus establishing a gene family of five members . Structurally, LOX and LOXL1 form one subfamily, and LOXL2–4 another family. Although the N-terminal regions of the LOX family members are highly variable, all possess high similarity in the C-terminal catalytic domain, i.e. a putative Cu2+-binding site, a CRL (cytokine receptor-like) domain and amino acid residues forming the LTQ site . Previous studies have shown that LOX family members possess a wide range of biological functions other than collagen/elastin cross-linking, and they include tumour suppression or metastases promotion, control of growth-factor activity, actin filament formation and chemotaxis for various cell types etc. [80–85]. Although the substrate specificity of each LOXL isoform is not clear at present, studies have indicated their distinct functions. The direct involvement of these LOXL proteins in collagen cross-linking is also still not clearly demonstrated.
The enzymes that catalyse the reactions of lysine modifications of collagen described above, their characteristics and related human diseases/disorders are summarized in Table 1.
Covalent intra- and inter-molecular cross-linking
The final step of type I collagen biosynthesis is the formation of covalent cross-links, which is critical for the biomechanical functions of collagen. In this process, again, lysine modifications play a crucial role.
The telopeptidyl aldehydes generated by LOX react with another aldehyde or with the ε-amine groups derived from the unmodified lysine and hydroxylysine residues to form the initial intra- and inter-molecular cross-links. They can then further mature into multivalent cross-links by involving additional amino acid residues including peptidyl histidine. The presence of such complex intermolecular cross-links demonstrates the highly specific molecular packing arrangement in the fibril, as the amino acid residues involved in cross-linking are relatively sparse (especially histidine) in collagen and they need to be in the correct proximity. Depending on the combination of amino acid residues, several cross-linking pathways have evolved (for details of LOX-mediated and non-enzymatic cross-linking in various connective tissues, see [8,32,86,87]). Figure 5 illustrates cross-linking pathways that are derived from Lysald or Hylald.
One of the most well-investigated pathways is the one leading to a Pyr (pyridinoline) cross-link, which was first identified by Fujimoto . It starts with the Hylald residue in the telopeptides, i.e. the 16th residue in the α1 C-telopeptide (α1-16C), and the 9th and 5th residue in the α1 and α2 N-telopeptide (α1-9N and α2-5N) (Figure 5). Then the Hylald reacts with the ε-amino group of the helical hydroxylysine residue on a neighbouring molecule producing an intermolecular aldimine (Schiff base) cross-link, deH-DHLNL (dehydro-dihydroxylysinonorleucine). This can be formed between 16C-Hylald (α1) and the juxtaposed helical hydroxylysine (Hyl87) of α1 and α2 chains, and between the 9N (α1)- or 5N (α2)-Hylald and the juxtaposed helical hydroxylysine (α1-930 or α2-933) (Figure 5). The aldimine can spontaneously rearrange to form a ketoamine form by Amadori rearrangement. However, some may remain as an aldimine form in situ. Both forms are reducible with reducing agents, such as sodium borohydride, under mild conditions. Then, two residues of deH-DHLNL/ketoamine can condense with each other forming a tri-functional cross-link, Pyr, by releasing a helical hydroxylysine residue. Thus Pyr is composed of two residues of telopeptidyl Hylald and one helical hydroxylysine residue. Although several mechanisms were proposed for Pyr formation [87,89–91] and the details are still not elucidated, Pyr is clearly a maturational product of deH-DHLNL tying together three  or two collagen molecules . In mineralized tissues, the major locus of deH-DHLNL and Pyr involves the 16C-Hylald (α1) and 87-Hyl (α1) [87,92–94]. Apparently not all deH-DHLNL mature into Pyr as the rate of the decrease of deH-DHLNL/ketoamine and an increase of Pyr is often disproportional in vivo as well as during an in vitro incubation (, and M. Sricholpech and M. Yamauchi, unpublished work). The fate of all deH-DHLNL is not fully understood but, recently, an alternative mechanism to Pyr formation through its further oxidation and free arginine addition has been suggested .
A minor Pyr analogue form, d-Pyr (deoxypyridinoline) , is formed when a helical lysine residue (instead of hydroxylysine) is involved, thus the analogue form is composed of two telopeptidyl Hylald and one helical lysine residue. This analogue is relatively abundant in mineralized tissues.
Prl (pyrrole) cross-links, another type of trivalent cross-link found in skeletal tissues, are probably formed by a similar condensation reaction composed of telopeptidyl Hylald, telopeptidyl Lysald and helical hydroxylysine or lysine [59,97]. However, due to its inherent lability, its quantitative analysis and characterization is limited.
Although Pyr is present in most skeletal and connective tissues, such as cartilage, bone, dentin, tendon, aorta and ligament, it is almost absent from skin and cornea . Interestingly, these two connective tissues cover the external surface of the human body and are subject to UV light, to which Pyr is susceptible . Nature clearly chose a different cross-linking pathway in these tissues, producing a more UV-resistant stable tri-functional cross-link, HHL (histidinohydroxylysininorleucine) [99,100].
The Lysald derived from the C-telopeptide in skin and cornea collagen reacts with the ε-amine group of the helical Hyl87 on a neighbouring molecule to form an aldimine intermolecular cross-link, deH-HLNL (dehydro-hydroxylysinonorleucine). deH-HLNL can then form a stable non-reducible cross-link, HHL, by involving the helical residue His92 on an α2 chain [99,101]. Quantitative analysis demonstrated that this cross-link rapidly increased from birth through maturational stage, with a gradual, but steady, increase with aging in both cow and human skin collagen . The structure and experimental evidence indicate that the imidazole C-2 of histidine is linked to C-6 of norleucine (ε-deaminated lysine residue) which in turn is linked to the C-6 of the amino group of hydroxylysine . Thus two of the three amino acid residues, i.e. hydroxylysine and histidine, in HHL are derived from the helical domain of the molecule. HHL, like Pyr, is stable in acid hydrolysis but, in contrast with Pyr, is stable against UV light , and thus suitable for skin and cornea.
Another major cross-link in soft connective tissues is derived from an ACP (aldol condensation product; α,β-unsaturated aldol) formed between two residues of Lysald, which occurs as an intramolecular cross-link located in the N-telopeptides [104,105]. Michael addition of N3 imidazole of histidine to the ACP produces deH-AH (dehydro-aldolhistidine) which can further condense with the ε-amino group of hydroxylysine forming an aldimine bond to produce deH-HHMD (dehydro-histidinohydroxymerodesmosine)  (Figure 5), one of the major cross-links in soft connective tissue collagen [62,106]. This cross-link, which involves four amino acids, has the most complex chemical structure in all of the known collagen cross-links.
An aldimine cross-link, dehydro-lysinonorleucine, can also occur when Lysald condenses with helical lysine (Figure 5). However, this cross-link is minor in collagen.
The LOX-mediated collagen cross-linking is controlled at three levels, i.e. quantity, quality and maturation. (i) Quantity of cross-links: since all of the cross-links described are initiated by the LOX-catalysed aldehyde formation in the telopeptides, the levels of aldehyde primarily determine the quantity of cross-links formed. Thus the expression and activity of LOX (possibly LOXLs) are key to controlling the quantity of cross-links. (ii) Quality of cross-links: for quality, i.e. type of cross-links, the extent of lysine hydroxylation in the N- or C-telopeptides and those in the juxtaposed helical domain of a neighbouring molecule is the critical determinant. Thus the levels and activities of LH isoforms are key to control the quality of cross-links. (iii) Maturation of the cross-links: as described above, the formation of the initial aldimine cross-links or ACP and their maturation are spontaneous, and thus not controlled by specific enzymes. It is likely that the microenvironment around the cross-linking regions controls these processes. One such potential factor is the state (extent and type) of glycosylation of hydroxylysine involved in cross-linking due to their bulky structures and hydrophilic nature.
Conclusions and perspectives
Lysine modifications of collagen have been investigated for the last five decades, and their biological significance, molecular mechanisms and detailed chemistries have been gradually elucidated. The modifications are tissue-specific and probably associated with their functions. Identification of isoforms of LH, LOX and GLT25D has provided significant new insights into the tissue-specific control of such modifications. However, there are still a number of unanswered questions including: (i) specificity and redundancy of these isoforms, (ii) regulatory mechanisms of gene expression and enzymatic activity of these isoforms in different cells and tissues, (iii) function(s) of hydroxylysine glycosylation, and (iv) controlling factor(s) of cross-link maturation. Some of these issues could be addressed by studies employing gain- and loss-of-function approaches (e.g. single- and double-gene knockdown, and overexpression) in well-defined cell-culture systems, and tissue- and developmental stage-specific conditional gene-deletion experiments in vivo.
• Lysine modifications of fibrillar type I collagen occur both intra- and extra-cellularly in a tissue-specific manner.
• Inside cells, specific lysine residues in the helical domain are hydroxylated primarily by LH1, and those in the telopeptide domains probably by LH2(b).
• Specific hydroxylysine residues in the helical domain can be sequentially glycosylated by the additions of galactose and then glucose. These reactions may be catalysed by GLT25D1 and LH3 respectively.
• Outside the cells, lysine and hydroxylysine residues in the telopeptides can be oxidatively deaminated by LOX, and possibly by its family members, generating the respective aldehydes, i.e. Lysald and Hylald.
• The aldehydes then condense with the juxtaposed lysine, hydroxylysine, Lysald, Hylald and histidine to spontaneously form various covalent cross-links, which are tissue-specific.
Work in the authors laboratory is supported by the National Institute of Health (NIH) [grant numbers R21 DE019569, DE020909 and AR060978].
- © The Authors Journal compilation © 2012 Biochemical Society