Determination of gene organization in the human IGHV region on single chromosomes (original) (raw)
Related papers
Immunogenetics, 2005
More than 100 variable (V), 27 diversity (D), and six joining (J) genes are encoded in the human heavy chain locus, and many of these genes exists in different allelic forms. The number of genes and the allelic differences help to create diversity in the immunoglobulin receptors, a key feature of the adaptive immune system. We here report the identification of two novel and seemingly functional alleles of human heavy chain genes. The variable IGHV3-23*04 allele is found with an allele frequency of 0.21 amongst Danish Caucasians, whereas the novel joining IGHJ6*04 allele is rare (allele frequency 0.02). We also report the full sequence of IGHV3-h. The gene exists in two allelic forms but is only found in 58% of the Danish Caucasians studied. The methionine translation initiation codon is mutated, ATG→AAG, and we therefore propose that the gene is a pseudogene incapable of being translated.
The Journal of Immunology, 2010
Individual variation in the Ig germline gene repertoire leads to individual differences in the combinatorial diversity of the Ab repertoire, but the study of such variation has been problematic. The application of high-throughput DNA sequencing to the study of rearranged Ig genes now makes this possible. The sequencing of thousands of VDJ rearrangements from an individual, either from genomic DNA or expressed mRNA, should allow their germline IGHV, IGHD, and IGHJ repertoires to be inferred. In addition, where previously mere glimpses of diversity could be gained from sequencing studies, new large data sets should allow the rearrangement frequency of different genes and alleles to be seen with clarity. We analyzed the DNA of 108,210 human IgH chain rearrangements from 12 individuals and determined their individual IGH genotypes. The number of reportedly functional IGHV genes and allelic variants ranged from 45 to 60, principally because of variable levels of gene heterozygosity, and included 14 previously unreported IGHV polymorphisms. New polymorphisms of the IGHD3-16 and IGHJ6 genes were also seen. At heterozygous loci, remarkably different rearrangement frequencies were seen for the various IGHV alleles, and these frequencies were consistent between individuals. The specific alleles that make up an individual's Ig genotype may therefore be critical in shaping the combinatorial repertoire. The extent of genotypic variation between individuals is highlighted by an individual with aplastic anemia who appears to lack six contiguous IGHD genes on both chromosomes. These deletions significantly alter the potential expressed IGH repertoire, and possibly immune function, in this individual.
BMC Genomics, 2011
Background Segmental duplication and deletion were implicated for a region containing the human immunoglobulin heavy chain variable (IGHV) gene segments, 1.9III/hv3005 (possible allelic variants of IGHV3-30) and hv3019b9 (a possible allelic variant of IGHV3-33). However, very little is known about the ranges of the duplication and the polymorphic region. This is mainly because of the difficulty associated with distinguishing between allelic and paralogous sequences in the IGHV region containing extensive repetitive sequences. Inability to separate the two parental haploid genomes in the subjects is another serious barrier. To address these issues, unique DNA sequence tags evenly distributed within and flanking the duplicated region implicated by the previous studies were selected. The selected tags in single sperm from six unrelated healthy donors were amplified by multiplex PCR followed by microarray detection. In this way, individual haplotypes of different parental origins in the sperm donors could be analyzed separately and precisely. The identified polymorphic region was further analyzed at the nucleotide sequence level using sequences from the three human genomic sequence assemblies in the database. Results A large polymorphic region was identified using the selected sequence tags. Four of the 12 haplotypes were shown to contain consecutively undetectable tags spanning in a variable range. Detailed analysis of sequences from the genomic sequence assemblies revealed two large duplicate sequence blocks of 24,696 bp and 24,387 bp, respectively, and an incomplete copy of 961 bp in this region. It contains up to 13 IGHV gene segments depending on haplotypes. A polymorphic region was found to be located within the duplicated blocks. The variants of this polymorphism unusually diverged at the nucleotide sequence level and in IGHV gene segment number, composition and organization, indicating a limited selection pressure in general. However, the divergence level within the gene segments is significantly different from that in the intergenic regions indicating that these regions may have been subject to different selection pressures and that the IGHV gene segments in this region are functionally important. Conclusions Non-reciprocal genetic rearrangements associated with large duplicate sequence blocks could substantially contribute to the IGHV region diversity. Since the resulting polymorphisms may affect the number, composition and organization of the gene segments in this region, it may have significant impact on the function of the IGHV gene segment repertoire, antibody diversity, and therefore, the immune system. Because one of the gene segments, 3-30 (1.9III), is associated with autoimmune diseases, it could be of diagnostic significance to learn about the variants in the haplotypes by using the multiplex haplotype analysis system used in the present study with DNA sequence tags specific for the variants of all gene segments in this region.
Immunogenetics, 2001
The immunoglobulin (Ig) specificity repertoire in most vertebrates is established during early B-cell development by rearrangement of variable genes located at the Ig heavy-chain (IGH) and light-chain (IGL) loci. The variable domain of the IgH chain is encoded by a variable gene (V), a diversity gene (D), and a joining gene (J), whereas the IgL chain variable region is encoded by a V and J gene. A translocon IGH gene organization (V n -D n -J n -C n ), as in humans and rodents, generally consists of many V genes (>100), a few to a dozen D genes, and a small number of J genes ).
Immunogenetics, 2011
Complete and accurate knowledge of the genes and allelic variants of the human immunoglobulin gene loci is critical for studies of B cell repertoire development and somatic point mutation, but evidence from studies of VDJ rearrangements suggests that our knowledge of the available immunoglobulin gene repertoire is far from complete. The reported repertoire has changed little over the last 15 years. This is, in part, a consequence of the inefficiencies involved in searching for new members of large, multigenic gene families by cloning and sequencing. The advent of high-throughput sequencing provides a new avenue by which the germline repertoire can be explored. In this report, we describe pyrosequencing studies of the heavy chain IGHV1, IGHV3 and IGHV4 gene subgroups in ten Papua New Guineans. Thousands of 454 reads aligned with complete identity to 51 previously reported functional IGHV genes and allelic variants. A new gene, IGHV3-NL1*01, was identified, which differs from the nearest previously reported gene by 15 nucleotides. Sixteen new IGHV alleles were also identified, 15 of which varied from previously reported functional IGHV genes by between one and four nucleotides, while one sequence appears to be a functional variant of the pseudogene IGHV3-25. BLAST searches suggest that at least six of these new genes are carried within the relatively well-studied populations of North America, Europe or Asia. This study substantially expands the known immunoglobulin gene repertoire and demonstrates that genetic variation of immunoglobulin genes can now be efficiently explored in different human populations using high-throughput pyrosequencing.
Definition of the human immunoglobulin variable lambda (IGLV) gene subgroups
European Journal of Immunology, 1990
MolBculaire, URA CNRS 1191, Universitk Montpellier II, Montpellier Definition of the human immunoglobulin variable lambda (IGLV) gene subgroups* Comparison of 60 human immunoglobulin variable lambda (IGLV) sequences allowed us to define seven subgroups designated VJ to VhVII. We demonstrate that all h proteins sequenced so far fall into the subgroups I, 11, I11 and VI, and that the h regions previously assigned to subgroups IV and V belong, in fact, to subgroups 111 and 11, respectively. Four sequences not belonging to any of the subgroups I, 11, I11 and VI define the new subgroups IV, Vand VII. Interestingly, these subgroups show a higher homology to rabbit or mouseVh genes than to the other human Vh subgroups. By comparison of the proteins either with the sequences deduced from the germ-line genes or with the consensus sequences, the rate of amino acid changes due to somatic mutations or allelic variations was evaluated in several h proteins. Framework and complementarity-determining regions of the human IGLVgenes and proteins were delineated. Alignment of the h sequences shows that functional V-J rearrangement occurs, with or without deletion of nucleotides encoding one or two amino acids at the 3' end of the V gene. Diversity of the third complementarity-determining region is due to somatic mutations and to flexible V-J junction, but there is no evidence of N-diversity in the human h locus.
European Journal of Immunology, 1989
The human immunoglobulin heavy chain constant gene locus (IGHC) is polymorphic at both the protein (Gm and A2m allotypes) and the DNA level [RFLP for the gamma genes (IGHG), the switch p region (IGHSM) and the switch cx regions (IGHSA)]. The polymorphisms have been a valuable tool for assessment of the IGHC locus organization and a variety of population genetics and immunological investigations. In this study three new probes, identifying regions related to the IGHG (IGHPG and IGHSG) or IGHA (IGHAT) genes, have been employed to describe 11 different loci, 6 of which were polymorphic. Most of the polymorphisms are probably due to short insertions/deletions, particularly the SG regions, due to their repetitive structure. Ten loci were assigned to the IGHC region on the basis of known restriction maps, deletion mapping and association with mapped RFLP; the l l t h , despite a striking sequence similarity with the IGHPG regions, could not be assigned to any known IGHC subregion. Analysis of these and previously known IGHG RFLP in a sample of 65 unrelated subjects plus 15 families allowed us to draw a genetic map, with particularly high resolution in the GP-G2-G4 genes region, revealing a marked discontinuity in the linkage disequilibrium values between pairs of adjacent loci.
Definition of the RFLP alleles in the human immunoglobulin IGHG gene locus
European Journal of Immunology, 1988
In order to define more precisely the polymorphism of the human immunoglobulin IGHG (Cy) genes and, consequently, to understand the structure and evolution of this multigene family, we have investigated the Restriction Fragment Length Polymorphisms (RFLP) of the IGHG genes in 113 unrelated individuals and 18 families. Using the restriction enzymes BamHI, Sac1 and EcoRI, and hybridization to a IGHG probe and to a specific IGHG3 (Cy3) probe, we describe 47 different restriction fragments (KF) in the IGHG locus, allowing us to define at least 15 RFLP alleles for the different IGHG genes. Our data demonstrate that the restriction fragment length polymorphism is occasionally due to the presence or absence of a given restriction site as a consequence of a point mutation, such as the creation of an Eco RI site, at the codon 3 of exon 2 in one IGHG4 allele (IGHG4*D2). However, most of the RFLP we observed seem to result from the insertion or deletion of a few hundred base pairs as illustrated by the IGHG3 polymorphism.