Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Oct 24;17(1):255.
doi: 10.1186/s12920-024-02024-0.

Identification of allele-specific KIV-2 repeats and impact on Lp(a) measurements for cardiovascular disease risk

Affiliations

Identification of allele-specific KIV-2 repeats and impact on Lp(a) measurements for cardiovascular disease risk

Sairam Behera et al. BMC Med Genomics. .

Abstract

The abundance of Lp(a) protein holds significant implications for the risk of cardiovascular disease (CVD), which is directly impacted by the copy number (CN) of KIV-2, a 5.5 kbp sub-region. KIV-2 is highly polymorphic in the population and accurate analysis is challenging. In this study, we present the DRAGEN KIV-2 CN caller, which utilizes short reads. Data across 166 WGS show that the caller has high accuracy, compared to optical mapping and can further phase approximately 50% of the samples. We compared KIV-2 CN numbers to 24 previously postulated KIV-2 relevant SNVs, revealing that many are ineffective predictors of KIV-2 copy number. Population studies, including USA-based cohorts, showed distinct KIV-2 CN, distributions for European-, African-, and Hispanic-American populations and further underscored the limitations of SNV predictors. We demonstrate that the CN estimates correlate significantly with the available Lp(a) protein levels and that phasing is highly important.

Keywords: Cardiovascular disease; Copy number variation; Illumina; LPA; Next Generation Sequencing.

PubMed Disclaimer

Conflict of interest statement

F.J.S receives research support from Illumina, PacBio and ONT. L.P. is funded from Genentech. E.N., M.R., E.R., J.H. and V.O. are employees from Illumina. J.R.B., X.C. and M.A.E. are employees from PacBio. V.K.M. is employed now at Genentech.

Figures

Fig. 1
Fig. 1
Overview of KIV-2 LPA performance: A Genome wide alignment of short Illumina reads and long PacBio HiFi and ONT reads to LPA gene. We observed many non-unique mappings (i.e. mapping quality MQ = 0, shown in white) in KIV-2. The GRCh38 representation of KIV-2 contains six copies of the 5.5 kbp repeat, challenging even for longer reads. This is shown by many white (MQ = 0) PacBio reads and to a lesser extent for Oxford Nanopore Technologies (ONT) reads. The fraction of non-unique mapped reads increased with shorter reads, which often hindered the direct assessment of KIV-2 repeat copy number. B Total KIV-2 copy number compared between calls made by the DRAGEN LPA caller or using optical mapping technology, for cases where optical mapping spanned both genomic alleles. Dashed lines indicate error margins of 5% from optical mapping copy number. C Allelic KIV-2 copy number compared where the per-allele copy number was available from the DRAGEN LPA caller and from optical mapping assemblies. Dashed lines indicate error margins of 5% from optical mapping copy number
Fig. 2
Fig. 2
Performance of the DRAGEN LPA caller across the 1KGP. A Distribution of LPA KIV-2 total copy numbers among samples from 1KGP. B Histogram of absolute value of allele length difference between allele1 and allele2 among samples from 1KGP. The DRAGEN LPA caller could haplotype-resolve about 50% of the samples. C For 60 trios where both offspring and parent KIV-2 allele lengths are reported, an allele combination is chosen which minimizes the total difference between each of the two offspring alleles and one from each parent as the most likely allele origin. Each pair, consisting of one offspring allele and one associated parent allele, is shown for a total of 120 allele pairs. Dashed lines indicate error margins of 5% from expected. D For 153 duos where both offspring KIV-2 allele lengths and those from one parent are reported, a most likely allele combination for one offspring allele and one parental allele is chosen as in C. Dashed lines indicate error margins of 5% from expected. E The distribution of allele1 (longest allele in sample) copy numbers among approximately 50% of the 1KGP samples. F The distribution of allele2 copy numbers (shortest allele in sample)
Fig. 3
Fig. 3
Comparison of SNV markers with CN states of KIV-2. Samples from 1KGP were genotyped for 12 SNVs which have been reported as associated with different Lp(a) levels or copy number states. KIV-2 copy number was compared between groups of samples homozygous for the major (more common) or minor (less common) alleles of these SNVs, to test for association between SNV status and KIV-2 copy number. p-values were obtained by Kolmogorov–Smirnov tests and corrected for multiple testing by the Bonferroni method. Points colored by ancestry population of the sample
Fig. 4
Fig. 4
Overview of ARIC and Hispanic Community Health Study/Study of Latinos (HCHS/SOL) cohorts across the USA: A Total copy numbers across the individuals for African-American, European-American and Hispanic-American populations. B Difference between allele1 copy number and allele2 copy number among various populations C) and D distribution of allele1 and allele2 copy numbers. In each diploid genome with haplotype resolved KIV-2 calls, allele1 is the longer allele and allele2 is the shorter
Fig. 5
Fig. 5
Comparison of SNV markers with CN states of KIV-2. Samples from ARIC and SOL cohorts were genotyped for nine SNVs which have been reported as associated with different Lp(a) levels or copy number states. KIV-2 copy number was compared between groups of samples homozygous for the major (more common) or minor (less common) alleles of these SNVs, to test for association between SNV status and KIV-2 copy number. p-values were obtained by Kolmogorov–Smirnov tests and corrected for multiple testing by the Bonferroni method. Points colored by ancestry population of the sample
Fig. 6
Fig. 6
Association between KIV-2 CN estimates and Lp(a) measurement (in nmol/L): A The mean Lp(a) level vs CN quartiles for African-American (p-value = 4.03e-47) and European-American (p-value = 2.15e-46) samples from ARIC cohort, and Hispanic-American samples from HCHS/SOL cohort (p-value = 2.64e-42). B The mean LPA KIV-2 CN vs Lp(a) levels of three groups based on their diagnostic cutoff values of Lipid level i.e. < 75 nmol/L as low, > 125 nmol/L as high and in between 75 nmol/L and 125 nmol/L as medium. C Allele specific scatterplot of allele CN differences for KIV-2 CN and their impact on diagnostic cutoff values of Lipid levels per population. Allele 2 (Y-axis) is always reported as the smaller allele

Update of

References

    1. Benjamin EJ, et al. Heart Disease and Stroke Statistics-2019 Update: A Report From the American Heart Association. Circulation. 2019;139:e56–528. - PubMed
    1. Rosamond W, et al. Heart disease and stroke statistics–2007 update: a report from the American Heart Association Statistics Committee and Stroke Statistics Subcommittee. Circulation. 2007;115:e69–171. - PubMed
    1. Said MA, Verweij N, van der Harst P. Associations of Combined Genetic and Lifestyle Risks With Incident Cardiovascular Disease and Diabetes in the UK Biobank Study. JAMA Cardiol. 2018;3:693–702. - PMC - PubMed
    1. Khera AV, et al. Genetic Risk, Adherence to a Healthy Lifestyle, and Coronary Disease. N Engl J Med. 2016;375:2349–58. - PMC - PubMed
    1. Shen M, et al. Associations of combined lifestyle and genetic risks with incident psoriasis: A prospective cohort study among UK Biobank participants of European ancestry. Journal of the American Academy of Dermatology Preprint at. 2022. 10.1016/j.jaad.2022.04.006. - PubMed

LinkOut - more resources