Manara - Qatar Research Repository
Browse
DOCUMENT
10.3389_fgene.2022.982930.pdf (1.44 MB)
DATASET
supp_Table 1 (2).xlsx (8.99 MB)
DOCUMENT
supp_Data Sheet 1.docx (1.3 MB)
1/0
3 files

Gene-specific machine learning model to predict the pathogenicity of BRCA2 variants

journal contribution
submitted on 2024-04-21, 10:30 and posted on 2024-04-21, 10:31 authored by Mohannad N. Khandakji, Borbala Mifsud

Background

Existing BRCA2-specific variant pathogenicity prediction algorithms focus on the prediction of the functional impact of a subtype of variants alone. General variant effect predictors are applicable to all subtypes, but are trained on putative benign and pathogenic variants and do not account for gene-specific information, such as hotspots of pathogenic variants. Local, gene-specific information have been shown to aid variant pathogenicity prediction; therefore, our aim was to develop a BRCA2-specific machine learning model to predict pathogenicity of all types of BRCA2 variants.


Methods

We developed an XGBoost-based machine learning model to predict pathogenicity of BRCA2 variants. The model utilizes general variant information such as position, frequency, and consequence for the canonical BRCA2 transcript, as well as deleteriousness prediction scores from several tools. We trained the model on 80% of the expert reviewed variants by the Evidence-Based Network for the Interpretation of Germline Mutant Alleles (ENIGMA) consortium and tested its performance on the remaining 20%, as well as on an independent set of variants of uncertain significance with experimentally determined functional scores.


Results

The novel gene-specific model predicted the pathogenicity of ENIGMA BRCA2 variants with an accuracy of 99.9%. The model also performed excellently on predicting the functional consequence of the independent set of variants (accuracy was up to 91.3%).


Conclusion

This new, gene-specific model is an accurate method for interpreting the pathogenicity of variants in the BRCA2 gene. It is a valuable addition for variant classification and can prioritize unreviewed variants for functional analysis or expert review.


Other Information

Published in: Frontiers in Genetics
License: https://creativecommons.org/licenses/by/4.0/
See article on publisher's website: https://dx.doi.org/10.3389/fgene.2022.982930

History

Language

  • English

Publisher

Frontiers

Publication Year

  • 2022

License statement

This Item is licensed under the Creative Commons Attribution 4.0 International License.

Institution affiliated with

  • Hamad Bin Khalifa University
  • College of Health and Life Sciences - HBKU
  • Hamad Medical Corporation

Usage metrics

    College of Health and Life Sciences - HBKU

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC