Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar 17;26(1):84.
doi: 10.1186/s12859-025-06056-w.

Transfer learning for accelerated failure time model with microarray data

Affiliations

Transfer learning for accelerated failure time model with microarray data

Yan-Bo Pei et al. BMC Bioinformatics. .

Abstract

Background: In microarray prognostic studies, researchers aim to identify genes associated with disease progression. However, due to the rarity of certain diseases and the cost of sample collection, researchers often face the challenge of limited sample size, which may prevent accurate estimation and risk assessment. This challenge necessitates methods that can leverage information from external data (i.e., source cohorts) to improve gene selection and risk assessment based on the current sample (i.e., target cohort).

Method: We propose a transfer learning method for the accelerated failure time (AFT) model to enhance the fit on the target cohort by adaptively borrowing information from the source cohorts. We use a Leave-One-Out cross validation based procedure to evaluate the relative stability of selected genes and overall predictive power.

Conclusion: In simulation studies, the transfer learning method for the AFT model can correctly identify a small number of genes, its estimation error is smaller than the estimation error obtained without using the source cohorts. Furthermore, the proposed method demonstrates satisfactory accuracy and robustness in addressing heterogeneity across the cohorts compared to the method that directly combines the target and the source cohorts in the AFT model. We analyze the GSE88770 and GSE25055 data using the proposed method. The selected genes are relatively stable, and the proposed method can make an overall satisfactory risk prediction.

Keywords: Auxiliary studies; Gene expression data; Survival analysis; Transfer learning; Weighted least squares.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not applicable. Competing interests: The authors declare that they have no Conflict of interest.

Figures

Algorithm 1
Algorithm 1
Process of Trans-AFT Algorithm
Fig. 1
Fig. 1
Sum of absolute estimation errors of the Lasso-AFT, Pooled-AFT, Trans-AFT method with homogeneous designs under two configurations. The proportion of censoring is set to 20%, 50%, 70%. The y axis corresponds to b-β1 for some estimator b
Fig. 2
Fig. 2
Sum of absolute estimation errors of the Lasso-AFT, Pooled-AFT, Trans-AFT method with heterogeneous designs under two configurations. The proportion of censoring is set to 20%, 50%, 70%. The y axis corresponds to b-β1 for some estimator b
Fig. 3
Fig. 3
Flow diagram of date processing, analysis, and evaluation
Fig. 4
Fig. 4
GSE88770 data: occurrence index of individual genes selected by proposed transfer learning method
Fig. 5
Fig. 5
GSE88770 data: K-M survival curves of two hypothetical risk groups identified by proposed transfer learning method
Fig. 6
Fig. 6
GSE25055 data: occurrence index of individual genes selected by proposed transfer learning method
Fig. 7
Fig. 7
GSE25055 data: K-M survival curves of two hypothetical risk groups identified by proposed transfer learning methods
Fig. 8
Fig. 8
GSE25065 data: K-M survival curves of two hypothetical risk groups identified by proposed transfer learning methods

References

    1. Alizadeh AA, Eisen MB, Davis RE, et al. Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature. 2000;403(6769):503–11. 10.1038/35000501. - PubMed
    1. Rosenwald A, Wright G, Wiestner A, et al. The proliferation gene expression signature is a quantitative integrator of oncogenic events that predicts survival in mantle cell lymphoma. Cancer Cell. 2003;3(2):185–97. 10.1016/s1535-6108(03)00028-x. - PubMed
    1. Cristofanilli M, Angulo AG, Sneige N, et al. Invasive lobular carcinoma classic type: response to primary chemotherapy and survival outcomes. J Clin Oncol. 2005;23(1):185–97. 10.1200/JCO.2005.03.111. - PubMed
    1. Arpino G, Bardou VJ, Clark GM, et al. Infiltrating lobular carcinoma of the breast: tumor characteristics and clinical outcome. Breast Cancer Res. 2004;6(3):149–56. 10.1186/bcr767. - PMC - PubMed
    1. Lamovec J, Bracko M. Metastatic pattern of infiltrating lobular carcinoma of the breast: an autopsy study. J Surg Oncol. 1991;48(1):28–33. 10.1002/jso.2930480106. - PubMed

LinkOut - more resources