The Frailty Index (FI) is a well-established clinical tool for assessing elderly health. Recent efforts have aimed to refine FI for diverse settings or simplify its collection, enabling more efficient population screening. Motivated by this clinical challenge, this is the first study that aims to enhance both index efficiency and practicality by proposing a data pipeline composed of: (i) a nonlinear feature selection method to identify the most relevant variables for index prediction; (ii) Multi-Objective Symbolic Regression, a symbolic machine learning technique to generate simplified index formulas that replicate the original index's values, distribution, and risk stratification ability using fewer variables and (iii) rigorous model evaluation through calibration, temporal correlation and associations with related clinical outcomes. We tailored our approach to streamlining the 53-item FI for people living with HIV (PLWH), in use at the Modena HIV Metabolic Clinic, utilizing electronic health records from a public hospital clinic providing data on about 4,800 patients. Several reduced FI (rFI) formulas were derived, with the simplest model, rFI(16), relying on just 16 indicators. Validation confirmed that the rFIs successfully replicate FI statistical properties and screening power also maintaining consistent relations with other geriatric outcomes. By facilitating index distillation from retrospective data, our method offers broad adaptability to various clinical case studies.
Distilling Clinical Scores with Symbolic Machine Learning: Frailty Index, an HIV Cohort Case Study / Guidetti, V.; Motta, F.; Milic, J.; Ferrari, D.; Guaraldi, G.; Mandreoli, F.. - (2025), pp. 164-173. ( 13th IEEE International Conference on Healthcare Informatics, ICHI 2025 ita 2025) [10.1109/ICHI64645.2025.00027].
Distilling Clinical Scores with Symbolic Machine Learning: Frailty Index, an HIV Cohort Case Study
Guidetti V.;Motta F.;Milic J.;Guaraldi G.;Mandreoli F.
2025
Abstract
The Frailty Index (FI) is a well-established clinical tool for assessing elderly health. Recent efforts have aimed to refine FI for diverse settings or simplify its collection, enabling more efficient population screening. Motivated by this clinical challenge, this is the first study that aims to enhance both index efficiency and practicality by proposing a data pipeline composed of: (i) a nonlinear feature selection method to identify the most relevant variables for index prediction; (ii) Multi-Objective Symbolic Regression, a symbolic machine learning technique to generate simplified index formulas that replicate the original index's values, distribution, and risk stratification ability using fewer variables and (iii) rigorous model evaluation through calibration, temporal correlation and associations with related clinical outcomes. We tailored our approach to streamlining the 53-item FI for people living with HIV (PLWH), in use at the Modena HIV Metabolic Clinic, utilizing electronic health records from a public hospital clinic providing data on about 4,800 patients. Several reduced FI (rFI) formulas were derived, with the simplest model, rFI(16), relying on just 16 indicators. Validation confirmed that the rFIs successfully replicate FI statistical properties and screening power also maintaining consistent relations with other geriatric outcomes. By facilitating index distillation from retrospective data, our method offers broad adaptability to various clinical case studies.Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris




