Volume 182, Issue 2 p. 380-395
THEMED ISSUE ARTICLE
Open Access

MicroRNA-guided drug discovery for mitigating persistent pulmonary complications in critical COVID-19 survivors: A longitudinal pilot study

María C. García-Hidalgo

María C. García-Hidalgo

Translational Research in Respiratory Medicine, University Hospital Arnau de Vilanova and Santa Maria, IRBLleida, Lleida, Spain

CIBER of Respiratory Diseases (CIBERES), Institute of Health Carlos III, Madrid, Spain

Contribution: Conceptualization (equal), Formal analysis (equal), ​Investigation (equal), Methodology (equal), Software (equal), Visualization (equal), Writing - original draft (equal), Writing - review & editing (equal)

Search for more papers by this author
Iván D. Benítez

Iván D. Benítez

Translational Research in Respiratory Medicine, University Hospital Arnau de Vilanova and Santa Maria, IRBLleida, Lleida, Spain

CIBER of Respiratory Diseases (CIBERES), Institute of Health Carlos III, Madrid, Spain

Contribution: Formal analysis (equal), ​Investigation (equal), Software (equal), Writing - original draft (equal), Writing - review & editing (equal)

Search for more papers by this author
Manel Perez-Pons

Manel Perez-Pons

Translational Research in Respiratory Medicine, University Hospital Arnau de Vilanova and Santa Maria, IRBLleida, Lleida, Spain

CIBER of Respiratory Diseases (CIBERES), Institute of Health Carlos III, Madrid, Spain

Contribution: Methodology (equal), Writing - review & editing (equal)

Search for more papers by this author
Marta Molinero

Marta Molinero

Translational Research in Respiratory Medicine, University Hospital Arnau de Vilanova and Santa Maria, IRBLleida, Lleida, Spain

CIBER of Respiratory Diseases (CIBERES), Institute of Health Carlos III, Madrid, Spain

Contribution: Visualization (supporting), Writing - review & editing (equal)

Search for more papers by this author
Thalía Belmonte

Thalía Belmonte

Translational Research in Respiratory Medicine, University Hospital Arnau de Vilanova and Santa Maria, IRBLleida, Lleida, Spain

CIBER of Respiratory Diseases (CIBERES), Institute of Health Carlos III, Madrid, Spain

Contribution: Writing - review & editing (equal)

Search for more papers by this author
Carlos Rodríguez-Muñoz

Carlos Rodríguez-Muñoz

Translational Research in Respiratory Medicine, University Hospital Arnau de Vilanova and Santa Maria, IRBLleida, Lleida, Spain

CIBER of Respiratory Diseases (CIBERES), Institute of Health Carlos III, Madrid, Spain

Contribution: Writing - review & editing (equal)

Search for more papers by this author
María Aguilà

María Aguilà

Translational Research in Respiratory Medicine, University Hospital Arnau de Vilanova and Santa Maria, IRBLleida, Lleida, Spain

Contribution: Data curation (equal)

Search for more papers by this author
Sally Santisteve

Sally Santisteve

Translational Research in Respiratory Medicine, University Hospital Arnau de Vilanova and Santa Maria, IRBLleida, Lleida, Spain

CIBER of Respiratory Diseases (CIBERES), Institute of Health Carlos III, Madrid, Spain

Contribution: Data curation (equal)

Search for more papers by this author
Gerard Torres

Gerard Torres

CIBER of Respiratory Diseases (CIBERES), Institute of Health Carlos III, Madrid, Spain

Group of Precision Medicine in Chronic Diseases, University Hospital Arnau de Vilanova and Santa Maria, IRBLleida, Lleida, Spain

Contribution: Conceptualization (equal), ​Investigation (equal), Writing - review & editing (equal)

Search for more papers by this author
Anna Moncusí-Moix

Anna Moncusí-Moix

Group of Precision Medicine in Chronic Diseases, University Hospital Arnau de Vilanova and Santa Maria, IRBLleida, Lleida, Spain

Contribution: Data curation (equal)

Search for more papers by this author
Clara Gort-Paniello

Clara Gort-Paniello

Translational Research in Respiratory Medicine, University Hospital Arnau de Vilanova and Santa Maria, IRBLleida, Lleida, Spain

CIBER of Respiratory Diseases (CIBERES), Institute of Health Carlos III, Madrid, Spain

Contribution: Data curation (equal)

Search for more papers by this author
Rafael Peláez

Rafael Peláez

Biomarkers and Molecular Signaling Group, Neurodegenerative Diseases Area Center for Biomedical Research of La Rioja, CIBIR, Logroño, Spain

Contribution: Writing - review & editing (equal)

Search for more papers by this author
Ignacio M. Larráyoz

Ignacio M. Larráyoz

Biomarkers and Molecular Signaling Group, Neurodegenerative Diseases Area Center for Biomedical Research of La Rioja, CIBIR, Logroño, Spain

BIAS, Department of Nursing, University of La Rioja, Logroño, Spain

Contribution: Formal analysis (equal), Writing - review & editing (equal)

Search for more papers by this author
Jesús Caballero

Jesús Caballero

Grup de Recerca Medicina Intensiva, Intensive Care Department Hospital Universitari Arnau de Vilanova, Lleida, Spain

Contribution: Resources (equal)

Search for more papers by this author
Carme Barberà

Carme Barberà

Intensive Care Department, University Hospital Santa María, IRBLleida, Lleida, Spain

Contribution: Resources (equal)

Search for more papers by this author
Estefania Nova-Lamperti

Estefania Nova-Lamperti

Molecular and Translational Immunology Laboratory, Department of Clinical Biochemistry and Immunology, Faculty of Pharmacy, Universidad de Concepcion, Concepcion, Chile

Contribution: Writing - review & editing (equal)

Search for more papers by this author
Antoni Torres

Antoni Torres

CIBER of Respiratory Diseases (CIBERES), Institute of Health Carlos III, Madrid, Spain

Pneumology Department, Clinic Institute of Thorax (ICT), Hospital Clinic of Barcelona, Insitut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), ICREA, University of Barcelona (UB), Barcelona, Spain

Contribution: Conceptualization (equal), Funding acquisition (equal), ​Investigation (equal), Writing - review & editing (equal)

Search for more papers by this author
Jessica González

Jessica González

Translational Research in Respiratory Medicine, University Hospital Arnau de Vilanova and Santa Maria, IRBLleida, Lleida, Spain

CIBER of Respiratory Diseases (CIBERES), Institute of Health Carlos III, Madrid, Spain

Contribution: Conceptualization (equal), Data curation (lead), Funding acquisition (equal), ​Investigation (equal), Writing - review & editing (equal)

Search for more papers by this author
Ferran Barbé

Ferran Barbé

Translational Research in Respiratory Medicine, University Hospital Arnau de Vilanova and Santa Maria, IRBLleida, Lleida, Spain

CIBER of Respiratory Diseases (CIBERES), Institute of Health Carlos III, Madrid, Spain

Contribution: Conceptualization (equal), Funding acquisition (equal), ​Investigation (equal), Writing - review & editing (equal)

Search for more papers by this author
David de Gonzalo-Calvo

Corresponding Author

David de Gonzalo-Calvo

Translational Research in Respiratory Medicine, University Hospital Arnau de Vilanova and Santa Maria, IRBLleida, Lleida, Spain

CIBER of Respiratory Diseases (CIBERES), Institute of Health Carlos III, Madrid, Spain

Correspondence

David de Gonzalo-Calvo, PhD, Translational Research in Respiratory Medicine, University Hospital Arnau de Vilanova and Santa Maria, IRBLleida. Avda, Alcalde Rovira Roure 80, 25198 Lleida, Spain.

Email: [email protected]

Contribution: Conceptualization (lead), Formal analysis (lead), Funding acquisition (equal), ​Investigation (lead), Methodology (supporting), Supervision (lead), Visualization (lead), Writing - original draft (lead), Writing - review & editing (lead)

Search for more papers by this author
First published: 15 February 2024
Citations: 4

Funding information: This work is supported by Instituto de Salud Carlos III (COV20/00110), co-funded by the European Union and Centro de Investigación Biomedica En Red – Enfermedades Respiratorias (CIBERES). CIBERES is an initiative of the Instituto de Salud Carlos III. We were further funded by Programa de donaciones “estar preparados” UNESPA (Madrid, Spain), Fundación Francisco Soria Melguizo (Madrid, Spain), Beca SEPAR – Ayuda a la investigación (1437/2023), Beca SEPAR – Ayuda a la investigación (1222/2022) and Beca SOCAP – Investigador emergent. MCGH is the recipient of a predoctoral fellowship from “Universitat de Lleida.” MM is the recipient of a predoctoral fellowship (PFIS: FI21/00187) from Instituto de Salud Carlos III and co-funded by the European Union. MPP is the recipient of a predoctoral fellowship (PFIS: FI23/00022) from Instituto de Salud Carlos III and co-funded by the European Union. CR is supported by Departament de Salut (Pla Estratègic de Recerca i Innovació en Salut (PERIS): SLT028/23/000191). IML is supported by a Miguel Servet contract (CPII20/00029) from the Instituto de Salud Carlos III, co-funded by the European Union. DdGC has received financial support from Instituto de Salud Carlos III (Miguel Servet 2020: CP20/00041), co-funded by the European Union. Funders did not participate in any part of the research design, conduct, analysis and manuscript production.

Abstract

Background and Purpose

The post-acute sequelae of SARS-CoV-2 infection pose a significant global challenge, with nearly 50% of critical COVID-19 survivors manifesting persistent lung abnormalities. The lack of understanding about the molecular mechanisms and effective treatments hampers their management. Here, we employed microRNA (miRNA) profiling to decipher the systemic molecular underpinnings of the persistent pulmonary complications.

Experimental Approach

We conducted a longitudinal investigation including 119 critical COVID-19 survivors. A comprehensive pulmonary evaluation was performed in the short-term (median = 94.0 days after hospital discharge) and long-term (median = 358 days after hospital discharge). Plasma miRNAs were quantified at the short-term evaluation using the gold-standard technique, RT-qPCR. The analyses combined machine learning feature selection techniques with bioinformatic investigations. Two additional datasets were incorporated for validation.

Key Results

In the short-term, 84% of the survivors exhibited impaired lung diffusion (DLCO < 80% of predicted). One year post-discharge, 54.4% of this patient subgroup still presented abnormal DLCO. Four feature selection methods identified two specific miRNAs, miR-9-5p and miR-486-5p, linked to persistent lung dysfunction. The downstream experimentally validated targetome included 1473 genes, with heterogeneous enriched pathways associated with inflammation, angiogenesis and cell senescence. Validation studies using RNA-sequencing and proteomic datasets emphasized the pivotal roles of cell migration and tissue repair in persistent lung dysfunction. The repositioning potential of the miRNA targets was limited.

Conclusion and Implications

Our study reveals early mechanistic pathways contributing to persistent lung dysfunction in critical COVID-19 survivors, offering a promising approach for the development of targeted disease-modifying agents.

LINKED ARTICLES

This article is part of a themed issue Non-coding RNA Therapeutics. To view the other articles in this section visit http://onlinelibrary.wiley.com/doi/10.1111/bph.v182.2/issuetoc

Graphical Abstract

Abbreviations

  • ARDS
  • acute respiratory distress syndrome
  • DLCO
  • lung diffusing capacity for carbon monoxide
  • IMV
  • invasive mechanical ventilation
  • KIM1
  • kidney injury molecule 1
  • PTN
  • pleiotrophin
  • What is already known

    • The molecular basis of the pulmonary sequelae persistence remains unclear.
    • Treating respiratory complications in post-severe infections is challenged by ineffective therapies.

    What does this study add

    • Plasma levels of miR-9-5p and miR-486-5p are associated with long-term lung dysfunction.
    • At the systemic level, the molecular signature of persistent respiratory sequelae involves diverse pathobiological mechanisms.

    What is the clinical significance

    • Heterogeneity could hinder patient management, contributing to limited treatment responses and unfavourable clinical trial outcomes.
    • miRNA-based therapeutics offer a promising avenue for addressing the persistence of lung dysfunction.

    1 INTRODUCTION

    The post-acute sequelae of SARS-CoV-2 are increasingly recognized as a global societal challenge, given their impact on individual well-being and their substantial burden on healthcare systems and the economy (Kerksieck et al., 2023). This condition encompasses a wide spectrum of multiorgan symptoms and complications that manifest following the recovery from acute COVID-19 (post-acute COVID-19 syndrome, post-COVID syndrome or long COVID). The core symptoms include persistent fatigue, anosmia, memory loss, muscle pain and respiratory difficulties, among others (Nalbandian et al., 2021). The presence of post-acute organ damage following SARS-CoV-2 infection primarily pertains to pulmonary complications (Singh et al., 2023), spanning from self-limited abnormalities to advanced-stage pulmonary pathology.

    Lung dysfunction has arisen as the hallmark in convalescent individuals recovering from severe COVID-19, with the lung diffusing capacity for carbon monoxide (DLCO) representing the most compromised parameter of pulmonary function in hospitalized patients. Data from various patient cohorts indicate a prevalence of impaired diffusing capacity of around 50% even 1 year after hospital discharge (Schlemmer et al., 2023). This observation aligns with findings from investigations into other coronaviruses, influenza and acute respiratory distress syndrome (ARDS) (Herridge et al., 2003). Although the number of ongoing clinical trials and therapies under development continues to grow, there are no widely accepted treatment options for managing pulmonary lesions in survivors of severe respiratory infections, including critical COVID-19. For instance, there is insufficient evidence to support the use of anti-fibrotic agents in resolving pulmonary fibrosis that results from infections such as SARS-CoV-2 (Mylvaganam et al., 2021). Understanding the pathobiological mechanisms contributing to the persistence of lung dysfunction after hospital discharge is therefore of paramount importance. This molecular approach would allow the development of new therapeutic strategies or the repositioning of previously approved drugs for early and targeted interventions, avoiding exposure to low-effective disease-modifying agents.

    MicroRNAs (miRNAs), a class of small noncoding RNAs consisting of 19–25 nucleotides, play a pivotal role in post-transcriptional gene expression regulation by binding to target mRNAs, thereby suppressing translation and/or inducing target mRNA degradation. miRNAs form coordinated and complex gene expression networks (Makkos et al., 2021), with a single miRNA capable of targeting numerous transcripts, and a specific transcript containing binding sites for multiple miRNAs. Consequently, miRNAs are increasingly recognized as key regulators of virtually all cellular processes. Evidence from in vitro and in vivo studies underscores their contribution to disease pathogenesis, including respiratory conditions (Rupani et al., 2013). Moreover, miRNAs have been detected in the extracellular milieu and various biofluids (Valadi et al., 2007), where they participate in cell-to-cell communication, mainly through paracrine signalling (Li et al., 2021). While the concept of miRNAs acting as hormones has been a subject of debate (Bär et al., 2019), numerous studies conducted over the years have provided compelling evidence of their role as endocrine genetic signals (Ying et al., 2021).

    Altogether, miRNA profiling has emerged as a promising tool for comprehensively unravelling disease mechanisms and identifying potential interventions (Jusic et al., 2022). In this study, we employed circulating miRNA profiling in conjunction with machine learning feature selection techniques to characterize, for the first time, the early mechanistic pathways associated with the persistence of lung dysfunction (1 year after hospital discharge) in survivors of critical COVID-19. The ultimate goal was to propose novel therapeutic candidates aimed at mitigating long-term pulmonary abnormalities.

    2 METHODS

    2.1 Study population

    This was a substudy of the CIBERESUCICOVID multicentre study (Torres et al., 2021). Survivors of critical COVID-19 admitted to intensive care units (ICU) between March and December 2020 and subsequently evaluated in a post-COVID-19 consultation in Hospital Universitari Arnau de Vilanova-Santa María (Lleida, Spain) were enrolled if they fulfilled the following inclusion criteria: positive nasopharyngeal swab PCR for SARS-CoV-2, aged over 18, developed ARDS according to the Berlin definition (Ranieri et al., 2012) and presence of diffusion impairment in the short-term evaluation of the follow-up (defined as a DLCO < 80%). The exclusion criteria included previous chronic pulmonary disease, died during the follow-up, transferred to another department or institution, incomplete follow-up and presence of non-COVID alterations that could hamper the follow-up or alter the respiratory functional tests (Figure S1). Demographic, clinical, pharmacological and laboratory data were extracted from the electronic medical records and introduced in a REDCap database. Incoherent or missing data were identified and reviewed by experienced researchers. Publicly available whole blood RNA-sequencing (https://www.ncbi.nlm.nih.gov/geo/GSE228320) (García-Hidalgo, Peláez, et al., 2022) and plasma proteomic datasets (https://zenodo.org/record/8375667) (García-Hidalgo, González, Benítez, Carmona, Santisteve, Moncusí-Moix, et al., 2022) were used in the validation studies.

    The study protocol was approved by the medical ethics committee of Hospital Universitari Arnau de Vilanova (CEIC/2273). The study was performed in full compliance with the Declaration of Helsinki (World Medical Association, 2013). The patients were provided with written information about the nature and goals of the study and signed an informed consent form.

    2.2 Pulmonary evaluation

    A complete pulmonary evaluation was performed as previously detailed (González et al., 2021, 2022, 2023). Patients with diffusion impairment in the short-term visit (median [P25; P75] of 94.0 [83.8;105] days after hospital discharge) entered in the follow-up. Those patients that attended the long-term visit (median [P25; P75] of 358 [302;379] days after hospital discharge) completed the study population. In brief, the consultation comprised the assessment of general and respiratory symptoms, chest computed tomography and a 6-min walking test (6MWT). Respiratory function (diffusion capacity and lung volumes) was evaluated using a flow spirometer (MasterScreen, Jaeger, Germany) in agreement with the guidelines of the American Thoracic Society (ATS) (Celli et al., 2004). DLCO was calculated as a percentage of the predicted DLCO value according to the European Community Lung Health Survey (Roca et al., 1998). Lung diffusion impairment was defined as a DLCO value under 80% of predicted, based on ATS/ERS task force: standardization of lung function testing (Pellegrino et al., 2005).

    Patients who completed the 1-year follow-up were stratified, taking into consideration the persistence of diffusion impairment in the long-term visit: (i) “Recovery” (DLCO ≥ 80% of predicted); (ii) “Persistence” (DLCO < 80% of predicted).

    2.3 Plasma microRNA profiling

    Plasma samples were processed under standardized conditions with support from the IRBLleida Biobank (B.000682) and Biobank and Biomodels Platform ISCIII PT20/00021. Venous blood samples were collected by venipuncture in EDTA tubes (BD, NJ, USA) after a night of fasting and before beginning the pulmonary evaluation at the short-term visit. Blood samples were centrifuged at 1500 × g for 10 min at room temperature. Plasma supernatant was immediately aliquoted and stored at −80°C.

    Total RNA was isolated from 200 μl of frozen plasma using the miRNeasy Serum/Plasma Advanced kit (Qiagen, Hilden, Germany). As a normalization strategy, synthetic Caenorhabditis elegans miR-39-3p (cel-miR-39-3p) was added as an external reference miRNA (1.6 × 108 copies per microlitre). The mixture was supplemented with 1 μg of MS2 carrier RNA (Roche Diagnostics, Mannheim, Germany) to improve extracellular miRNA yielding. The RNA Spike-In Kit (UniSp2, UniSp4 and UniSp5) (Qiagen, Hilden, Germany) was added as quality control of the RNA isolation. All reagents were spiked into samples during RNA isolation after incubation with the denaturing solution. The isolated RNA was eluted in 20 μl of RNAse-free water and stored at −80°C.

    miRNA quantification was performed in agreement with the protocol from the miRCURY LNA Universal RT microRNA PCR System (Qiagen, Hilden, Germany). Reverse transcription (RT) to synthesize complementary DNA (cDNA) was conducted using a miRCURY LNA RT Kit (Qiagen, Hilden, Germany) in a total reaction volume of 10 μl. The spike-in UniSp6 (Qiagen, Hilden, Germany) was added to monitor the RT reaction. The RT conditions were the following: incubation at 42°C during 60 min, inactivation at 95°C during 5 min and immediate cooling at 4°C. Then, cDNA was kept at −20°C. A 41-miRNA panel, including mediators associated with mechanistic pathways altered in COVID-19 and its respiratory sequelae (de Gonzalo-Calvo et al., 2021), was quantified using miRCURY LNA miRNA Custom Panels (384-well plates) (Qiagen, Hilden, Germany). qPCR was performed with the QuantStudio™ 7 Flex Real-Time PCR System (Applied Biosystems, Waltham, MA, USA) in a volume of 10 μl. RT-qPCR conditions were 95°C during 2 min, 40 cycles of 95°C during 10 s followed by 56°C during 1 min and melting curve analysis. Synthetic UniSp3 was used as an interplate calibrator and qPCR control. Amplification curves were assessed using QuantStudio Software v1.3 (Thermo Fisher Scientific, Massachusetts, USA). The presence of single products and the absence of primer dimer was corroborated by melting curve analysis.

    The quantification cycle (Cq) was defined as the fractional cycle number at which the fluorescence exceeded a given threshold. The Cq values of spike-in RNA templates were first analysed to monitor the homogenous efficiencies of RNA extraction procedures, the robustness of RT and the absence of PCR inhibitors. The ratio ΔCq (miR-23a-3p − miR-451a) proposed by Blondal et al. (2013) was used to exclude haemolysed samples. Cqs above 35 cycles were considered undetectable and censored at the minimum level observed for each miRNA. Relative quantification was performed using the 2−ΔCq method (ΔCq = CqmiRNA − Cqcel-miR-39-3p). Expression levels were log-transformed for statistical analysis.

    2.4 Data and statistical analysis

    The statistical analyses were performed using R software, version 4.1.2. Statistical analysis was undertaken only for studies where each group size was at least n = 5 and using independent values, that is, not treating technical replicates as independent values. Outlier analysis was conducted to identify incoherent data. Following thorough examination by experienced researchers, no outliers were excluded in the subsequent analysis. Descriptive statistics were used to outline the characteristics of the study population. The normality of the data was assessed using the Shapiro–Wilks test. Continuous variables were compared between groups using either the Mann–Whitney U-test or Student's t test, as appropriate, while categorical variables were compared using Fisher's exact test or chi-squared test. Data were presented as frequencies (percentage) for categorical variables and as medians [25th;75th percentile] or means (SD) for continuous variables. Pearson correlation was used to estimate the correlation between miRNAs. Differential gene expression analysis was conducted using the DESeq2 package (Love et al., 2014). Differences in protein levels were analysed using linear models with Empirical Bayes statistics (Ritchie et al., 2015). The P-value threshold defining statistical significance was set at <0.05.

    miRNAs associated with persistent lung diffusion impairment were identified using a consensus of four supervised machine learning feature selection techniques with the intersection among the methods selected as the candidate miRNAs. A prior filtration step was performed to eliminate redundant information by removing miRNAs highly correlated to each other (cutoff Pearson correlation coefficient [PCC] > 0.7).
    • Random Forest by Boruta: Boruta is a variable selection wrapper algorithm that employs random forest (RF) as the classification method (Kursa & Rudnicki, 2010). Boruta compared the importance of miRNAs to the importance of artificial shadow attributes, which were created by shuffling the original ones. The algorithm performed an iterative process (50 runs) that retained only those miRNAs exhibiting a higher importance value than their shadow counterparts. Finally, the attributes selected in any of the runs were incorporated. The method's classification performance was evaluated using a fivefold cross-validation (repeated 10 times) on a RF model. The number of variables randomly sampled as candidates at each split and the number of trees were set to their optimal values.
    • Random Forest by VSURF: VSURF is a stepwise feature selection process based on RF (Genuer et al., 2015). The algorithm comprised three steps: (i) elimination of miRNAs with low importance by ranking the average of the variable importance measures in 50 runs of RF; (ii) calculation of the Out-of-bag (OOB) error rates for 50 RF runs for each nested model. The miRNAs with the lowest OOB error rates were selected; (iii) selection of the final model was performed by conducting an ascending sequence of RF tests that considered the inclusion of each miRNA to remove the most redundant ones. The performance of the selected miRNAs was assessed in a manner similar to the Boruta algorithm.
    • Relaxed Least Absolute Shrinkage and Selection Operator (LASSO): LASSO is a regression analysis method that selects variables by fitting a regularized least squares model (Fonti & Belitser, 2017). It penalized the regression coefficients, shrinking some of them to zero. The miRNAs with nonzero coefficients after the shrinkage were selected for inclusion in the model. The lambda parameter (λ), which determines the strength of the penalty, was calculated through fivefold cross-validation. λ was selected as the value associated with the standard error greater than the minimum misclassification error. The variables were standardized before fitting the model. The model's performance was assessed using a 5-fold cross-validation (repeated 10 times).
    • Sparse partial least squares regression (sPLS): sPLS is a dimension reduction method that decomposes data matrices into component scores by imposing an L1 penalty (Lê Cao et al., 2008). The method selected miRNAs while considering the LASSO penalty on the loading vectors estimated by partial least squares regression. A fivefold cross-validation was conducted to calculate optimal values for the sparsity parameters and the number of components in the final model.

    The manuscript complies with BJP's recommendations and requirements on experimental design and analysis (Curtis et al., 2022).

    2.5 Bioinformatics

    Experimentally validated miRNA-gene interactions were extracted from TarBase v8 (Karagkouni et al., 2018). The search settings were restricted to interactions annotated from human, with Ensembl Gene IDs and from robust methods directly assessing miRNA binding. Protein–protein interaction (PPI) networks were constructed using the Search Tool for the Retrieval of Interacting Genes/Proteins database (STRING) (Szklarczyk et al., 2021). The minimum required interaction score was set at 0.900 (highest confidence). The functional enrichment analysis of the downstream target genes was conducted using the clusterProfiler 4.0 R package (Wu et al., 2021). ClusterProfiler library is embedded with updated annotations, such as Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) and Reactome, among others. The q value (false discovery rate [FDR] adjusted P value) threshold was set at <0.05. Lung cell expression patterns were defined using data from the Genotype-Tissue Expression (GTEx) Portal (https://www.gtexportal.org/home/). The Drug-Gene Interaction database (DGIdb v4.2.0) was used to predict interactions between FDA-approved drugs and miRNA targets (Freshour et al., 2021).

    2.6 Materials

    Details of materials and suppliers are provided in specific subsections in Methods.

    2.7 Nomenclature of targets and ligands

    Key protein targets in this article are hyperlinked to corresponding entries in https://www.guidetopharmacology.org, and are permanently archived in the Concise Guide to PHARMACOLOGY 2021/22(151–157).

    3 RESULTS

    Among the 119 survivors of critical COVID-19 included in the study, 84% exhibited impaired lung diffusion (DLCO < 80% of predicted) at the short-term evaluation after hospital discharge. This group of patients entered in the follow-up study. No significant differences were reported between patients included in the study and those unreachable to attend the follow-up, except for the respiratory profile at the short-term visit (Table S1). One year after hospital discharge, DLCO remained abnormal in 54.4% of these patients. Survivors with persistent diffusion impairment 1 year after discharge were older, had longer hospital and ICU stays and were more subjected to invasive mechanical ventilation (IMV) and prone positioning (Table 1). In the short-term visit, this group demonstrated significantly worse profiles concerning pulmonary function parameters, functional capacity and radiological findings (Table 2). As expected, a higher prevalence of diffusion impairment was observed in the long-term visit (Table 3), despite the improvement in the DLCO levels during the follow-up (Figure S2).

    TABLE 1. Sociodemographic, clinical and hospitalization characteristics based on the persistence of pulmonary dysfunction after 1 year of follow-up.
    All Recovery Persistence P value n
    n = 57 n = 26 n = 31
    Sociodemographic characteristics
    Age (years) 62.0 [50.0;67.0] 57.0 [47.2;62.0] 63.0 [57.0;68.0] 0.021 57
    Female 16 (28.1%) 9 (34.6%) 7 (22.6%) 0.477 57
    BMI (kg·m−2) 28.7 [26.1;32.9] 28.2 [25.1;31.0] 29.3 [26.8;33.1] 0.396 57
    Smoking history 0.193 57
    Former 29 (50.9%) 11 (42.3%) 18 (58.1%)
    Nonsmoker 26 (45.6%) 13 (50.0%) 13 (41.9%)
    Current 2 (3.51%) 2 (7.69%) 0 (0.00%)
    Clinical characteristics
    Hypertension 31 (54.4%) 12 (46.2%) 19 (61.3%) 0.381 57
    Type II diabetes mellitus 12 (21.1%) 4 (15.4%) 8 (25.8%) 0.525 57
    Obesity 23 (40.4%) 8 (30.8%) 15 (48.4%) 0.280 57
    Cardiovascular disease 3 (5.26%) 1 (3.85%) 2 (6.45%) 1.000 57
    Chronic lung disease 1 (1.75%) 0 (0.00%) 1 (3.23%) 1.000 57
    Asthma 5 (8.77%) 3 (11.5%) 2 (6.45%) 0.651 57
    Chronic kidney disease 1 (1.75%) 0 (0.00%) 1 (3.23%) 1.000 57
    ICU stay
    Time since first symptoms to ICU admission (days) 8.50 [7.00;10.0] 9.00 [8.00;10.0] 8.00 [6.50;10.0] 0.270 56
    Time since hospital admission to ICU admission (days) 1.00 [0.00;3.00] 3.00 [1.00;4.00] 0.00 [0.00;2.00] 0.007 56
    Hospital stay (days) 24.0 [15.0;39.0] 16.0 [11.2;30.2] 30.0 [23.0;44.5] 0.015 57
    ICU stay (days) 14.0 [6.00;30.2] 6.00 [5.00;22.0] 16.0 [11.0;32.0] 0.006 56
    Worst PaO2/FiO2 (mmHg) 134 [90.0;186] 137 [91.0;156] 118 [89.5;206] 0.987 57
    PaO2/FiO2 categories 0.313 57
    PaO2/FiO2 201–300 mmHg 13 (22.8%) 4 (15.4%) 9 (29.0%)
    PaO2/FiO2 101–200 mmHg 23 (40.4%) 13 (50.0%) 10 (32.3%)
    PaO2/FiO2 ≤ 100 mmHg 21 (36.8%) 9 (34.6%) 12 (38.7%)
    High-flow nasal cannula 37 (64.9%) 18 (69.2%) 19 (61.3%) 0.729 57
    Invasive mechanical ventilation 33 (57.9%) 9 (34.6%) 24 (77.4%) 0.003 57
    Invasive mechanical ventilation duration (days) 18.0 [9.75;31.2] 23.0 [17.0;32.0] 16.0 [9.50;30.0] 0.529 32
    Noninvasive mechanical ventilation 33 (57.9%) 12 (46.2%) 21 (67.7%) 0.169 57
    Noninvasive mechanical ventilation duration (days) 3.00 [2.00;5.00] 3.00 [2.00;4.25] 3.00 [2.00;6.00] 0.372 33
    Prone positioning 32 (56.1%) 8 (30.8%) 24 (77.4%) 0.001 57
    Prone positioning duration (hours) 41.0 [23.0;75.0] 37.5 [23.5;91.5] 41.0 [22.5;70.0] 0.701 31
    Antibiotics 49 (86.0%) 21 (80.8%) 28 (90.3%) 0.448 57
    Hydroxychloroquine 25 (43.9%) 10 (38.5%) 15 (48.4%) 0.628 57
    Tocilizumab 28 (49.1%) 10 (38.5%) 18 (58.1%) 0.227 57
    Corticoids 49 (86.0%) 21 (80.8%) 28 (90.3%) 0.448 57
    Remdesivir 13 (22.8%) 6 (23.1%) 7 (22.6%) 1.000 57
    Interferon beta 12 (22.2%) 5 (20.8%) 7 (23.3%) 1.000 54
    Lopinavir/ritonavir 25 (43.9%) 10 (38.5%) 15 (48.4%) 0.628 57
    Anticoagulants 55 (96.5%) 25 (96.2%) 30 (96.8%) 1.000 57
    Complications
    Bacterial pneumonia 6 (10.5%) 1 (4.00%) 5 (15.6%) 0.215 57
    Pulmonary thromboembolism 2 (3.51%) 1 (4.00%) 1 (3.12%) 1.000 57
    Bacteraemia 16 (28.1%) 4 (16.0%) 12 (37.5%) 0.135 57
    Acute kidney injury 16 (28.1%) 5 (20.0%) 11 (34.4%) 0.367 57
    Liver dysfunction 10 (17.5%) 2 (8.00%) 8 (25.0%) 0.160 57
    Hyperglycaemia 24 (42.1%) 9 (36.0%) 15 (46.9%) 0.579 57
    Infectious complication 16 (28.1%) 5 (20.0%) 11 (34.4%) 0.367 57
    Treatments after discharge
    Rehabilitation at discharge 21 (38.2%) 9 (36.0%) 12 (40.0%) 0.980 55
    Corticoids at discharge 17 (32.1%) 9 (36.0%) 8 (28.6%) 0.777 53
    • Note: Continuous variables are expressed as mean (SD) or median [P25;P75]. Categorical variables are expressed as n (%).
    • Abbreviations: BMI, body mass index; FiO2, fraction of inspired oxygen; ICU, intensive care unit; PaO2, oxygen partial pressure.
    TABLE 2. Pulmonary assessment at the short-term visit.
    All Recovery Persistence P value n
    n = 57 n = 26 n = 31
    Symptoms
    Dry cough 11 (20.0%) 2 (8.00%) 9 (30.0%) 0.091 55
    Wet cough 9 (16.4%) 4 (16.0%) 5 (16.7%) 1.000 55
    Dyspnoea 0.075 54
    0 25 (46.3%) 14 (56.0%) 11 (37.9%)
    1 16 (29.6%) 9 (36.0%) 7 (24.1%)
    2 11 (20.4%) 2 (8.00%) 9 (31.0%)
    3 2 (3.70%) 0 (0.00%) 2 (6.90%)
    Muscular fatigue 11 (20.0%) 4 (16.0%) 7 (23.3%) 0.735 55
    Pulmonary function
    FVC (%) 73.1 (14.2) 77.4 (10.6) 69.5 (15.9) 0.030 57
    FEV1 (%) 81.7 (17.0) 85.1 (13.2) 78.8 (19.4) 0.150 57
    FEV1/FVC 0.83 (0.06) 0.82 (0.06) 0.83 (0.06) 0.662 57
    TLC (%) 79.6 (19.4) 86.6 (17.0) 73.7 (19.6) 0.011 55
    DLCO (% of predicted) 60.7 (11.0) 68.9 (7.22) 53.8 (8.59) <0.001 57
    DLCO <0.001 57
    <60% of predicted 24 (42.1%) 2 (7.69%) 22 (71.0%)
    <80% of predicted 33 (57.9%) 24 (92.3%) 9 (29.0%)
    RV (%) 79.1 [66.2;95.6] 79.5 [71.4;87.2] 75.8 [57.8;100] 0.577 55
    KCO (%) 83.3 [75.8;91.2] 82.9 [75.3;91.1] 83.4 [75.9;90.0] 0.879 57
    6-min walking test
    Distance (m) 401 (99.7) 430 (89.8) 375 (102) 0.041 54
    Oxygen saturation average (%) 96.0 [94.0;96.0] 96.0 [95.0;97.0] 95.0 [93.2;96.0] 0.023 55
    Oxygen saturation initial (%) 96.0 [96.0;97.0] 97.0 [96.0;97.0] 96.0 [95.0;97.0] 0.209 55
    Oxygen saturation final (%) 96.0 [94.0;96.0] 96.0 [95.0;97.0] 95.0 [94.0;96.0] 0.043 55
    Oxygen saturation minimal (%) 94.0 [92.5;96.0] 95.0 [93.0;96.0] 94.0 [92.0;95.0] 0.101 55
    Chest CT
    Density
    Ground-glass 34 (60.7%) 15 (60.0%) 19 (61.3%) 1.000 56
    Mixed ground-glass 26 (46.4%) 11 (44.0%) 15 (48.4%) 0.954 56
    Consolidation 14 (25.0%) 4 (16.0%) 10 (32.3%) 0.277 56
    Lesions
    Reticular 20 (35.7%) 12 (48.0%) 8 (25.8%) 0.149 56
    Fibrotic 27 (48.2%) 6 (24.0%) 21 (67.7%) 0.003 56
    TSS score 7.88 (4.07) 5.64 (3.32) 9.68 (3.75) <0.001 56
    • Note: Continuous variables are expressed as mean (SD) or median [P25;P75]. Categorical variables are expressed as n (%).
    • Abbreviations: DLCO, carbon monoxide diffusing capacity; FEV1, forced expiratory volume during the first second; FVC, forced vital capacity; KCO, CO transfer coefficient; RV, residual volume; TLC, total lung capacity; TSS, total severity score.
    TABLE 3. Pulmonary assessment at the long-term visit.
    All Recovery Persistence P value n
    n = 57 n = 26 n = 31
    Symptoms
    Dry cough 16 (29.1%) 6 (24.0%) 10 (33.3%) 0.645 55
    Wet cough 9 (16.4%) 2 (8.00%) 7 (23.3%) 0.160 55
    Dyspnoea <0.001 55
    0 29 (52.7%) 20 (80.0%) 9 (30.0%)
    1 15 (27.3%) 0 (0.00%) 15 (50.0%)
    2 9 (16.4%) 4 (16.0%) 5 (16.7%)
    3 2 (3.64%) 1 (4.00%) 1 (3.33%)
    Muscular fatigue 10 (19.2%) 4 (16.0%) 6 (22.2%) 0.729 52
    Pulmonary function
    FVC (%) 86.6 [78.8;102] 87.0 [82.2;103] 86.0 [75.7;96.5] 0.344 57
    FEV1 (%) 83.8 (15.3) 83.7 (12.0) 83.9 (17.8) 0.966 57
    FEV1/FVC 82.1 (6.14) 82.9 (6.77) 81.5 (5.59) 0.408 57
    TLC (%) 81.0 [72.2;93.2] 83.8 [76.8;93.2] 73.5 [67.4;90.2] 0.094 42
    DLCO (% predicted) 74.9 (15.4) 87.9 (9.91) 63.9 (9.33) <0.001 57
    DLCO <0.001 57
    <60% of predicted 11 (19.3%) 0 (0.00%) 11 (35.5%)
    <80% of predicted 20 (35.1%) 0 (0.00%) 20 (64.5%)
    RV (%) 82.2 (30.0) 83.7 (33.0) 80.6 (27.1) 0.744 42
    KCO (%) 86.0 [74.0;97.0] 97.5 [87.5;103] 76.0 [68.5;85.0] <0.001 57
    6-min walking test
    Distance (m) 453 (96.0) 457 (87.8) 450 (104) 0.805 56
    Oxygen saturation average (%) 96.0 [95.0;96.0] 96.0 [95.0;97.0] 96.0 [94.8;96.0] 0.311 54
    Oxygen saturation initial (%) 96.0 [96.0;97.0] 96.5 [96.0;97.8] 96.0 [96.0;97.0] 0.385 55
    Oxygen saturation final (%) 95.0 [94.0;96.0] 95.0 [94.2;97.0] 95.0 [94.0;96.0] 0.319 54
    Oxygen saturation minimal (%) 94.5 [93.0;95.8] 95.0 [93.2;96.0] 94.0 [91.8;95.0] 0.223 54
    Chest CT
    Density
    Ground-glass 25 (43.9%) 12 (46.2%) 13 (41.9%) 0.959 57
    Mixed ground-glass 27 (47.4%) 8 (30.8%) 19 (61.3%) 0.042 57
    Consolidation 4 (7.02%) 2 (7.69%) 2 (6.45%) 1.000 57
    Lesions
    Reticular 29 (60.4%) 12 (54.5%) 17 (65.4%) 0.639 48
    Fibrotic 11 (22.9%) 4 (18.2%) 7 (26.9%) 0.709 48
    TSS score 5.00 [3.00;6.00] 3.50 [2.25;5.00] 5.00 [4.25;6.00] 0.074 48
    • Note: Continuous variables are expressed as mean (SD) or median [P25;P75]. Categorical variables are expressed as n (%).
    • Abbreviations: DLCO, carbon monoxide diffusing capacity; FEV1, forced expiratory volume during the first second; FVC, forced vital capacity; KCO, CO transfer coefficient; RV, residual volume; TLC, total lung capacity; TSS, total severity score.

    Our initial objective was to identify miRNAs associated with persistent lung dysfunction, defined as DLCO < 80% of predicted, 1 year after hospital discharge. To do that, we developed a feature selection process that combined four supervised machine learning-based methods with the circulating levels of miRNAs quantified at the short-term visit after hospital discharge. Because miRNAs could provide extremely redundant information, we undertook a prior filtration step, discarding those candidates highly correlated to each other (PCC > 0.7) (Figure S3). This step reduced the number of initial miRNAs to 12. The feature subset selected by Boruta and LASSO were similar (miR-9-5p and miR-486-5p) and different to the subsets provided by VSURF (miR-9-5p, miR-122-5p, miR-199a-5p, miR-214-3p and miR-486-5p) and sPLS (miR-9-5p, miR-214-3p and miR-486-5p) (Figure 1a). miR-9-5p and miR-486-5p were consistently selected by all four methods. An additional analysis was performed including those survivors who required the use of IMV during ICU stay. This subgroup of patients was considerably more homogenous in terms of both sociodemographic and clinical characteristics (Table S2). miR-486-5p was selected by Boruta, VSURF and sPLS (Figure S4).

    Details are in the caption following the image
    Molecular mechanisms associated with the persistence of diffusion impairment (DLCO < 80% of predicted) 1 year after hospital discharge. (a) Identification of microRNAs associated with persistent lung diffusion impairment using a consensus of four supervised machine learning feature selection techniques (Boruta, VSURF, LASSO and sPLS) with the intersected miRNAs among the methods selected as candidates. (b) Protein–protein interaction (PPI) network using the experimentally validated target genes extracted from TarBase v8. The PPI network was constructed using STRING DB v12.0. (c–e) Functional enrichment analysis considering the downstream targetome using KEGG (c), Reactome (d) and GO (e) of the R package ClusterProfile 4.0. The plots show the q value (false discovery rate [FDR] adjusted P value) of the top 25 pathways. The intensity of the colours of the bars represents the rich factor, which denotes the ratio of target genes annotated in the molecular processes to all genes annotated in the processes.

    Further evidence into the role of miRNAs in the pathogenesis of the respiratory sequelae and possible therapeutic candidates was evaluated through the analysis of the miRNA targets. The genes regulated by miR-9-5p and miR-486-5p were retrieved using experimentally validated miRNA-target interactions from DIANA-TarBase v8. The targetome of both miRNAs was composed by 1473 genes (1249 targets for miR-9-5p and 283 targets for miR-486-5p, with 59 targets in common) (Table S2). The interactions between the complete subset of genes were analysed using the STRING database. Supporting the complex gene expression network associated with miRNA biology, the targetome exhibited a high-confidence protein–protein interaction network with 1484 interactions (Figure 1b). To explore the molecular mechanisms associated with the persistence of lung dysfunction, the 1473 genes were subjected to functional enrichment analysis. We identified 23 KEGG pathways, 111 Reactome pathways and 474 GO processes (Tables S3, S4, and S5). The top-ranked pathways and processes were associated with inflammation, angiogenesis, senescence, tissue repair and remodelling, cell migration and pathogenesis of infectious agents, among other mechanisms (Figure 1c–e).

    While standard functional analyses can reveal meaningful biological features, several biases should be acknowledged (Garcia-Moreno & Carmona-Saez, 2020). Consequently, we sought to corroborate our findings using RNA-sequencing data derived from whole blood samples collected from survivors of critical COVID-19. The differential expression analysis included survivors with and without lung diffusion impairment 1 year after hospital discharge (median age = 62.0 [53.0;68.0] years, females = 29.4%, 76.5% with DLCO < 80% of predicted 1 year after hospital discharge, samples collected at a median of 97.0 [85.0;106] days after hospital discharge). Among the target genes, six transcripts were up-regulated (PXDN, FAT1, ITGA2, MXRA7, RNF187, TMEM200A) and five down-regulated (CHRM3, NELL2, RAI14, SLC24A2, PPP1R9A) in survivors with persistent lung dysfunction (Figure 2a). Functional enrichment analysis of the differentially expressed targets identified six overlapping pathways and processes, including those associated with cell migration and tissue repair, such as focal adhesion, signalling by MET and L1CAM interactions (Figure 2b and Tables S6, S7, and S8).

    Details are in the caption following the image
    Validation of the molecular mechanisms. (a) Volcano plot showing the P-value versus the fold change (FC) for the microRNA targets in the RNA sequencing dataset (n = 17). Green dots reflect significantly expressed genes considering a P-value <0.05 and a FC > 1.5 or FC < 0.66. (b) Functional enrichment analysis including the intersected molecular mechanisms after comparison of gene-set enrichment analysis using the whole targetome and using the validated target genes in the RNA sequencing dataset. The lollipop plot shows the q value of the pathways. The colours of the points displayed the distinct databases employed: GO, KEGG and Reactome. The intensity and size of the points represent the rich factor of each intersected biological process. (c) Cell enrichment analysis based on lung single-cell RNA-sequencing for the validated target genes using GTEx. Each column shows a cell type and each row represents a gene. The size of the point denotes the number of cells where the gene has been detected and the colour represents the expression level. The GTEx project was supported by the Common Fund of the Office of the Director of the National Institutes of Health and by the NCI, NHGRI, NHLBI, NIDA, NIMH and NINDS. (d) Volcano plot from proteomic dataset (n = 31) representing the P-value versus the fold change for each protein after comparison between recovered (DLCO ≥ 80% of predicted) and persistent pulmonary dysfunction (DLCO < 80% of predicted). Proteins with a P-value <0.05 and a FC > 1.5 or FC < 0.66 are depicted in green and considered as significant. (e) Violin plots of significant proteins. Inner boxplots show median and P25 and P75 ranges of expression values of each protein. FC and P-values are printed for each protein.

    To further explore the expression profile of miRNA targets in lung cells, we retrieved data from the GTEx project, revealing that all genes are expressed in lung epithelial cells, endothelial cells and fibroblasts, with variable expression levels in immune cells (Figure 2c). Subsequently, we examined the repositioning potential of up-regulated miRNA targets using the Drug-Gene Interaction database, identifying two FDA-approved drugs (clopidogrel and aspirin) targeting one gene, ITGA2.

    In pursuance of identifying mediators that support these findings, a validation step was performed using plasma proteomic data comprising survivors of critical COVID-19 (median age = 62.0 [49.0;67.5] years, females = 29.0%, 54.8% with persistent pulmonary dysfunction 1 year after hospital discharge, samples collected at a median of 97.0 [90.5;106] days after hospital discharge). As shown in Figure 2d,e, the plasma levels of PTN (6618) and KIM1, from a total of 364 proteins, were elevated in survivors of critical COVID-19 with persistent diffusion impairment. No FDA-approved drugs targeting PTN or KIM1 were retrieved from the Drug-Gene Interaction database.

    4 DISCUSSION

    The increasing survival rate of critically ill patients paradoxically exerts a profound impact on National Health Systems. Given the substantial number of individuals surviving severe-to-critical COVID-19, there is a pressing concern regarding the prevalence of chronic pulmonary sequelae and their associated societal and economic burdens. Within this context, the management of respiratory sequelae in survivors of severe respiratory infections remains hindered by the limited efficacy of approved treatments. The molecular underpinnings contributing to the persistence of pulmonary sequelae following SARS-CoV-2 infection remain unclear. The results of an in-depth analysis of pathobiological mechanisms are very well suited to dissect the molecular basis of the long-term pulmonary sequelae. The greater accessibility to relatively novel tools has facilitated the implementation of innovative approaches for sophisticated molecular phenotyping (Vanhaverbeke et al., 2022). Here, we employed miRNA profiling of plasma samples collected from critical COVID-19 survivors, alongside the integration of RNA-sequencing and proteomic data, machine learning feature selection methods and bioinformatics, to delineate the early mechanistic pathways linked to the persistence of lung dysfunction and to identify potential candidate drugs.

    In the initial phase, we established an association between two specific miRNAs, miR-9-5p and miR-486-5p, measured in plasma samples shortly after hospital discharge (median = 3 months after hospital discharge), and long-term lung function impairment (median = 12 months after hospital discharge) using four distinct feature selection algorithms. miR-486-5p was consistently selected by different feature selection methods when considering survivors who required the use of IMV during ICU stay. This approach has been previously used to develop classifiers for diseases and identify pathogenetic regulators (Errington et al., 2021). Machine learning methods offer the advantage of capturing previously unknown multidimensional interactions between predictors and outcomes, providing an unbiased approach to detect novel biological features. This specially applies to miRNAs, because they regulate gene expression through complex, coordinated and collective networks, as reported above. The combination of different methods limits the number of selected miRNAs, enhancing confidence and reproducibility. Certainly, plasma levels of miR-9-5p have been previously associated with radiological abnormalities in survivors of severe COVID-19 (García-Hidalgo, González, Benítez, Carmona, Santisteve, Pérez-Pons, et al., 2022). miR-486-5p levels also have been linked with COVID-19 severity (de Gonzalo-Calvo et al., 2021) and intra-ICU mortality (de Gonzalo-Calvo et al., 2023). Both miRNAs have been associated with pathogenetic mechanisms involved in lung damage and repair. For example, miR-486-5p has been shown to promote acute lung injury by inducing an excessive inflammatory response and inhibiting apoptosis (Luo et al., 2020). Conversely, and in line with the complex biology of miRNAs, other studies have proposed miR-486-5p as a key regulator of cell survival in in vitro models of lung alveolar epithelial cells (Li et al., 2018) and anti-fibrotic mediator in pulmonary fibrosis (Ji et al., 2015). Similarly, miR-9-5p has been described as a suppressor of the profibrogenic transformation of fibroblasts and a regulator of organ fibrosis (Fierro-Fernández et al., 2015).

    The precise mechanisms by which acute respiratory infections, including SARS-CoV-2, lead to persistent lung sequelae are not yet fully understood. Data from critically ill patients suggest that failure to resolve epithelial barrier dysfunction, inflammation and matrix remodelling, in addition to other processes such as coagulation, could result in chronic lung dysfunction (Martín-Vicente et al., 2021). In the case of COVID-19, accumulating evidence indicates that post-acute pulmonary lesions are driven by the interplay of multiple complex pathomechanisms. Treatable traits encompass inflammation, alveolar damage, endothelial injury, fibrosis, senescence, mitochondrial dysfunction and neutrophil recruitment (George et al., 2022; Pastor-Fernández et al., 2023; Sibila et al., 2022). The varied histological and immunohistochemical patterns observed in lung tissue from patients with pulmonary sequelae after COVID-19 support this notion (Ravaglia et al., 2022). The functional enrichment analysis performed in our study aligns with the existing literature, revealing an overrepresentation of pathways related to inflammation, angiogenesis and cell senescence, among others. Additionally, our supplementary investigations indicate disruptions in pathways associated with cell migration and tissue repair as hallmark features of persistent lung dysfunction in critical COVID-19 survivors. These findings corroborate previous results from our group obtained through cross-sectional studies using diverse omics approaches (García-Hidalgo, González, Benítez, Carmona, Santisteve, Moncusí-Moix, et al., 2022; García-Hidalgo, Peláez, et al., 2022; Molinero, Gómez, et al., 2022). Furthermore, we have previously reported an association between elevated PTN and KIM1 levels, both of which play pivotal roles in tissue repair (Wang, 2020; Zhang & Cai, 2016), and severe diffusion impairment (DLCO < 60% of predicted) 3 months after hospital discharge (García-Hidalgo, González, Benítez, Carmona, Santisteve, Moncusí-Moix, et al., 2022). Collectively, these findings have significant conceptual implications. At least at systemic level, respiratory sequelae are not a uniform entity but comprise diverse pathobiological mechanisms. This heterogeneity may pose a substantial barrier to improve patient management and could help explain the limited response to treatments and the negative outcomes of clinical trials. Comprehensive investigations are essential to maximize opportunities for developing novel therapeutics.

    Our study provides valuable hypothesis-generating data from which to build future studies. The results suggest that there is a window of opportunity to prevent or treat long-term pulmonary damage in survivors of critical illness. Given the limited availability of treatments, miRNA-based therapeutics may represent a novel source of candidate drugs for addressing the persistence of lung dysfunction. In this regard, miRNA-based drugs have seen significant advances in the past decade and are considered a key technology in the therapeutic market (Bonneau et al., 2019). For instance, in the seminal study from Janssen et al. (2013), the administration of an antisense oligonucleotide that sequestered mature miR-122 to patients with chronic hepatitis C virus infection resulted in the reduction of the viral RNA levels. A recent investigation, combining different animal models and patient cohorts, reported that anti-miR-93-5p therapy prolonged sepsis survival through the regulation of both innate and adaptive immunity, that is, reducing the inflammatory monocytes and increasing circulating effector memory T cells (Dragomir et al., 2023). Another study also demonstrated the therapeutic efficacy of antisense therapy targeting miR-132-3p in various models of heart failure (Foinquinos et al., 2020). Supporting their potential transferability, a clinical trial phase 1b study suggested antisense therapy targeting miR-132-3p as safe and well tolerated (Täubel et al., 2021). miRNA mimic therapy also has provided promising results in bronchopulmonary dysplasia (Wen et al., 2021) and idiopathic pulmonary fibrosis (Chioccioli et al., 2022). Nevertheless, the safety of miRNA restoration therapy is still a matter of debate (Hong et al., 2020). Overall, these studies underscore the importance of considering miRNAs as potential therapeutic targets/agents. This becomes particularly relevant given the limited repositioning potential identified in our study. The drug–gene interaction analyses identified two FDA-approved drugs, clopidogrel and aspirin, targeting only one gene, ITGA2, thus not covering the entire spectrum of mechanisms associated with persistence. The limitations of miRNA-based therapeutics could be seen as an advantage, because the use of miRNAs that regulate multiple underlying pathways, traditionally considered an undesirable side effect, might prove beneficial in multifactorial conditions such as respiratory post-acute sequelae.

    While our current investigation exhibits several strengths, including the use of a “real clinical context” with meticulous patient monitoring and comprehensive pulmonary assessments, it is imperative to acknowledge the limitations inherent to a pilot study. First, the relatively modest sample size may result in model overfitting. The study cohort was collected among SARS-CoV-2 patients during the early stages of the pandemic utilizing a single-centre design. The impact of potential confounding factors, such as age, sex or smoking status, should not be discarded. Nonetheless, differential expression analyses have been adjusted by these variables. Likewise, feature selection methods were applied to a more homogenous subgroup of critical survivors, showing consistent results. Further validation on external cohorts with more varied demographic and clinical characteristics and after sample size calculation are warranted. Whether our findings have an application in other post-acute patients deserves detailed investigations. To this end, we are currently collecting samples and data in a prospective manner (ClinicalTrials.gov Identifier: NCT06083363). Second, we have used a targeted strategy. However, we have selected miRNAs linked to mechanistic pathways implicated in lung disease, COVID-19 and its respiratory consequences. The present results offer robust support for the adoption of high-throughput technologies, such as sequencing. Third, additional studies are indispensable for dissecting the individual contributions of plasma miRNAs to the elucidated mechanistic pathways and their causal links to persistent lung dysfunction. Fourth, although patients with previous chronic pulmonary disease were excluded, we could not discard the presence of subclinical disease. Fifth, our findings align with a systemic impact of the respiratory sequelae. Previous investigations have suggested a compartmentalization of the lung concerning specific pathological mechanisms, particularly within the context of inflammation (Jouan et al., 2021). Furthermore, we have previously delineated disparities in the miRNA profiles observed in respiratory samples and plasma obtained from individuals with critical COVID-19 (de Gonzalo-Calvo et al., 2023; Molinero, Benítez, et al., 2022). Whether these mechanisms and potential targets are associated with organ dysfunction at the lung warrants further investigation. Sixth, we have not performed a biomarker study. While plasma miRNA profiles have traditionally been considered as a potential source of biomarkers, their utility as prognostic tools for managing pulmonary post-acute sequelae appears to be limited. The electronic health data provide enough information for prognostication. The group of patients with persistent diffusion impairment predominantly consists in older survivors with extended hospital and ICU stays, a higher need for IMV and prone positioning during the acute phase, diminished pulmonary function and capacity and an elevated prevalence of radiological abnormalities observed in the short-term follow-up visit after hospital discharge.

    In conclusion, we successfully identified two miRNAs, miR-9-5p and miR-486-5p, linked to persistent lung dysfunction in survivors of critical COVID-19. Our findings provide a basis for understanding the intricate mechanistic pathways involved in the recovery of lung function. Current results offer promising avenues for the development of targeted disease-modifying agents to improve the long-term outcomes of individuals recovering from severe respiratory infections.

    AUTHOR CONTRIBUTIONS

    M.C. García-Hidalgo, I.D. Benítez, F. Barbé and D. de Gonzalo-Calvo were responsible for conceptualization. A. Torres, F Barbé and D. de Gonzalo-Calvo were involved in funding acquisition. F. Barbé and D. de Gonzalo-Calvo were in charge of project administration and supervision. Resources were organized and provided by J. González and D. de Gonzalo-Calvo. Data curation was performed by M.C. García-Hidalgo, M. Aguilà, S. Santisteve, A. Moncusí-Moix and C. Gort-Paniello. M.C. García-Hidalgo, M. Perez-Pons, M. Aguilà, S. Santisteve, R. Peláez and I.M. Larráyoz assisted with the investigation. Experimental methodology was done by M.C. García-Hidalgo, M. Perez-Pons and D. de Gonzalo-Calvo. M.C. García-Hidalgo, I.D. Benítez, R. Peláez, I.M. Larráyoz and D. de Gonzalo-Calvo contributed to formal analysis of the data. Software was developed by M.C. García-Hidalgo and I.D. Benítez. M.C. García-Hidalgo and M. Molinero were responsible for visualization of data. M.C. García-Hidalgo, I.D. Benítez and D. de Gonzalo-Calvo wrote the original draft. The review and editing of the manuscript were made by M. Perez-Pons, M. Molinero, T. Belmonte, C. Rodríguez-Muñoz, G. Torres, R. Peláez, I.M. Larráyoz, J. Caballero, C. Barberà, E. Nova-Lamperti, A. Torres, J. González, F. Barbé and D. de Gonzalo-Calvo.

    ACKNOWLEDGEMENTS

    This work supported by IRBLleida Biobank (B.000682) and Biobank and Biomodels Platform ISCIII PT20/00021. The human sample manipulation was performed in the Cell Culture Facility, Universitat de Lleida (Lleida, Catalonia, Spain).

      CONFLICT OF INTEREST STATEMENT

      The authors declare that they have no competing interests.

      DECLARATION OF TRANSPARENCY AND SCIENTIFIC RIGOUR

      This Declaration acknowledges that this paper adheres to the principles for transparent reporting and scientific rigour of preclinical research as stated in the BJP guidelines for Design and Analysis and as recommended by funding agencies, publishers and other organizations engaged with supporting research.

      DATA AVAILABILITY STATEMENT

      The data that support the findings of this study are available on request from the corresponding author. The external datasets were derived from the following resources available in the public domain: GEO (https://www.ncbi.nlm.nih.gov/geo/GSE228320) and Zenodo (https://zenodo.org/record/8375667).