吉林大学学报(医学版) ›› 2024, Vol. 50 ›› Issue (1): 178-187.doi: 10.13481/j.1671-587X.20240122

• 临床研究 • 上一篇    

基于盆腔器官脱垂相关衰老基因的GEO数据库和LASSO回归算法的生物信息学分析

宁敏琦,何勇,李秉枢,黄国涛,左晓虎,赵芷晗,韩武岳,洪莉()   

  1. 武汉大学人民医院妇产科 湖北省盆底疾病临床医学研究中心,湖北 武汉 430060
  • 收稿日期:2023-02-16 出版日期:2024-01-28 发布日期:2024-01-31
  • 通讯作者: 洪莉 E-mail:dr_hongli@whu.edu.cn
  • 作者简介:宁敏琦(1999-),女,湖南省株洲市人,在读硕士研究生,主要从事盆底功能障碍性疾病方面的研究。
  • 基金资助:
    国家自然科学基金项目(81971364);国家药品监督管理局重点研发计划项目(2021YFC2701300);湖北省科技厅自然科学基金项目(2022CFB124);湖北省科技厅重点研发计划项目(2022BCA045);湖北省卫健委科研项目(WJ2021Q037)

Bioinformatics analysis based on pelvic organ prolapse related aging genes of GEO Database and LASSO regression algorithm

Minqi NING,Yong HE,Bingshu LI,Guotao HUANG,Xiaohu ZUO,Zhihan ZHAO,Wuyue HAN,Li HONG()   

  1. Department of Gynecology and Obstetrics,People’s Hospital,Wuhan University,Clinical Medical Reseach Center of Pelvic Floor Diseases,Hubei Procince,Wuhan 430060,China
  • Received:2023-02-16 Online:2024-01-28 Published:2024-01-31
  • Contact: Li HONG E-mail:dr_hongli@whu.edu.cn

摘要:

目的 通过生物信息学技术筛选与盆腔器官脱垂(POP)密切相关的衰老基因,并阐明关键基因潜在的临床意义和价值。 方法 利用基因表达汇编(GEO)数据库以“pelvic organ prolapse”为关键词检索下载数据集GSE53868和GSE151188获取POP相关基因。从Aging Atlas数据库、CellAge数据库和人类衰老基因组资源(HAGR)数据库获取衰老相关基因,2组基因取交集得到POP相关衰老的差异表达基因(DEGs)。采用R 4.2.1软件进行基因富集分析(GSEA)。采用注释可视化和集成发现(DAVID)数据库对DEGs进行基因本体论(GO)功能富集分析和京都基因与基因组百科全书(KEGG)信号通路富集分析,采用Cytoscape 3.9.1软件构建蛋白-蛋白互作(PPI)网络,用cytoHubba插件筛选排名前10位的核心基因(Hub基因)。采用R软件CIBERSORT反卷积法分析22种免疫细胞在POP组和对照组患者中的浸润情况。采用LASSO回归算法进一步筛选关键基因,分析关键基因与免疫细胞浸润的相关性和诊断效能。 结果 通过Aging Atlas、CellAge和HAGR数据库得到724个衰老相关基因,与POP表达谱取交集,提取到含624个基因的POP相关衰老基因表达矩阵,差异分析后得到29个POP相关DEGs,其中表达上调基因2个,表达下调基因27个。GSEA显示,上调通路主要是糖尿病和细胞衰老等通路,下调通路包括阿尔茨海默病和缺氧诱导因子1(HIF-1)信号通路等。GO功能富集分析主要富集于细胞对脂多糖的反应、炎症反应和细胞增殖的负向调节等生物学过程。KEGG信号通路富集分析主要是白细胞介素17(IL-17)、肿瘤坏死因子(TNF)和核因子κB(NF-κB)信号传导途径。PPI网络分析得到白细胞介素6(IL-6)、白细胞介素1B(IL-1B)、环氧合酶2(PTGS2)和NF-κB抑制因子α(NFKBIA)等10个Hub基因。CIBERSORT反卷积法分析,中性粒细胞和活化的肥大细胞在POP组患者中浸润比例相对较大,且在相关性分析中活化的肥大细胞与大部分DEGs呈正相关关系(r>0.5),巨噬细胞与IL-1B呈明显正相关关系(r>0.6)。LASSO回归算法进一步筛选的10个关键基因中,Jun D原癌基因(JUND)、Snail同源物1(SNAI1)、双调蛋白(AREG)、A型核纤层蛋白基因(LMNA)和超氧化物歧化酶2(SOD2)的诊断效能较高,受试者工作特征(ROC)曲线下面积(AUC)均大于0.750。 结论 在衰老过程中,JUND、SNAI1、AREG、LMNA和SOD2基因可能通过炎症相关通路、转录调控、影响胶原蛋白分泌和代谢等多种途径参与POP病理生理过程,从而影响结缔组织支撑功能,促进POP的发生发展。

关键词: 盆腔器官脱垂, 生物信息学, 差异基因, 富集分析

Abstract:

Objective To screen the aging genes closely associated with pelvic organ prolapse (POP) by bioinformatics techniques, and to clarify the potential clinical significance and value of key genes. Methods Gene Expression Omnibus (GEO) Database was used to download the datasets GSE53868 and GSE151188 for POP-related genes with the keyword “pelvic organ prolapse”. The aging-related genes were obtained from Aging Atlas, CellAge, and the Human Ageing Genomic Resources (HAGR) Databases;the intersection of genes related with POP in two groups provided a list of differentially expressed genes (DEGs) associated with aging in POP; gene Set Enrichment Analysis (GSEA) was conducted with R software version 4.2.1; Gene Ontology (GO) functional enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) signaling pathway enrichment analysis of DEGs were conducted by the Database for Annotation, Visualization and Integrated Discovery (DAVID); the protein-protein interaction (PPI) network was constructed with Cytoscape 3.9.1 software;the top 10 Hub genes were selected by cytoHubba plugin; the infiltration of 22 types of immune cells in the patients in POP group and control group was analyzed by CIBERSORT deconvolution method using R software;the key genes were further screened by LASSO regression algorithm; the correlation and diagnostic efficacy between key genes and immune cell infiltration were analyzed. Results From the Aging Atlas, CellAge, and HAGR Databases, 724 aging-related genes were identified. Intersection with the POP expression profile yielded an aging gene expression matrix related to POP containing 624 genes, and 29 POP-related DEGs were identified after differential analysis, including 2 upregulated genes and 27 downregulated genes. The GSEA results showed that the upregulated pathways were mainly related to diabetes and cellular senescence, whereas the downregulated pathways included Alzheimer’s disease and hypoxia-inducible factor-1 (HIF-1) signaling pathways.The GO functional enrichment analysis mainly enriched in the biological processes such as the response of the cells to lipopolysaccharide, inflammatory response, and negative regulation of cell proliferation. The KEGG signaling pathway enrichment analysis mainly enriched in interleukin-17 (IL-17), tumor necrosis factor (TNF), and nuclear factor-kappa B (NF-κB) signaling pathways. The PPI network analysis got 10 Hub genes including interleukin-6 (IL-6), interleukin-1B (IL-1B), prostaglandin-endoperoxide synthase 2 (PTGS2), and NF-kappa-B inhibitor alpha (NFKBIA). The CIBERSORT deconvolution method results showed a relatively higher infiltration proportion of neutrophils and activated mast cells in the patients in POP group, the activated mast cells had a positive correlation with most of the DEGs (r>0.5) and the macrophages had a significant positive correlation with IL-1B (r>0.6). The key genes Jun D proto-oncogene (JUND), Snail homolog 1 (SNAI1), amphiregulin (AREG), Lamin A/C (LMNA), and superoxide dismutase 2 (SOD2) selected by LASSO regression analysis had high diagnostic efficacies, and the area under receiver operating characteristic curve (ROC) (AUC) were all greater than 0.75. Conclusion During the aging process,the genes such as JUND,SNAI1,AREG,LMNA,and SOD2 may participate in the pathophysiology of POP through various pathways,including inflammation-related pathways,transcription regulation,and affecting collagen secretion and metabolism,thereby influence the connective tissue support function and promote the occurrence and development of POP.

Key words: Pelvic organ prolapse, Bioinformatics, Differential genes, Enrichment analysis

中图分类号: 

  • R711.23