吉林大学学报(信息科学版) ›› 2026, Vol. 44 ›› Issue (3): 489-498.

• •    下一篇

基于 ConvNext 的测井图像描述方法 

肖 红1, 颜高鹏1, 曹茂俊1, 舒 琰2   

  1. 1. 东北石油大学计算机与信息技术学院,黑龙江大庆163318;2. 大庆油田测试技术服务分公司解释评价中心,黑龙江大庆163318
  • 收稿日期:2025-04-14 出版日期:2026-06-02 发布日期:2026-06-02
  • 通讯作者: 颜高鹏(2000— ), 男, 陕西宝鸡人, 东北石油大学硕士研究生, 主要从事人工智能、深度学习研究,(Tel)86-15592680831 E-mail:yangaopeng_01@126. com
  • 作者简介:肖红(1979— ), 女, 黑龙江大庆人, 东北石油大学副教授, 博士, 主要从事深度学习、 图像处理研究, (Tel)86- 13904861067(E-mail)xh_daqing@126. com
  • 基金资助:
    国家自然科学基金资助项目(42172161;52474035); 黑龙江省自然科学基金联合基金重点资助项目(ZL2024D003); 中石 油创新基金资助项目(2024DQ02-0114)

Logging Image Description Method Based on ConvNext

XIAO Hong1, YAN Gaopeng1, CAO Maojun1, SHU Yan2   

  1. 1. School of Computer and Information Technology, Northeast Petroleum University, Daqing 163318, China; 2. Interpretive Evaluation Center, Daqing Oilfield Testing Technology Service Branch, Daqing 163318, China
  • Received:2025-04-14 Online:2026-06-02 Published:2026-06-02

摘要: 针对现有测井图像解释工作高度依赖人工经验和专家意见无法快速理解和给出图像主旨含义严重影响测井图像蕴含信息的深度挖掘和利用率问题提出了一种基于 ConvNex t网络编码-解码架构的测井图像描述方法。 首先在编码器部分采用 ConvNext 网络配合混合空洞卷积, 以增强模型对低分辨率图像细节信息的提取能力。 然后通过将原始模型的加性注意力机制替换为多头注意力机制配合具有时序信息记忆能力的 LSTM(Long Short Term Memory), 有效提升模型对长距离依赖信息的捕捉能力, 从而可生成对测井图像更准确、 更自然的描述。 实验结果表明, 与基线模型方法比较, BLEU-4(Bilingual Evaluation Understudy-4) METROR (Metric for Evaluation of Translation with Explicit ORdering) 以及 CIDEr ( Consensus-based Image Description Evaluation)评价指标分别提升了 3. 84. 0 5. 3, 表明采用 ConvNext 架构描述测井图像信息的研究方案是可行的。

关键词: 图像描述,  ConvNext网络, 混合空洞卷积, 多头注意力机制

Abstract: The existing logging image interpretation work highly relies on manual experience and expert opinions, which can not quickly understand and give the gist meaning of the image, seriously affecting the depth mining and utilization of the information contained in the logging image. A logging image description method based on the ConvNext network coding-decoding architecture is proposed. The ConvNext network with mixed dilated convolution encoder is adopted to enhance model’s ability to extract low resolution image detail information. Then, the mechanism of additive attention is used to replace the original model for long attention mechanism, cooperated with LSTM(Long Short Term Memory) capable of temporal information memory, enhancing the model’s ability to capture long rely on information, which can generate descriptions of logging image more accurate and more natural. The experimental results show that the evaluation indexes of BLEU-4 (Bilingual Evaluation Understudy-4), METROR(Metric for Evaluation of Translation with Explicit ORdering) and CIDEr (Consensus-based Image Description Evaluation) are improved by 3. 8,4. 0 and 5. 3, respectively, compared with the baseline model method. This research scheme using ConvNext architecture to describe log image information is feasible. 

Key words: image description, ConvNext network, mixed cavity convolution, multi-head attention mechanism 

中图分类号: 

  • TP391