基于 ConvNext 的测井图像描述方法

吉林大学学报(信息科学版) ›› 2026, Vol. 44 ›› Issue (3): 489-498.

• • 下一篇

基于 ConvNext 的测井图像描述方法

肖红¹, 颜高鹏¹, 曹茂俊¹, 舒琰²

1. 东北石油大学计算机与信息技术学院,黑龙江大庆163318;2. 大庆油田测试技术服务分公司解释评价中心,黑龙江大庆163318

收稿日期:2025-04-14 出版日期:2026-06-02 发布日期:2026-06-02
通讯作者: 颜高鹏(2000— ), 男, 陕西宝鸡人, 东北石油大学硕士研究生, 主要从事人工智能、深度学习研究,(Tel)86-15592680831 E-mail:yangaopeng_01@126. com
作者简介:肖红(1979— ), 女, 黑龙江大庆人, 东北石油大学副教授, 博士, 主要从事深度学习、图像处理研究, (Tel)86- 13904861067(E-mail)xh_daqing@126. com
基金资助:
国家自然科学基金资助项目(42172161;52474035); 黑龙江省自然科学基金联合基金重点资助项目(ZL2024D003); 中石油创新基金资助项目(2024DQ02-0114)

Logging Image Description Method Based on ConvNext

XIAO Hong¹, YAN Gaopeng¹, CAO Maojun¹, SHU Yan²

1. School of Computer and Information Technology, Northeast Petroleum University, Daqing 163318, China; 2. Interpretive Evaluation Center, Daqing Oilfield Testing Technology Service Branch, Daqing 163318, China

Received:2025-04-14 Online:2026-06-02 Published:2026-06-02

摘要/Abstract

摘要： 针对现有测井图像解释工作高度依赖人工经验和专家意见, 无法快速理解和给出图像主旨含义, 严重影响测井图像蕴含信息的深度挖掘和利用率问题, 提出了一种基于 ConvNex t网络编码-解码架构的测井图像描述方法。首先, 在编码器部分采用 ConvNext 网络配合混合空洞卷积, 以增强模型对低分辨率图像细节信息的提取能力。然后, 通过将原始模型的加性注意力机制替换为多头注意力机制, 配合具有时序信息记忆能力的 LSTM(Long Short Term Memory), 有效提升模型对长距离依赖信息的捕捉能力, 从而可生成对测井图像更准确、更自然的描述。实验结果表明, 与基线模型方法比较, BLEU-4(Bilingual Evaluation Understudy-4)、 METROR (Metric for Evaluation of Translation with Explicit ORdering) 以及 CIDEr ( Consensus-based Image Description Evaluation)评价指标分别提升了 3. 8、4. 0 和 5. 3, 表明采用 ConvNext 架构描述测井图像信息的研究方案是可行的。

关键词: 图像描述, ConvNext网络, 混合空洞卷积, 多头注意力机制

Abstract: The existing logging image interpretation work highly relies on manual experience and expert opinions, which can not quickly understand and give the gist meaning of the image, seriously affecting the depth mining and utilization of the information contained in the logging image. A logging image description method based on the ConvNext network coding-decoding architecture is proposed. The ConvNext network with mixed dilated convolution encoder is adopted to enhance model’s ability to extract low resolution image detail information. Then, the mechanism of additive attention is used to replace the original model for long attention mechanism, cooperated with LSTM(Long Short Term Memory) capable of temporal information memory, enhancing the model’s ability to capture long rely on information, which can generate descriptions of logging image more accurate and more natural. The experimental results show that the evaluation indexes of BLEU-4 (Bilingual Evaluation Understudy-4), METROR(Metric for Evaluation of Translation with Explicit ORdering) and CIDEr (Consensus-based Image Description Evaluation) are improved by 3. 8,4. 0 and 5. 3, respectively, compared with the baseline model method. This research scheme using ConvNext architecture to describe log image information is feasible.

Key words: image description, ConvNext network, mixed cavity convolution, multi-head attention mechanism

中图分类号:

TP391

肖红, 颜高鹏, 曹茂俊, 舒琰. 基于 ConvNext 的测井图像描述方法 [J]. 吉林大学学报(信息科学版), 2026, 44(3): 489-498.

XIAO Hong, YAN Gaopeng, CAO Maojun, SHU Yan. Logging Image Description Method Based on ConvNext[J]. Journal of Jilin University (Information Science Edition), 2026, 44(3): 489-498.

[1]	任伟建, 张义东, 任璐, 张永丰, 孙勤江. 基于XLBMC 的专业领域中文分词方法[J]. 吉林大学学报(信息科学版), 2025, 43(4): 755-762.
[2]	陈雪松, 詹子依, 王浩畅. 融合 SikuBERT 模型与 MHA 的古汉语命名实体识别[J]. 吉林大学学报(信息科学版), 2023, 41(5): 866-875.
[3]	张强, 李志溢, 邓彬. 基于 HDCNN-BIGRU-Attention 油田措施效果预测模型 [J]. 吉林大学学报(信息科学版), 2023, 41(4): 631-638.

基于 ConvNext 的测井图像描述方法

Logging Image Description Method Based on ConvNext

PDF (PC)

赞

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 3

Metrics

本文评价

推荐阅读 10