Intelligent analysis of ship collision accidents via Low-Rank Adaptation-based fine-tuning of medium-scale Large Language Models

Ma, Jun; Cao, Liang; Feng, Yinwei; Karatuğ, Çağlar; Büber, Müge; Wang, Xinjian

doi:10.1016/j.ress.2026.112774

Ma J., Cao L., Feng Y., Karatuğ Ç., Büber M., Wang X.

RELIABILITY ENGINEERING AND SYSTEM SAFETY, vol. 275, no. 2, pp. 1-29, 2026 (SCI-Expanded, Scopus)

Publication Type: Article / Full Article
Volume: 275 Issue: 2
Publication Date: 2026
DOI: 10.1016/j.ress.2026.112774
Journal Name: RELIABILITY ENGINEERING AND SYSTEM SAFETY
Journal Indexes: Scopus, Science Citation Index Expanded (SCI-EXPANDED), Compendex, INSPEC, zbMATH
Pages: pp. 1-29
Dokuz Eylül University Affiliated: Yes

Abstract

The rapid advancement of intelligent maritime accident analysis requires processing large-scale, multilingual data across wide geographic regions. However, significant challenges remain in objectively constructing Risk Influencing Factors (RIFs) and in ensuring accurate information extraction with limited computational resources. To address these gaps, this study proposes a framework for the intelligent analysis of ship collision accidents based on Low-Rank Adaptation (LoRA) fine-tuning of medium-scale large language models (LLMs) with limited labeled data. A bilingual dataset comprising 503 ship collision accident reports was established, and the RIF ontology was derived using a Grounded Theory approach. Using 60 labeled samples, models with parameters were fine-tuned, achieving an F1 score of 94.11% on the most challenging accident RIF extraction subtask, surpassing base models by 34.82%. The extracted information was then transformed into a 1061-row × 24-column training data matrix via a semantic similarity model, enabling construction of a Tree Augmented Naive Bayesian Network (TAN-BN) model. Finally, sensitivity analysis was conducted to identify key RIFs, and case studies were performed to evaluate model performance and validate the proposed framework. The results show that the proposed approach advances large-scale, cross-lingual intelligent analysis of maritime accident reports by improving accuracy and efficiency, reducing computational costs, and supporting reliable safety management decisions. The source code is publicly available at: https://github.com/AdvMarTech/LoRA_LLM_Accident.
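
As a rough illustration of why LoRA suits limited computational resources, the sketch below shows the general low-rank update idea in NumPy: a frozen weight matrix W is augmented by a trainable product B·A of rank r, so only A and B need to be trained. This is not the authors' implementation (that is in the repository linked above); the function name lora_forward, the layer sizes, the rank, and the scaling factor alpha are all hypothetical values chosen for illustration.

```python
import numpy as np


def lora_forward(x, W, A, B, alpha):
    """Apply a frozen weight W plus a scaled low-rank update (alpha / r) * B @ A."""
    r = A.shape[0]  # rank of the update
    W_eff = W + (alpha / r) * (B @ A)
    return x @ W_eff.T


rng = np.random.default_rng(0)

# Hypothetical sizes, far smaller than any real LLM layer.
d_out, d_in, r, alpha = 64, 128, 4, 8

W = rng.standard_normal((d_out, d_in))       # frozen pretrained weight
A = rng.standard_normal((r, d_in)) * 0.01    # trainable, small random init
B = np.zeros((d_out, r))                     # trainable, zero init: update starts at zero

x = rng.standard_normal((2, d_in))           # a batch of two input vectors
y_base = x @ W.T                             # output of the frozen layer alone
y_lora = lora_forward(x, W, A, B, alpha)     # output with the (still zero) LoRA update

full_params = W.size                         # parameters touched by full fine-tuning
lora_params = A.size + B.size                # parameters touched by LoRA
print(f"trainable fraction: {lora_params / full_params:.3f}")
```

Because B starts at zero, the adapted layer initially reproduces the base layer exactly, and training only moves the much smaller A and B matrices. In this toy setting the adapter holds 768 parameters against 8192 in the full matrix; the ratio for the models used in the paper depends on their actual layer sizes and chosen rank, which the abstract does not state.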