人工智能在普通外科重症预警与决策支持中应用

金信浩; 沈鸿杰; 杨穗碧; 林玲; 章仲恒

doi:10.19538/j.cjps.issn1005-2208.2026.05.08

PDF(2285 KB)

中国实用外科杂志 ›› 2026, Vol. 46 ›› Issue (5) : 638-644. DOI: 10.19538/j.cjps.issn1005-2208.2026.05.08

专题笔谈·外科重症诊治新观点

人工智能在普通外科重症预警与决策支持中应用

金信浩 ¹ ,
沈鸿杰 ² ,
杨穗碧 ² ,
林玲 ¹ ,
章仲恒 ²^,³^,⁴

作者信息 +

Application of artificial intelligence in early warning and decision support for critically ill patients in general surgery

JIN Xin-hao ¹ ,
SHEN Hong-jie ² ,
YANG Sui-bi ² ,
LIN Ling ¹ ,
ZHANG Zhong-heng ²^,³^,⁴

Author information +

文章历史 +

摘要

普通外科重症病人病情复杂、变化迅速，常涉及多器官功能障碍及严重感染等问题，给临床诊疗带来巨大挑战。传统重症评估工具在一定程度上能够反映病人病情严重程度，但多依赖静态指标，难以实现连续动态评估。近年来，随着医疗大数据和人工智能技术的发展，基于机器学习与深度学习算法的智能模型逐渐应用于重症医学领域，在疾病早期识别、风险预测及临床决策支持方面展现出重要价值。人工智能能够整合电子病历、生命体征监测、实验室指标及影像数据，实现对重症病人病情变化的实时分析与预测，从而为临床医师提供更加精准的决策依据。人工智能在普通外科重症病人早期预警、并发症风险预测以及临床决策支持中的应用有良好的前景，但其在临床推广过程中也面临数据质量、模型可解释性及伦理安全等问题。

Abstract

Critically ill patients in general surgery often present with complex conditions and rapid clinical deterioration, frequently involving severe infection and multiple organ dysfunction. Traditional severity scoring systems can partially reflect disease severity but rely mainly on static indicators and lack dynamic predictive capability. With the development of medical big data and artificial intelligence (AI), machine learning and deep learning-based models have been increasingly applied in critical care medicine. These technologies show great potential in early disease detection, risk prediction, and clinical decision support. By integrating electronic health records, vital signs, laboratory results, and imaging data, AI can continuously analyze patient conditions and provide real-time predictions to assist clinicians in making timely decisions. AI has promising prospects in the application in early warning systems, complication prediction, and decision support for critically ill patients in general surgery. However, current challenges such as data quality, model interpretability, and ethical issues are discussed to provide references for the standardized application of AI in surgical critical care.

导出引用

金信浩, 沈鸿杰, 杨穗碧, 等. 人工智能在普通外科重症预警与决策支持中应用[J]. 中国实用外科杂志. 2026, 46(5): 638-644 https://doi.org/10.19538/j.cjps.issn1005-2208.2026.05.08

JIN Xin-hao, SHEN Hong-jie, YANG Sui-bi, et al. Application of artificial intelligence in early warning and decision support for critically ill patients in general surgery[J]. Chinese Journal of Practical Surgery. 2026, 46(5): 638-644 https://doi.org/10.19538/j.cjps.issn1005-2208.2026.05.08

中图分类号： R6

参考文献

列表( 原文顺序 | 文献年度倒序 | 文中引用次数倒序 ) 可视化分析

[1]	Ghneim MH, Tisherman SA. Management of older adult emergency general surgery patients in the intensive care unit: What you need to know[J]. J Trauma Acute Care Surg, 2026, 100(3):349-358. DOI: 10.1097/TA.0000000000004744. https://journals.lww.com/10.1097/TA.0000000000004744 本文引用 [1]

[2]

Kumrawat

, Gupta

, Dhillon

, et al. Prognostic value of admission lactate levels in critically ill patients: A comparative study with sequential organ failure assessment and acute physiology and chronic health evaluation ii scores[J]. Cureus, 2024, 16(10):e71372. DOI:10.7759/cureus.71372.

本文引用 [1]

[3]	季加孚, 李浙民, 李子禹. 数智化时代胃癌诊治新进展与未来研究方向[J]. 中国实用外科杂志, 2026, 46(1):15-18. DOI:10.19538/j.cjps.issn1005-2208.2026.01.04. 本文引用 [1]

[4]

Porcellato

, Lanera

, Ocagli

, et al. Exploring applications of artificial intelligence in critical care nursing: A systematic review[J]. Nurs Rep, 2025, 15(2):55. DOI:10.3390/nursrep15020055.

https://www.mdpi.com/2039-4403/15/2/55

本文引用 [1] 摘要

Background: Artificial intelligence (AI) has been increasingly employed in healthcare across diverse domains, including medical imaging, personalized diagnostics, therapeutic interventions, and predictive analytics using electronic health records. Its integration is particularly impactful in critical care, where AI has demonstrated the potential to enhance patient outcomes. This systematic review critically evaluates the current applications of AI within the domain of critical care nursing. Methods: This systematic review is registered with PROSPERO (CRD42024545955) and was conducted in accordance with PRISMA guidelines. Comprehensive searches were performed across MEDLINE/PubMed, SCOPUS, CINAHL, and Web of Science. Results: The initial review identified 1364 articles, of which 24 studies met the inclusion criteria. These studies employed diverse AI techniques, including classical models (e.g., logistic regression), machine learning approaches (e.g., support vector machines, random forests), deep learning architectures (e.g., neural networks), and generative AI tools (e.g., ChatGPT). The analyzed health outcomes encompassed postoperative complications, ICU admissions and discharges, triage assessments, pressure injuries, sepsis, delirium, and predictions of adverse events or critical vital signs. Most studies relied on structured data from electronic medical records, such as vital signs and laboratory results, supplemented by unstructured data, including nursing notes and patient histories; two studies also integrated audio data. Conclusion: AI demonstrates significant potential in nursing, facilitating the use of clinical practice data for research and decision-making. The choice of AI techniques varies based on the specific objectives and requirements of the model. However, the heterogeneity of the studies included in this review limits the ability to draw definitive conclusions about the effectiveness of AI applications in critical care nursing. Future research should focus on more robust, interventional studies to assess the impact of AI on nursing-sensitive outcomes. Additionally, exploring a broader range of health outcomes and AI applications in critical care will be crucial for advancing AI integration in nursing practices.

[5]	Zhao Q, Li G, Du K. Applications of machine learning and deep learning in precision medicine: Opportunities and challenges in genomics, oncology and clinical integration (Review)[J]. Biomed Rep, 2025, 23(6):192. DOI:10.3892/br.2025.2070. 本文引用 [1]

[6]

Zeng

, Huang

, Zhu

. AI-driven transformation of precision medicine: A comprehensive narrative review of key application areas, emerging paradigms, and future directions[J]. Front public health, 2026, 13:1656603. DOI:10.3389/fpubh.2025.1656603.

https://www.frontiersin.org/articles/10.3389/fpubh.2025.1656603/full

本文引用 [1]

[7]

, Wu

, Chen

, et al. Machine learning-based prediction model for postoperative complications in gastric and colorectal cancer: A prospective nationwide multi-center study[J]. Chin J Cancer Res, 2025, 37(4):624-638. DOI:10.21147/j.issn.1000-9604.2025.04.13.

http://article.cjcrcn.org/en/article/doi/10.21147/j.issn.1000-9604.2025.04.13?viewType=HTML

本文引用 [2]

[8]

Shao

, Li

, Cheng

, et al. Prospective multicenter validation of a machine learning model for predicting anastomotic leakage in patients with gastric adenocarcinoma undergoing total or proximal gastrectomy[J]. Int J Surg, 2025, 111(11):8027-8036. DOI:10.1097/JS9.0000000000003025.

https://journals.lww.com/10.1097/JS9.0000000000003025

本文引用 [2]

[9]	Cira K, Neumann P. Machine learning for anastomotic leak prediction in colorectal surgery-between validation and clinical implementation[J]. JAMA Netw Open, 2025, 8(10):e2538274.DOI:10.1001/jamanetworkopen.2025.38274. 本文引用 [2]

[10]	Kang B, Qiao Y, Zhu J, et al. Serum calcium-based interpretable machine learning model for predicting anastomotic leakage after rectal cancer resection: A multi-center study[J]. World J Gastroenterol, 2025, 31(19):105283. DOI:10.3748/wjg.v31.i19.105283. 本文引用 [2]

[11]

Mazzotta

, Burti

, Causio

, et al. Machine learning approaches for the prediction of postoperative major complications in patients undergoing surgery for bowel obstruction[J]. J Pers Med, 2024, 14(10):1043. DOI:10.3390/jpm14101043.

https://www.mdpi.com/2075-4426/14/10/1043

本文引用 [2] 摘要

Background: Performing emergency surgery for bowel obstruction continues to place a significant strain on the healthcare system. Conventional assessment methods for outcomes in bowel obstruction cases often concentrate on isolated factors, and the evaluation of results for individuals with bowel obstruction remains poorly studied. This study aimed to examine the risk factors associated with major postoperative complications. Methods: We retrospectively analyzed 99 patients undergoing surgery from 2015 to 2022. We divided the patients into two groups: (1) benign-related obstruction (n = 68) and (2) cancer-related obstruction (n = 31). We used logistic regression, KNN, and XGBOOST. We calculated the receiver operating characteristic curve and accuracy of the model. Results: Colon obstructions were more frequent in the cancer group (p = 0.005). Operative time, intestinal resection, and stoma were significantly more frequent in the cancer group. Major complications were at 41% for the cancer group vs. 20% in the benign group (p = 0.03). Uni- and multivariate analysis showed that the significant risk factors for major complications were cancer-related obstruction and CRP. The best model was KNN, with an accuracy of 0.82. Conclusions: Colonic obstruction is associated with tumor-related blockage. Malignant cancer and an increase in C-reactive protein (CRP) are significant risk factors for patients who have undergone emergency surgery due to major complications. KNN could improve the process of counseling and the perioperative management of patients with intestinal obstruction in emergency settings.

[12]

Shickel

, Loftus

, Ruppert

, et al. Dynamic predictions of postoperative complications from explainable, uncertainty-aware, and multi-task deep neural networks[J]. Sci Rep, 2023, 13(1):1224. DOI:10.1038/s41598-023-27418-5.

https://www.ncbi.nlm.nih.gov/pubmed/36681755

本文引用 [2] 摘要

Accurate prediction of postoperative complications can inform shared decisions regarding prognosis, preoperative risk-reduction, and postoperative resource use. We hypothesized that multi-task deep learning models would outperform conventional machine learning models in predicting postoperative complications, and that integrating high-resolution intraoperative physiological time series would result in more granular and personalized health representations that would improve prognostication compared to preoperative predictions. In a longitudinal cohort study of 56,242 patients undergoing 67,481 inpatient surgical procedures at a university medical center, we compared deep learning models with random forests and XGBoost for predicting nine common postoperative complications using preoperative, intraoperative, and perioperative patient data. Our study indicated several significant results across experimental settings that suggest the utility of deep learning for capturing more precise representations of patient health for augmented surgical decision support. Multi-task learning improved efficiency by reducing computational resources without compromising predictive performance. Integrated gradients interpretability mechanisms identified potentially modifiable risk factors for each complication. Monte Carlo dropout methods provided a quantitative measure of prediction uncertainty that has the potential to enhance clinical trust. Multi-task learning, interpretability mechanisms, and uncertainty metrics demonstrated potential to facilitate effective clinical implementation.© 2023. The Author(s).

[13]	王大广, 杨少康, 武平, 等. 基于机器学习算法建立直肠癌低位前切除术后吻合口漏早期诊断模型及其效能评价[J]. 中国实用外科杂志, 2025, 45(7):812-818. DOI:10.19538/j.cjps.issn1005-2208.2025.07.16. 本文引用 [2]

[14]

Desautels

, Calvert

, Hoffman

, et al. Prediction of sepsis in the intensive care unit with minimal electronic health record data: A machine learning approach[J]. JMIR Med Inform, 2016, 4(3):e28. DOI:10.2196/medinform.5909.

https://www.ncbi.nlm.nih.gov/pubmed/27694098

本文引用 [1] 摘要

Sepsis is one of the leading causes of mortality in hospitalized patients. Despite this fact, a reliable means of predicting sepsis onset remains elusive. Early and accurate sepsis onset predictions could allow more aggressive and targeted therapy while maintaining antimicrobial stewardship. Existing detection methods suffer from low performance and often require time-consuming laboratory test results.To study and validate a sepsis prediction method, InSight, for the new Sepsis-3 definitions in retrospective data, make predictions using a minimal set of variables from within the electronic health record data, compare the performance of this approach with existing scoring systems, and investigate the effects of data sparsity on InSight performance.We apply InSight, a machine learning classification system that uses multivariable combinations of easily obtained patient data (vitals, peripheral capillary oxygen saturation, Glasgow Coma Score, and age), to predict sepsis using the retrospective Multiparameter Intelligent Monitoring in Intensive Care (MIMIC)-III dataset, restricted to intensive care unit (ICU) patients aged 15 years or more. Following the Sepsis-3 definitions of the sepsis syndrome, we compare the classification performance of InSight versus quick sequential organ failure assessment (qSOFA), modified early warning score (MEWS), systemic inflammatory response syndrome (SIRS), simplified acute physiology score (SAPS) II, and sequential organ failure assessment (SOFA) to determine whether or not patients will become septic at a fixed period of time before onset. We also test the robustness of the InSight system to random deletion of individual input observations.In a test dataset with 11.3% sepsis prevalence, InSight produced superior classification performance compared with the alternative scores as measured by area under the receiver operating characteristic curves (AUROC) and area under precision-recall curves (APR). In detection of sepsis onset, InSight attains AUROC = 0.880 (SD 0.006) at onset time and APR = 0.595 (SD 0.016), both of which are superior to the performance attained by SIRS (AUROC: 0.609; APR: 0.160), qSOFA (AUROC: 0.772; APR: 0.277), and MEWS (AUROC: 0.803; APR: 0.327) computed concurrently, as well as SAPS II (AUROC: 0.700; APR: 0.225) and SOFA (AUROC: 0.725; APR: 0.284) computed at admission (P<.001 for all comparisons). Similar results are observed for 1-4 hours preceding sepsis onset. In experiments where approximately 60% of input data are deleted at random, InSight attains an AUROC of 0.781 (SD 0.013) and APR of 0.401 (SD 0.015) at sepsis onset time. Even with 60% of data missing, InSight remains superior to the corresponding SIRS scores (AUROC and APR, P<.001), qSOFA scores (P=.0095; P<.001) and superior to SOFA and SAPS II computed at admission (AUROC and APR, P<.001), where all of these comparison scores (except InSight) are computed without data deletion.Despite using little more than vitals, InSight is an effective tool for predicting sepsis onset and performs well even with randomly missing data.

[15]

Bedoya

, Futoma

, Clement

, et al. Machine learning for early detection of sepsis: An internal and temporal validation study[J]. JAMIA Open, 2020, 3(2):252-260. DOI:10.1093/jamiaopen/ooaa006.

https://www.ncbi.nlm.nih.gov/pubmed/32734166

本文引用 [1] 摘要

Determine if deep learning detects sepsis earlier and more accurately than other models. To evaluate model performance using implementation-oriented metrics that simulate clinical practice.We trained internally and temporally validated a deep learning model (multi-output Gaussian process and recurrent neural network [MGP-RNN]) to detect sepsis using encounters from adult hospitalized patients at a large tertiary academic center. Sepsis was defined as the presence of 2 or more systemic inflammatory response syndrome (SIRS) criteria, a blood culture order, and at least one element of end-organ failure. The training dataset included demographics, comorbidities, vital signs, medication administrations, and labs from October 1, 2014 to December 1, 2015, while the temporal validation dataset was from March 1, 2018 to August 31, 2018. Comparisons were made to 3 machine learning methods, random forest (RF), Cox regression (CR), and penalized logistic regression (PLR), and 3 clinical scores used to detect sepsis, SIRS, quick Sequential Organ Failure Assessment (qSOFA), and National Early Warning Score (NEWS). Traditional discrimination statistics such as the C-statistic as well as metrics aligned with operational implementation were assessed.The training set and internal validation included 42 979 encounters, while the temporal validation set included 39 786 encounters. The C-statistic for predicting sepsis within 4 h of onset was 0.88 for the MGP-RNN compared to 0.836 for RF, 0.849 for CR, 0.822 for PLR, 0.756 for SIRS, 0.619 for NEWS, and 0.481 for qSOFA. MGP-RNN detected sepsis a median of 5 h in advance. Temporal validation assessment continued to show the MGP-RNN outperform all 7 clinical risk score and machine learning comparisons.We developed and validated a novel deep learning model to detect sepsis. Using our data elements and feature set, our modeling approach outperformed other machine learning methods and clinical scores.© The Author(s) 2020. Published by Oxford University Press on behalf of the American Medical Informatics Association.

[16]	Tommaso V, Davide F, Marco N, et al. Super learner enhances postoperative complication prediction in colorectal surgery[J]. Ann Surg, 2025. Online ahead of print. DOI: 10.1097/SLA.0000000000006847. 本文引用 [1]

[17]	舒欣, 李昊洋, 李雨捷, 等. 基于机器学习建立腹部手术术后脓毒症患者死亡风险预测模型[J]. 陆军军医大学学报, 2023, 45(8): 732-738. DOI:10.16016/j.2097-0927.202212045. 本文引用 [1]

[18]	王孝乾, 齐文杰. 人工智能技术在脓毒症早期辅助诊断中应用的研究进展[J]. 中华危重病急救医学, 2024, 36(1):98-101. DOI:10.3760/cma.j.cn121430-20230411-00266 本文引用 [1]

[19]

Kim

, Lee

, Kim

, et al. Validation of an artificial intelligence-based algorithm for predictive performance and risk stratification of sepsis using real-world data from hospitalised patients: A prospective observational study[J]. BMJ Health Care Inform, 2025, 32(1):e101353. DOI:10.1136/bmjhci-2024-101353.

https://informatics.bmj.com/lookup/doi/10.1136/bmjhci-2024-101353

本文引用 [2] 摘要

The heterogeneous nature of sepsis renders determining its underlying causes difficult, which may delay diagnosis and intervention. VitalCare-SEPsis Score (VC-SEPS) is a deep learning-based algorithm that predicts sepsis and monitors patient conditions based on electronic medical record data. However, few studies have prospectively compared medical artificial intelligence software algorithms and traditional scoring systems to predict sepsis. This prospective observational study attempted to validate the predictive performance and risk stratification of VC-SEPS for early prediction of sepsis.

[20]	徐昉, 秦文健, 周发春. 关注人工智能在脓毒症诊治和解析中的应用价值[J]. 中华医学杂志, 2025, 105(33):2827-2830. DOI:10.3760/cma.j.cn112137-20250418-00961. 本文引用 [1]

[21]

Zhang

, Chen

, Sun

, et al. Identifying septic shock subgroups to tailor fluid strategies through multi-omics integration[J]. Nat Commun, 2024, 15(1):9028. DOI: 10.1038/s41467-024-53239-9.

https://www.ncbi.nlm.nih.gov/pubmed/39424794

本文引用 [1] 摘要

Fluid management remains a critical challenge in the treatment of septic shock, with individualized approaches lacking. This study aims to develop a statistical model based on transcriptomics to identify subgroups of septic shock patients with varied responses to fluid strategy. The study encompasses 494 septic shock patients. A benefit score is derived from the transcriptome space, with higher values indicating greater benefits from restrictive fluid strategy. Adherence to the recommended strategy is associated with a hazard ratio of 0.82 (95% confidence interval: 0.64-0.92). When applied to the baseline hospital mortality rate of 16%, adherence to the recommended fluid strategy could potentially lower this rate to 13%. A proteomic signature comprising six proteins is developed to predict the benefit score, yielding an area under the curve of 0.802 (95% confidence interval: 0.752-0.846) in classifying patients who may benefit from a restrictive strategy. In this work, we develop a proteomic signature with potential utility in guiding fluid strategy for septic shock patients.© 2024. The Author(s).

[22]

Zhang

, Chen

, Liu

, et al. Gene signature for the prediction of the trajectories of sepsis-induced acute kidney injury[J]. Crit Care, 2022, 26(1):398. DOI: 10.1186/s13054-022-04234-3.

本文引用 [1] 摘要

Acute kidney injury (AKI) is a common complication in sepsis. However, the trajectories of sepsis-induced AKI and their transcriptional profiles are not well characterized.

[23]	Hong Y, Chen L, Sun J, et al. Single-cell transcriptome profiling reveals heterogeneous neutrophils with prognostic values in sepsis[J]. iScience, 2022, 25(11):105301. DOI: 10.1016/j.isci.2022.105301. https://linkinghub.elsevier.com/retrieve/pii/S2589004222015735 本文引用 [1]

[24]

Lin

, Lu

, Wang

, et al. Predictive model of acute kidney injury in critically ill patients with acute pancreatitis: A machine learning approach using the MIMIC-Ⅳ database[J]. Ren Fail, 2024, 46(1):2303395. DOI:10.1080/0886022X.2024.2303395.

https://www.tandfonline.com/doi/full/10.1080/0886022X.2024.2303395

本文引用 [1]

[25]

Liu

, Zhu

, Xue

, et al. Machine learning models for mortality prediction in critically ill patients with acute pancreatitis-associated acute kidney injury[J]. Clin Kidney J, 2024, 17(10):sfae284. DOI:10.1093/ckj/sfae284.

https://academic.oup.com/ckj/article/doi/10.1093/ckj/sfae284/7755412

本文引用 [1] 摘要

The occurrence of acute kidney injury (AKI) was associated with an increased mortality rate among acute pancreatitis (AP) patients, indicating the importance of accurately predicting the mortality rate of critically ill patients with acute pancreatitis–associated acute kidney injury (AP-AKI) at an early stage. This study aimed to develop and validate machine learning–based predictive models for in-hospital mortality rate in critically ill patients with AP-AKI by comparing their performance with the traditional logistic regression (LR) model.

[26]	Hadaya J, Verma A, Sanaiha Y, et al. Machine learning-based modeling of acute respiratory failure following emergency general surgery operations[J]. PloS One, 2022, 17(4):e267733. DOI:10.1371/journal.pone.0267733. 本文引用 [1]

[27]

Pappada

, Sathelly

, Schmieder

, et al. An artificial neural network approach to diagnose and predict liver dysfunction and failure in the critical care setting[J]. Hippokratia, 2024, 28(1):1-10. PMID:39399402.

https://www.ncbi.nlm.nih.gov/pubmed/39399402

本文引用 [1] 摘要

Detecting liver dysfunction/failure in the intensive care unit poses a challenge as individuals afflicted with these conditions often appear symptom-free, thereby complicating early diagnoses and contributing to unfavorable patient outcomes. The objective of this endeavor was to improve the chances of early diagnosis of liver dysfunction/failure by creating a predictive model for the critical care setting. This model has been designed to produce an index that reflects the probability of severe liver dysfunction/failure for patients in intensive care units, utilizing machine learning techniques.This effort used comprehensive open-access patient databases to build and validate machine learning-based models for predicting the likelihood of severe liver dysfunction/failure. Two artificial neural network model architectures that derived a novel 0-100 Liver Failure Risk Index were developed and validated using the comprehensive patient databases. Data used to train and develop the models included clinical (patient vital signs) and laboratory results related to liver function which included liver function test results. The performance of the developed models was compared in terms of sensitivity, specificity, and the mean lead time to diagnosis.The best model performance demonstrated an 83.3 % sensitivity and a specificity of 77.5 % in diagnosing severe liver dysfunction/failure. This model accurately identified these patients a median of 17.5 hours before their clinical diagnosis, as documented in their electronic health records. The predictive diagnostic capability of the developed models is crucial to the intensive care unit setting, where treatment and preventative interventions can be made to avoid severe liver dysfunction/failure.Our machine learning approach facilitates early and timely intervention in the hepatic function of critically ill patients by their healthcare providers to prevent or minimize associated morbidity and mortality. HIPPOKRATIA 2024, 28 (1):1-10.Copyright 2024, Hippokratio General Hospital of Thessaloniki.

[28]	王亚峰, 李世朋, 王连才, 等. 可解释性机器学习模型在肝内胆管癌术后肝功能衰竭预测中的应用[J]. 中华肝胆外科杂志, 2025, 31(12):912-917. DOI:10.3760/cma.j.cn113884-20250425-00133. 本文引用 [1]

[29]	Li L, He H, Xiang L, et al. A warning model for predicting patient admissions to the intensive care unit (ICU) following surgery[J]. Perioper Med, 2025, 14(1):60. DOI: 10.1186/s13741-025-00544-6. 本文引用 [1]

[30]

Park

, Baik

, Hong

, et al. Development and external validation of an artificial intelligence model for predicting mortality and prolonged ICU stay in postoperative critically ill patients: A retrospective study[J]. World J Emerg Surg, 2025, 20(1):79. DOI:10.1186/s13017-025-00650-2.

https://www.ncbi.nlm.nih.gov/pubmed/41094587

本文引用 [1] 摘要

Existing predictive models in critical care, specifically for postoperative critically ill patients, often struggle to accurately predict prolonged intensive care unit (ICU) stays, a key aspect of patient care. The integration of artificial intelligence (AI) offers a promising approach for bridging this gap. We aimed to develop an AI-based model to predict mortality and prolonged ICU stay in postoperative critically ill patients, enhance prognostic accuracy, and address the shortcomings of current models.This retrospective study included data from 6,029 postoperative critically ill patients from two medical centers, including a wide range of clinical, surgical, and laboratory variables. Multiple machine-learning models, including extreme gradient boosting, light gradient boosting, category boosting, random forest, and multilayer perceptron, were employed. A soft-voting ensemble model was developed to aggregate the strengths of individual models. The models underwent external validation, and the SHapley Additive exPlanations (SHAP) method was utilized to assess the impact of various features on predictions.In internal validation, the ensemble model demonstrated superior performance with an area under the receiver operating characteristic curve (AUROC) of 0.8812 for mortality and 0.7944 for prolonged ICU stay. It achieved 0.9095 accuracy and an F1 score of 0.7014 for mortality predictions. For prolonged ICU stay, it attained an accuracy of 0.9368 and an F1 score of 0.5762. During external validation, the model maintained high performance, with an AUROC of 0.8330 for mortality and 0.7376 for prolonged ICU stay. It showed 0.9200 accuracy and an F1 score of 0.6768 for mortality and 0.9028 accuracy with an F1 score of 0.5689 for prolonged ICU stay. SHAP analysis confirmed that key predictors, including emergency surgery, serum osmolality, lactate levels, and diastolic blood pressure, remained significant.This study represents a significant advancement in the application of AI in critical care, especially for postoperative critically ill patients. The developed AI model outperformed existing models in predicting mortality and prolonged ICU stay, demonstrating notable accuracy and reliability. Its ability to identify critical, under-emphasized clinical factors could enhance decision-making in critical care settings. Although promising, further validation in diverse clinical settings is essential to confirm the model's efficacy and broader applicability.© 2025. The Author(s).

[31]

Ive

, Olukoya

, Funnell

, et al. AI assisted prediction of unplanned intensive care admissions using natural language processing in elective neurosurgery[J]. NPJ Digit Med, 2025, 8(1):549. DOI:10.1038/s41746-025-01952-0.

https://www.ncbi.nlm.nih.gov/pubmed/40858789

本文引用 [1] 摘要

Timely care in a specialised neuro-intensive therapy unit (ITU) reduces mortality and hospital stays. Planned admissions to ITU following surgery are safer than unplanned ones. However, post-operative care decisions remain subjective. This study used artificial intelligence (AI), specifically natural language processing (NLP) to analyse electronic health records (EHRs) of elective neurosurgery patients from University College London Hospital (UCLH) and predict ITU admissions. Using a refined CogStack-MedCAT NLP model, we extracted clinical concepts from 2268 patient records and trained AI models to classify admissions into ward and ITU. The Random Forest model achieved a recall of 0.87 (CI 0.82-0.91) for ITU admissions, reducing the proportion of unplanned ITU cases missed by human experts from 36% to 4%. Interpretability analysis confirmed the use of clinically relevant concepts. The study highlights the opportunity for AI to aid in allocating resources for neurosurgical patients but requires further research and integration into practice.© 2025. The Author(s).

[32]

Pirracchio

, Hubbard

, Sprung

, et al. Assessment of machine learning to estimate the individual treatment effect of corticosteroids in septic shock[J]. JAMA Netw Open, 2020, 3(12): e2029050. DOI:10.1001/jamanetworkopen.2020.29050.

https://jamanetwork.com/journals/jamanetworkopen/fullarticle/2773970

本文引用 [1]

[33]

Komorowski

, Celi

, Badawi

, et al. The Artificial Intelligence Clinician learns optimal treatment strategies for sepsis in intensive care[J]. Nat Med, 2018, 24(11):1716-1720. DOI: 10.1038/s41591-018-0213-5.

https://www.ncbi.nlm.nih.gov/pubmed/30349085

本文引用 [1] 摘要

Sepsis is the third leading cause of death worldwide and the main cause of mortality in hospitals, but the best treatment strategy remains uncertain. In particular, evidence suggests that current practices in the administration of intravenous fluids and vasopressors are suboptimal and likely induce harm in a proportion of patients. To tackle this sequential decision-making problem, we developed a reinforcement learning agent, the Artificial Intelligence (AI) Clinician, which extracted implicit knowledge from an amount of patient data that exceeds by many-fold the life-time experience of human clinicians and learned optimal treatment by analyzing a myriad of (mostly suboptimal) treatment decisions. We demonstrate that the value of the AI Clinician's selected treatment is on average reliably higher than human clinicians. In a large validation cohort independent of the training data, mortality was lowest in patients for whom clinicians' actual doses matched the AI decisions. Our model provides individualized and clinically interpretable treatment decisions for sepsis that could improve patient outcomes.

[34]

McLean

, Sgrò

, Brown

, et al. Multimodal machine learning to predict surgical site infection with healthcare workload impact assessment[J]. NPJ Digit Med, 2025, 8(1):121. DOI: 10.1038/s41746-024-01419-8.

https://www.ncbi.nlm.nih.gov/pubmed/39988586

本文引用 [1] 摘要

Remote monitoring is essential for healthcare digital transformation, however, this poses greater burdens on healthcare providers to review and respond as the data collected expands. This study developed a multimodal neural network to automate assessments of patient-generated data from remote postoperative wound monitoring. Two interventional studies including adult gastrointestinal surgery patients collected wound images and patient-reported outcome measures (PROMs) for 30-days postoperatively. Neural networks for PROMs and images were combined to predict surgical site infection (SSI) diagnosis within 48 h. The multimodal neural network model to predict confirmed SSI within 48 h remained comparable to clinician triage (0.762 [0.690-0.835] vs 0.777 [0.721-0.832]), with an excellent performance on external validation. Simulated usage indicated an 80% reduction in staff time (51.5 to 9.1 h) without compromising diagnostic accuracy. This multimodal approach can effectively support remote monitoring, alleviating provider burden while ensuring high-quality postoperative care.© 2025. The Author(s).

[35]

Mascarenhas

, Mendes

, Fonseca

, et al. A novel deep learning model for predicting colorectal anastomotic leakage: A pioneer multicenter transatlantic study[J]. J Clin Med, 2025, 14(15):5462. DOI: 10.3390/jcm14155462.

https://www.mdpi.com/2077-0383/14/15/5462

本文引用 [1] 摘要

Background/Objectives: Colorectal anastomotic leak (CAL) is one of the most severe postoperative complications in colorectal surgery, impacting patient morbidity and mortality. Current risk assessment methods rely on clinical and intraoperative factors, but no real-time predictive tool exists. This study aimed to develop an artificial intelligence model based on intraoperative laparoscopic recording of the anastomosis for CAL prediction. Methods: A convolutional neural network (CNN) was trained with annotated frames from colorectal surgery videos across three international high-volume centers (Instituto Português de Oncologia de Lisboa, Hospital das Clínicas de Ribeirão Preto, and Royal Liverpool University Hospital). The dataset included a total of 5356 frames from 26 patients, 2007 with CAL and 3349 showing normal anastomosis. Four CNN architectures (EfficientNetB0, EfficientNetB7, ResNet50, and MobileNetV2) were tested. The models’ performance was evaluated using their sensitivity, specificity, accuracy, and area under the receiver operating characteristic (AUROC) curve. Heatmaps were generated to identify key image regions influencing predictions. Results: The best-performing model achieved an accuracy of 99.6%, AUROC of 99.6%, sensitivity of 99.2%, specificity of 100.0%, PPV of 100.0%, and NPV of 98.9%. The model reliably identified CAL-positive frames and provided visual explanations through heatmaps. Conclusions: To our knowledge, this is the first AI model developed to predict CAL using intraoperative video analysis. Its accuracy suggests the potential to redefine surgical decision-making by providing real-time risk assessment. Further refinement with a larger dataset and diverse surgical techniques could enable intraoperative interventions to prevent CAL before it occurs, marking a paradigm shift in colorectal surgery.

[36]

Alba

, Xue

, Abraham

, et al. The foundational capabilities of large language models in predicting postoperative risks using clinical notes[J]. NPJ Digit Med, 2025, 8(1):95. DOI: 10.1038/s41746-025-01489-2.

https://www.ncbi.nlm.nih.gov/pubmed/39934379

本文引用 [1] 摘要

Clinical notes recorded during a patient's perioperative journey holds immense informational value. Advances in large language models (LLMs) offer opportunities for bridging this gap. Using 84,875 preoperative notes and its associated surgical cases from 2018 to 2021, we examine the performance of LLMs in predicting six postoperative risks using various fine-tuning strategies. Pretrained LLMs outperformed traditional word embeddings by an absolute AUROC of 38.3% and AUPRC of 33.2%. Self-supervised fine-tuning further improved performance by 3.2% and 1.5%. Incorporating labels into training further increased AUROC by 1.8% and AUPRC by 2%. The highest performance was achieved with a unified foundation model, with improvements of 3.6% for AUROC and 2.6% for AUPRC compared to self-supervision, highlighting the foundational capabilities of LLMs in predicting postoperative risks, which could be potentially beneficial when deployed for perioperative care.© 2025. The Author(s).

[37]

Pinsky

, Bedoya

, Bihorac

, et al. Use of artificial intelligence in critical care: Opportunities and obstacles[J]. Crit Care, 2024, 28(1):113. DOI:10.1186/s13054-024-04860-z.

本文引用 [1] 摘要

Perhaps nowhere else in the healthcare system than in the intensive care unit environment are the challenges to create useful models with direct time-critical clinical applications more relevant and the obstacles to achieving those goals more massive. Machine learning-based artificial intelligence (AI) techniques to define states and predict future events are commonplace activities of modern life. However, their penetration into acute care medicine has been slow, stuttering and uneven. Major obstacles to widespread effective application of AI approaches to the real-time care of the critically ill patient exist and need to be addressed.

[38]	Athukorala VS, Ilmini WMKS. Explainable AI for critical care: A systematic review of interpretable models for sepsis and ICU mortality prediction[J]. BMC Med Inform Decis Mak, 2026, 26(1):64. DOI:10.1186/s12911-026-03344-0. 本文引用 [1]

[39]	Cecconi M, Greco M, Shickel B, et al. Implementing artificial intelligence in critical care medicine: A consensus of 22[J]. Crit Care, 2025, 29(1):290. DOI:10.1186/s13054-025-05532-2. 本文引用 [1]

脚注

利益冲突所有作者均声明不存在利益冲突

基金

新发突发与重大传染病防控国家科技重大专项(2025ZD01902501)

国家重点研发计划项目(2023YFC3603104)

国家自然科学基金项目(82272180)

国家自然科学基金项目(82472243)

浙江大学龙泉创新中心项目(ZJDXLQCXZCJBGS2024016)

浙江省基础公益研究计划浙江省自然科学基金华东医药联合基金/重大项目(LHDMD24H150001)

国家中医药管理局科技司-浙江省中医药管理局共建科技计划项目(GZY-ZJ-K1-24082)

浙江省医药卫生科技计划项目(2024KY1099)

浙江省科技厅“尖兵领雁”重点研发攻关计划项目(2024C03240)

北京市自然科学基金项目(7252298)

吴阶平医学基金会专项研究基金项目(320.6750.2024-23-07)

中央高校基本科研业务项目(226-2025-00024)

浙江省疾病预防控制科技计划项目(2026JKZ042)

PDF(2285 KB)

Accesses

Citation

Detail

段落导航

收稿日期	出版日期
2026-03-14	2026-05-01
发布日期
2026-09-28

选择文件类型/文献管理软件名称

选择包含的内容

摘要

Abstract

关键词

Key words

引用本文

{{custom_sec.title}}

{{custom_sec.title}}

参考文献

脚注

基金

联系方式

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

摘要

Abstract

关键词

Key words

引用本文

{{custom_sec.title}}

{{custom_sec.title}}

参考文献

脚注

基金

联系方式