Computer vision-based artificial intelligence (AI) systems have been used successfully for various purposes, though their utility for analysis or safety support [1, 2] during surgical procedures is still in early evaluation. These investigations are important; for example, during laparoscopic cholecystectomy (LC), adverse events such as bile duct injury, bile leakage, bleeding, and bowel injury occur at a rate of 1.5% [3,4,5,6,7,8,9].

An AI system that can recognize surgical phases may be used for many important tasks, including quality measurement, recording and analysis of adverse events, education, statistics, surgical performance evaluation, and more. Currently, these tasks are performed manually, in a time-consuming fashion, by expert surgeons. Use of the system during surgery would further enable real-time monitoring and assisted decision making, which may increase safety and improve patient outcomes. For example, a real-time assistive system could alert the surgeon to an incorrect plane of dissection, a wrong maneuver, or an upcoming complication. Such a system might also serve as a context-aware decision support system by providing early warnings in case of misorientation or other unexpected events. As a specific example in LC, achieving the Critical View of Safety (CVS) is the recommended strategy for minimizing the risk of Bile Duct Injury (BDI) [10, 11]. A system that can detect and verify that CVS has been achieved is therefore potentially quite valuable. The system could also optimize operating room (OR) utilization and staff scheduling, and provide administrative assistance by analyzing the progress of an operation and more accurately predicting the time required for procedure completion.

Computer vision (CV) algorithms have recently shown success in recognizing surgical phases in LC procedures without adverse events [12, 13], and have displayed promising results in verifying CVS during LC [14]. While these works [12, 13] have focused primarily on LC procedures without complications and adverse events, our hypothesis was that CV could also recognize surgical phases in more complex LC procedures. We therefore developed an AI system to recognize the major phases of both straightforward and complicated LC procedures, the latter carrying potentially higher morbidity rates.

Methods

Dataset

We constructed a dataset of 448 cholecystectomy videos: 368 videos collected from four hospitals in Israel and 80 videos from the publicly available Cholec80 dataset [12], collected from a hospital in France. The videos were recorded between November 1, 2010 and October 1, 2020. Eligibility criteria were laparoscopic cholecystectomy for biliary colic or acute and chronic cholecystitis, performed in patients 18 years of age or older. After excluding videos that could not be annotated consistently by surgeons (see the Annotation section), 371 videos remained and were used for this work. The dataset was split in an 80:20 ratio for training and testing the AI model, with the splits stratified by surgical complexity, institution, and adverse events during surgery (see the Annotation section below). The splitting was performed on a per-case rather than a per-frame level; that is, frames from a video in the training set did not appear in the test set.
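As a rough illustration of such a per-case stratified split (not the authors' code; the field names and the use of scikit-learn are our assumptions), one can stratify on a composite key built from institution, complexity level, and presence of adverse events:

```python
# Minimal sketch of a per-case 80:20 split; field names and values are assumed.
# Splitting whole videos guarantees that frames from one case never appear in
# both the training and test sets.
from sklearn.model_selection import train_test_split

def split_cases(videos, seed=0):
    """videos: list of dicts, one per case, e.g.
    {"video_id": "v001", "hospital": "H1", "complexity": 2, "adverse_event": False}"""
    # Composite key so the split is stratified jointly by institution,
    # complexity level, and presence of adverse events.
    strata = [f'{v["hospital"]}|{v["complexity"]}|{v["adverse_event"]}' for v in videos]
    return train_test_split(videos, test_size=0.2, stratify=strata, random_state=seed)

# train_videos, test_videos = split_cases(all_videos)
```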

IRB approval was granted prior to commencing the study.

Annotation of surgical phases, adverse events, and level of surgical complexity

Surgical phases and adverse events annotation

All datasets (including the publicly available Cholec80) were annotated. The relevant phases and the annotation process were determined via consensus of a group of three experienced senior surgeons (years of experience: 35, 34, and 20), who were distinct from the surgeons who annotated the videos (described below). Each video was annotated according to the following phases: (1) trocar insertion, (2) preparation, (3) Calot triangle dissection, (4) clipping and cutting, (5) gallbladder dissection, (6) gallbladder packaging, (7) cleaning and hemostasis, and (8) gallbladder extraction. Additionally, two special phases were used in annotation. First, segments in which the camera was not placed inside the body were annotated as “out of body”. Second, segments in which the camera was not focused on tools and no surgical action was being performed were annotated as “idle”.
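For concreteness, the resulting ten-class frame-level label space can be written out as follows (the identifiers are ours, chosen for illustration, not taken from the annotation software):

```python
# The 10 frame-level classes: 8 surgical phases plus the two special segments.
PHASES = [
    "trocar_insertion",
    "preparation",
    "calot_triangle_dissection",
    "clipping_and_cutting",
    "gallbladder_dissection",
    "gallbladder_packaging",
    "cleaning_and_hemostasis",
    "gallbladder_extraction",
    "out_of_body",
    "idle",
]
PHASE_TO_INDEX = {name: i for i, name in enumerate(PHASES)}
```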

To analyze the ability of the AI model to recognize the major surgical phases in videos of abnormal or challenging LC procedures, a set of important adverse events was also identified by the expert surgeons. The experts agreed on the following list of adverse events, which were annotated where present: (1) major bleeding, (2) gallbladder perforation, (3) major bile leakage, and (4) incidental finding.

Surgical complexity annotation

In addition to the phases and adverse events described above, annotations were also collected for the complexity level of each procedure. The complexity level was scored on a scale of 1–5 based on intraoperative parameters. The factors used to determine the complexity level included the state of the gallbladder (based on the Parkland Grading Scale for grading still images of cholecystitis [15, 16]), presence of intra-abdominal adhesions, normality of anatomy, duct closure device utilized, performance of intraoperative cholangiography, requirement for partial or open cholecystectomy, and intraoperative adverse events. A detailed mapping from each procedure to its complexity level is provided in Table 1 in the Supplement. The annotations of complexity levels and complications were used to assess the AI model’s ability to accurately recognize the surgical phases in complex LC procedures.

Critical view of safety annotation

The last annotation task was the annotation of the Critical View of Safety (CVS), if achieved, during the Calot triangle dissection phase. We followed the three criteria defined by SAGES to annotate achievement of CVS: (1) the hepatocystic triangle is cleared of fat and fibrous tissue; (2) the lower third of the gallbladder is separated from the liver to expose the cystic plate; and (3) exactly two structures are seen entering the gallbladder.

Annotation quality

The annotations were performed by 13 surgeons with at least 4 years of experience (median: 7, range: 4–15) in general surgery. Annotator training included understanding the definition of each phase and adverse event; learning how to indicate the start and end of each phase; and becoming familiar with the annotation software. To validate the quality of the annotations, each video was annotated by two annotators, and the inter-rater agreement score between them was calculated. The inter-rater agreement score is defined as the number of frames annotated with the same phase label by the two annotators, divided by the total number of annotated frames in the video. Videos with an agreement score below 80% (n = 77) were excluded to arrive at the final set of 371 videos in the dataset (Table 1 and eTable 2 in the Supplement). The videos excluded from the main analysis are analyzed in eFigure 1 in the Supplement.
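A minimal sketch of this agreement score as defined above; treating frames left unannotated by either rater as excluded from the count is our assumption:

```python
def inter_rater_agreement(labels_a, labels_b):
    """Fraction of frames assigned the same phase label by the two annotators of one video.

    labels_a, labels_b: per-frame phase labels; None marks a frame left
    unannotated by that annotator (assumed to be excluded from the score).
    """
    pairs = [(a, b) for a, b in zip(labels_a, labels_b)
             if a is not None and b is not None]
    return sum(a == b for a, b in pairs) / len(pairs)

# Videos with inter_rater_agreement(...) < 0.80 were excluded from the final dataset.
```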

Table 1 General characteristics of the collected dataset

Deep learning model architecture

Our model takes a video as input and categorizes each frame into 1 of the 10 phases described above. This is achieved using a two-stage setup (Fig. 1). The first stage extracts visual features from single frames of the video, without any temporal context; that is, this stage of the model has no sense of what is happening before or after a given frame. The second stage aggregates temporal information from neighboring frames; that is, it is at this stage that the model incorporates information from both before and after a given frame to understand which surgical phase the current frame shows.

Fig. 1

Overview of our proposed neural network, MS-TCN—Multi-Stage Temporal Convolution Network [22]. a The LC video is processed at 1 frame per second (fps). b Each frame is fed into a deep convolutional neural network—Resnet50 [21]. The Resnet50 model is trained to classify each frame’s associated surgical phase independently. Following training, the last prediction layer of the Resnet50 is removed, and all the network parameters are frozen (not trainable subsequently). For each frame, the Resnet50 produces a numerical “feature vector” that expresses the visual information content of the frame in a lower-dimensional form than the original frame. c All feature vectors from the input LC video are combined to form a sequence of feature vectors representing the entire LC video. This sequence is fed into the MS-TCN model, which consists of temporal convolution layers with a dilation rate that increases across layers. The temporal convolution layers capture temporal connections, and the increasing dilation enables the capture of long-term temporal dependencies. The final layer of the MS-TCN model outputs the surgical phase prediction for each frame in the video

First stage: feature extraction model

Deep convolutional neural networks [17, 18] have recently shown state-of-the-art results on image classification tasks [19, 20]. While classical classification models rely on extracting hand-crafted features (colors, corners, edges, etc.) and combining them as inputs to supervised machine learning models, deep neural networks learn the features themselves from the raw data. The extracted features are thus optimized to improve classification performance. In this work, we apply a deep residual convolutional neural network architecture called Resnet50 [21] to extract features from LC frames. Given a single frame taken from a cholecystectomy procedure as input, the goal of the Resnet50 model is to output the estimated likelihood of the frame belonging to each of the 10 phases. In other words, this model is trained to predict the cholecystectomy phase from single frames. When the training of the Resnet50 model is complete, the network weights are frozen (i.e., fixed for the remainder of the learning procedure), and the last prediction layer is removed. The resulting network then outputs a single feature vector for each raw cholecystectomy frame.
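As a rough PyTorch sketch of this stage (not the authors' implementation; the input resolution and ImageNet initialization are our assumptions), the classifier head is replaced after training so the network emits one feature vector per frame:

```python
import torch
import torchvision

NUM_PHASES = 10

# Train a Resnet50 classifier over the 10 phases on single frames.
backbone = torchvision.models.resnet50(weights="IMAGENET1K_V2")  # pretraining assumed
backbone.fc = torch.nn.Linear(backbone.fc.in_features, NUM_PHASES)
# ... train `backbone` on individual frames with a cross-entropy loss ...

# After training: drop the prediction layer and freeze all weights.
backbone.fc = torch.nn.Identity()
for p in backbone.parameters():
    p.requires_grad = False
backbone.eval()

with torch.no_grad():
    frame = torch.randn(1, 3, 224, 224)   # one RGB frame, resized (size assumed)
    feature = backbone(frame)             # shape: (1, 2048) feature vector
```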

Second stage: temporal aggregation model

While the Resnet50 model learns to identify the surgical phases based only on information from a single frame, our goal was to also incorporate the temporal patterns across LC videos. This is because frames before and after a given frame are often helpful, or even necessary, for understanding that frame. For example, the gallbladder dissection phase must precede other phases of the procedure. Thus, frames in which the gallbladder appears, but which precede the gallbladder dissection phase, are unlikely to be part of the gallbladder packaging or gallbladder extraction phases.

For the temporal aggregation stage, we utilize a temporal convolution network [22, 23], which recently achieved state-of-the-art results on temporal action segmentation tasks in general videos [24, 25] as well as on surgical phase detection in the Cholec80 dataset [26]. (We note in passing that temporal information was also shown to be useful in analyzing suturing videos [27].) In our case we employ a variant of this architecture known as the Multi-Stage Temporal Convolution Network (MS-TCN) [22] (Fig. 1). The MS-TCN consists of multiple stages, where each stage is composed of so-called “dilated temporal convolution blocks”. The purpose of these blocks is to efficiently aggregate information from the entirety of the procedure, allowing the model to learn temporal dependencies over the whole cholecystectomy video. The network is multi-stage in the sense that each stage outputs an initial prediction that is refined by the next one. The input to the MS-TCN is a sequence of feature vectors, one for each frame of the video, as described in the previous section; the output is a phase prediction for each frame in the input sequence.
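The dilated temporal convolution idea can be sketched as follows; the layer counts, channel widths, and residual arrangement below are illustrative choices, not values taken from the paper:

```python
import torch
from torch import nn

class DilatedResidualBlock(nn.Module):
    """One dilated temporal convolution block (sketch of the MS-TCN building block)."""
    def __init__(self, channels, dilation):
        super().__init__()
        # Padding equal to the dilation keeps the temporal length unchanged.
        self.conv = nn.Conv1d(channels, channels, kernel_size=3,
                              padding=dilation, dilation=dilation)
        self.pointwise = nn.Conv1d(channels, channels, kernel_size=1)

    def forward(self, x):                      # x: (batch, channels, time)
        return x + self.pointwise(torch.relu(self.conv(x)))

class TCNStage(nn.Module):
    """One stage: the dilation doubles layer by layer, so the receptive field
    grows exponentially and can cover the whole video."""
    def __init__(self, in_dim, channels, num_classes, num_layers=10):
        super().__init__()
        self.proj = nn.Conv1d(in_dim, channels, kernel_size=1)
        self.blocks = nn.ModuleList(
            [DilatedResidualBlock(channels, dilation=2 ** i) for i in range(num_layers)]
        )
        self.out = nn.Conv1d(channels, num_classes, kernel_size=1)

    def forward(self, x):
        x = self.proj(x)
        for block in self.blocks:
            x = block(x)
        return self.out(x)                     # per-frame phase logits

# Multi-stage refinement: each later stage refines the previous stage's predictions.
features = torch.randn(1, 2048, 600)           # 600 frames at 1 fps, i.e., a 10-minute clip
stage1 = TCNStage(in_dim=2048, channels=64, num_classes=10)
stage2 = TCNStage(in_dim=10, channels=64, num_classes=10)
logits = stage2(torch.softmax(stage1(features), dim=1))   # (1, 10, 600)
```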

Statistical analysis

We evaluated our model on the test set using the accuracy metric. Accuracy quantifies the fraction of frames with correctly classified phases and is defined as the number of correctly classified frames divided by the total number of evaluated frames. On average, a small fraction (0.16%) of the frames in each video was not annotated, owing to the difficulty of selecting precise start/end frames for annotation in a way that eliminates unannotated gaps. Accuracy was calculated for both the first-stage (Resnet50) model alone and the second-stage (MS-TCN) model. Frame-level accuracy was computed per video and then averaged over all videos to ensure that each video was weighted equally. For error bars, we computed the 95% empirical confidence interval (CI) by bootstrapping across videos. To place the model’s accuracy in perspective, each video was annotated by a second surgeon (as mentioned in the Annotation section). The inter-surgeon agreement was then computed by evaluating the second surgeon’s annotations per video against the first surgeon’s, and similarly averaging across all videos.
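A minimal sketch of this evaluation procedure; the number of bootstrap resamples and the handling of unannotated frames are our assumptions:

```python
import numpy as np

def video_accuracy(pred, true):
    """Frame-level accuracy for one video, ignoring unannotated frames (None)."""
    pairs = [(p, t) for p, t in zip(pred, true) if t is not None]
    return sum(p == t for p, t in pairs) / len(pairs)

def mean_accuracy_with_ci(per_video_acc, n_boot=10_000, seed=0):
    """Mean of per-video accuracies with a 95% empirical bootstrap CI across videos."""
    rng = np.random.default_rng(seed)
    acc = np.asarray(per_video_acc)
    boot = [rng.choice(acc, size=len(acc), replace=True).mean() for _ in range(n_boot)]
    return acc.mean(), np.percentile(boot, [2.5, 97.5])
```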

Results

The first stage (Resnet50) model achieved overall classification accuracy of 78% [95% CI 75.8%, 80.1%] on the test set. The second stage (MS-TCN) model, which incorporates temporal information across the whole video, obtained higher accuracy, reaching 89% [95% CI 87.1%, 90.6%] accuracy on the test set.

Figure 2 shows the per-phase confusion matrix obtained with the full two-stage model. The per-phase values were computed across all frames of each video and then averaged over all videos in the test set. We noted that the model successfully detected the most critical phases—Calot triangle dissection, clipping and cutting, and gallbladder dissection—with accuracies of 92%, 82%, and 96%, respectively. For the preparation phase, the model reached 80% accuracy; however, 12% of the preparation frames were incorrectly predicted as part of the Calot triangle dissection phase. We note that these erroneous predictions are distributed along the transition between the two phases.
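A sketch of how such a matrix can be computed; averaging row-normalized per-video matrices follows our reading of the description above, and the handling of videos that lack a given phase is an assumption:

```python
import numpy as np

def per_video_confusion(true, pred, num_phases=10):
    """Row-normalized confusion matrix for one video: row i gives, for frames whose
    true phase is i, the fraction predicted as each phase."""
    cm = np.zeros((num_phases, num_phases))
    for t, p in zip(true, pred):
        cm[t, p] += 1
    row_sums = cm.sum(axis=1, keepdims=True)
    # Rows for phases absent from the video are left as zeros (assumption).
    return np.divide(cm, row_sums, out=np.zeros_like(cm), where=row_sums > 0)

# Averaging the per-video matrices over the test set yields the matrix in Fig. 2;
# its diagonal holds the per-phase accuracies quoted in the text.
```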

Fig. 2

Normalized confusion matrix showing the accuracy achieved separately for each surgical phase. The critical phases in an LC procedure are in bold; 92% of the Calot triangle dissection phase frames were correctly classified and 96% of the gallbladder dissection frames were correctly classified. (Please note that the rows of the confusion matrix do not precisely sum to 1 due to rounding.)

As described in the Annotation section, the complexity level of each LC video in the dataset was annotated on a scale of 1–5. Figure 3A shows the mean accuracy of our MS-TCN model (orange bars) and the inter-rater agreement score (blue bars) on the test set videos, stratified by complexity. As the complexity increases from 1 to 3, the model’s accuracy decreases linearly from 92% [95% CI 90.2%, 94.0%] to 88% [95% CI 81.8%, 92.3%]. At complexity levels 4 and 5, the model accuracy was 81% [95% CI 78.9%, 83.1%].

Fig. 3

Accuracy of the MS-TCN and the inter-rater agreement score stratified by LC complexity level. a For the simplest LC procedures, the MS-TCN reaches 92% accuracy, similar to the agreement score between the annotators. As expected, as the complexity level increases, the accuracy of the MS-TCN model decreases. Nevertheless, for the most complex LC procedures (levels 4 and 5), the MS-TCN model reaches 81% accuracy. b Results stratified by institution; each marker in the graph represents a different hospital source. The x-value of each marker is the average complexity level of LC procedures for the given source hospital. The y-value of each marker is the average accuracy achieved by the AI model on LC procedures for the given source hospital

The inter-rater agreement (between expert surgeons) ranged from 92% on LC procedures with a complexity level of 1 to 90% on LC procedures with a complexity level of 5. Figure 3A shows that, for simple LC procedures, the AI model matches a surgeon’s ability to recognize surgical phases. However, on complex LC procedures, the surgeons are superior to the AI model: the annotator agreement score is 9 percentage points higher than the accuracy of the AI model.

We evaluated how adverse events during LC procedures affect the ability of the AI model to recognize the surgical phases. Figure 4 shows the overall accuracy of the MS-TCN model on the test set, stratified by adverse events in LC procedures. The model reached an accuracy of 87% [95% CI 82.5%, 90.7%] on videos with a gallbladder perforation event, 77% on a single video with a major bile leakage event, 86% [95% CI 76.8%, 94.1%] on videos with an incidental finding, and 89% [95% CI 88.7%, 92.6%] on procedures with cholecystitis (blue bars). On videos without adverse events (green bar), the model reached a mean accuracy of 90% [95% CI 88.0%, 91.7%]. Thus, as expected, the AI model attained a lower accuracy in LC procedures with adverse events.

Fig. 4

AI model accuracy stratified by presence of adverse events. Blue bars represent LC procedures where at least one adverse event occurred. The green bar represents regular LC procedures without occurrence of any adverse event. It can be seen that the AI model is robust to LC procedures with adverse events, reaching almost the same accuracy as in regular LC procedures. We note that only one procedure in the dataset contained a major bile leakage event

The last research question we wanted to address was how LC procedures from different hospitals affect the ability of an AI model to recognize surgical phases. As described in the Dataset section, our dataset was composed of procedures from five hospitals. As may be expected, some variation was noted in the instruments used as well as in surgical technique. This made the task of identifying the surgical phases more challenging.

Figure 3B shows the overall accuracy of the MS-TCN model, according to both the source hospital and the average complexity level. The model attained an accuracy of 86% [95% CI 83.1%, 88.7%] on videos from hospital #1, 89% [95% CI 83.8%, 93.1%] on videos from hospital #2, 91.5% [95% CI 88.3%, 94.3%] on videos from hospital #3, and 89% [95% CI 84.8%, 93.0%] on videos from hospital #4. On videos from the Cholec80 [12] dataset, our model reached an accuracy of 91.4% [95% CI 88.4%, 93.7%].

To understand how effectively the AI model generalizes to various hospitals, we trained the AI model on four of the hospitals and tested it on the fifth. We repeated this experiment five times, each time setting aside a different hospital as the test set (with the remaining four used as the training set). eFigure 2 in the Supplement shows the average accuracy of the MS-TCN for each experiment. The model attained an accuracy of 79% [95% CI 72.4%, 84.9%] on videos from hospital #1, 84% [95% CI 81.3%, 87.3%] on videos from hospital #2, 89% [95% CI 86.0%, 90.6%] on videos from hospital #3, and 87% [95% CI 76.4%, 94.3%] on videos from hospital #4. Using the four hospitals to train the AI model and testing it on the Cholec80 [12] dataset, the AI model reached an accuracy of 87% [95% CI 84.0%, 89.8%].
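This leave-one-institution-out protocol can be sketched as follows; `train_model` and `evaluate` are hypothetical placeholders standing in for the two-stage training and per-video accuracy computation described above, not functions from the paper:

```python
def leave_one_hospital_out(videos, hospitals, train_model, evaluate):
    """Train on all institutions except one, test on the held-out one, repeat."""
    results = {}
    for held_out in hospitals:
        train_set = [v for v in videos if v["hospital"] != held_out]
        test_set = [v for v in videos if v["hospital"] == held_out]
        model = train_model(train_set)               # hypothetical two-stage training
        results[held_out] = evaluate(model, test_set)  # mean per-video accuracy
    return results
```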

Discussion

In our study, we have presented an AI model to automate the task of phase recognition in LC. Our model successfully detected surgical phases with an overall accuracy of 89%, comparable to the average agreement between surgeon annotators (90%), including successful detection even in procedures with adverse events like major bleeding, major bile leakage, major duct injury, and gallbladder perforation.

The detection of surgical phases is more critical for certain phases than for others. For example, successful identification of the Calot triangle dissection phase, confirmation of the critical view of safety (CVS), and identification of the clipping and cutting phase are of utmost importance for patient safety, whereas misrecognition of the gallbladder extraction phase has a much lower impact on patient safety. As shown, our system was able to reach a very high accuracy (92%) in the Calot triangle dissection phase, which supports achievement of the CVS.

We also found that higher complexity levels of LC procedures were associated with both lower accuracy on the part of the AI model and lower inter-rater agreement between surgeons. On less complex LC videos, the AI model achieved an overall accuracy of 92%, equal to the inter-surgeon agreement score. By contrast, on complex LC videos, the annotators reached an average agreement score of 90%, compared to 81% by the AI model. Importantly, however, the accurate identification of the Calot triangle dissection phase was unaffected in complex videos (92%). Furthermore, the performance of the AI model remained high in the presence of adverse events, indicating an overall robustness to adverse events during LC procedures. As mentioned, we used LC videos from five hospitals, and, as expected, some variation in surgical technique and type of instruments used was noted. Interestingly, such variations did not interfere with the accuracy of the AI system in phase recognition, which reached 80–87% overall accuracy, reflecting the system’s flexibility and reliability.

Non-real-time use of such an AI system to analyze LC videos may provide valuable data to evaluate and track trainees’ surgical skill level over time, and even enable future studies into correlations between specific events occurring during a procedure and outcomes such as successful conclusion of the procedure. Similarly, the AI system can enable finer-grained analysis of the time taken for procedures, potentially providing insights that can augment systems which predict surgical duration and hence aid OR planning [28]. Further modifications to enable real-time incorporation of such a system into the laparoscopic video camera system are possible, by using no future information or only limited future information (e.g., using a few frames of future information is unlikely to cause appreciable latency). Such real-time use may play a role in active monitoring to improve patient safety, by providing the surgeon with indications of the successful conclusion of the various surgical phases and alerting if there might be potential issues with the surgical view or dissection plane. For instance, if the system were not able to satisfactorily recognize the CVS, an alert could be generated to prompt the surgeon to re-evaluate their perception of the anatomy before proceeding to the clipping and cutting phase (which is irreversible). Although the overall rates of complications and bile duct injury in LC are very low [29], such a system may improve safety in teaching departments where junior staff are undergoing training. Likewise, similar systems could aid real-time decision making, such as whether to proceed with laparoscopy, to change the surgical technique (i.e., retrograde dissection or subtotal cholecystectomy), to convert to open surgery, to drain only, or to abort the procedure. Extensions of such a system to other, more complex laparoscopic procedures, such as solid organ surgery, may also be useful.

AI has recently gained popularity in several medical fields, such as radiology, pathology, and gastroenterology [30,31,32,33]. However, unlike diagnostic radiology images, frames in surgical videos vary significantly more in image quality owing to movement during video capture, which renders AI analysis more challenging. In addition, anatomical structures and surgical planes are often hidden under fatty tissue and must be exposed before yielding a clear field of view for an AI system’s interpretation. Previous studies on AI for interpreting laparoscopic videos [12, 17, 34,35,36] have focused on identifying procedure phases and instruments. In a recent study using a large dataset of 1243 LC videos [13], the authors showed that AI performance improved significantly when the number of videos in the input dataset was increased from 50 to 745. Compared to prior work, our dataset made crucial use of videos representing real-world variability across anatomy, surgeon technique, operative tools, surgical complexity, and intraoperative complications. In particular, our study included often-encountered complex procedures such as those requiring retrograde dissection, conversion to an open procedure, and cholecystitis of varying severity.

This study has several limitations. The AI model was trained to recognize only the normal surgical phases (preparation, Calot triangle dissection, etc.) in videos which included adverse events. However, during an adverse event, the scene might not be related to the current surgical phase, which may have impacted the AI model’s performance in the presence of adverse events. Future work on adapting the AI model to additionally recognize adverse events may help improve performance. In a related vein, some adverse events were rare; for example, only one LC procedure contained a major bile leakage, so additional examples of rarer adverse events would be helpful both for training the model and for evaluating its ability to correctly identify such events. Another limitation relates to the non-real-time nature of the system, which does not allow it to be used to provide safety indications during the procedure. As noted above, future work will focus on training the network in such a way as to accommodate real-time operation.

In conclusion, this study presents an AI system for accurate recognition of predefined surgical phases in both uncomplicated and complex LC procedures. This study is a first step toward further development of an AI system for surgical skill assessment, efficient OR schedule planning, and, importantly, assisting the surgeon in avoiding technical errors, alerting them to imminent complications, and providing real-time information for better decision making.