Title | ||
---|---|---|
U-Netplus: A Modified Encoder-Decoder U-Net Architecture For Semantic And Instance Segmentation Of Surgical Instruments From Laparoscopic Images |
Abstract | ||
---|---|---|
With the advent of robot-assisted surgery, there has been a paradigm shift in medical technology for minimally invasive surgery. However, it is very challenging to track the position of the surgical instruments in a surgical scene, and accurate detection & identification of surgical tools is paramount. Deep learning-based semantic segmentation in frames of surgery videos has the potential to facilitate this task. In this work, we modify the U-Net architecture by introducing a pre-trained encoder and re-design the decoder part, by replacing the transposed convolution operation with an upsampling operation based on nearest-neighbor (NN) interpolation. To further improve performance, we also employ a very fast and flexible data augmentation technique. We trained the framework on 8 x 225 frame sequences of robotic surgical videos available through the MICCAI 2017 EndoVis Challenge dataset and tested it on 8 x 75 frame and 2 x 300 frame videos. Using our U-NetPlus architecture, we report a 90.20% DICE for binary segmentation, 76.26% DICE for instrument part segmentation, and 46.07% for instrument type (i.e., all instruments) segmentation, outperforming the results of previous techniques implemented and tested on these data. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1109/EMBC.2019.8856791 | 2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC) |
Field | DocType | Volume |
Computer vision,Convolution,Computer science,Segmentation,Interpolation,Image segmentation,Artificial intelligence,Encoder,Deep learning,Dice,Upsampling | Conference | 2019 |
ISSN | Citations | PageRank |
1557-170X | 1 | 0.36 |
References | Authors | |
0 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
S. M. Kamrul Hasan | 1 | 1 | 0.36 |
Cristian A. Linte | 2 | 93 | 24.09 |