Scalable gastroscopic video summarization via similar-inhibition dictionary selection. - Citegraph

Paper Info

Title
Scalable gastroscopic video summarization via similar-inhibition dictionary selection.

Abstract
HighlightsWe design a dictionary selection model via the similar-inhibition constraint.We propose a scalable gastroscopic video summarization algorithm.We build the first gastroscopic video summarization dataset with 30 videos. ObjectiveThis paper aims at developing an automated gastroscopic video summarization algorithm to assist clinicians to more effectively go through the abnormal contents of the video. Methods and materialsTo select the most representative frames from the original video sequence, we formulate the problem of gastroscopic video summarization as a dictionary selection issue. Different from the traditional dictionary selection methods, which take into account only the number and reconstruction ability of selected key frames, our model introduces the similar-inhibition constraint to reinforce the diversity of selected key frames. We calculate the attention cost by merging both gaze and content change into a prior cue to help select the frames with more high-level semantic information. Moreover, we adopt an image quality evaluation process to eliminate the interference of the poor quality images and a segmentation process to reduce the computational complexity. ResultsFor experiments, we build a new gastroscopic video dataset captured from 30 volunteers with more than 400k images and compare our method with the state-of-the-arts using the content consistency, index consistency and content-index consistency with the ground truth. Compared with all competitors, our method obtains the best results in 23 of 30 videos evaluated based on content consistency, 24 of 30 videos evaluated based on index consistency and all videos evaluated based on content-index consistency. ConclusionsFor gastroscopic video summarization, we propose an automated annotation method via similar-inhibition dictionary selection. Our model can achieve better performance compared with other state-of-the-art models and supplies more suitable key frames for diagnosis. The developed algorithm can be automatically adapted to various real applications, such as the training of young clinicians, computer-aided diagnosis or medical report generation.

Year	DOI	Venue
2016	10.1016/j.artmed.2015.08.006	Artificial Intelligence in Medicine
Keywords	Field	DocType
Video summarization,Key frame,Similar-inhibition dictionary selection,Image attention prior,Gastroscopic video	Data mining,Computer science,Image quality,Artificial intelligence,Key frame,Automatic summarization,Information retrieval,Segmentation,Ground truth,Data compression,Machine learning,Scalability,Computational complexity theory	Journal
Volume	Issue	ISSN
66	C	0933-3657
Citations	PageRank	References
5	0.40	48
Authors
7

Authors (7 rows)

Cited by (5 rows)

References (48 rows)

Name	Order	Citations	PageRank
Shuai Wang	1	24	2.98
Yang Cong	2	684	38.22
Jun Cao	3	23	1.05
Yunsheng Yang	4	20	3.95
Y. Tang	5	243	33.69
Huaici Zhao	6	14	4.91
Haibin Yu	7	201	25.62

1