Shaimaa Toriah, Atef Ghalwash, Aliaa Youssif



Semantic-Based Video Retrieval Survey

pdf PDF


There is a tremendous growth of digital data due to the stunning progress of digital devices which facilitates capturing them. Digital data include image, text, and video. Video represents a rich source of information. So, there is an urgent need to retrieve, organize, and automate videos. Video retrieval is a vital process in multimedia applications such as video search engines, digital museums, video-on-demand broadcasting. In this paper, the different approaches of video retrieval are clearly and briefly categorized. Moreover, the different methods which try to bridge the semantic gap in video retrieval are discussed in more details.


Semantic Video Retrieval,Concept Detectors,Context Based Concept Fusion


[1] Yusuf Aytar, Mubarak Shah, and Jiebo Luo. Utilizing semantic word similarity measures for video retrieval. 26th IEEE Conference on Computer Vision and Pattern Recognition, CVPR, 2008.

[2] L. Ballan, M. Bertini, A. Del Bimbo, and G. Serra. Video annotation and retrieval using ontologies and rule learning. IEEE MultiMedia, 17(4):80– 88, Oct 2010.

[3] Marco Bertini, Alberto Del Bimbo, and Giuseppe Serra. Learning ontology rules for semantic video annotation. Proc of ACM International Conference on Multimedia Many Faces of Multimedia Semantics MS, pages 1–8, 2008.

[4] Video Classification, Jianping Fan, Hangzai Luo, Yuli Gao, and Ramesh Jain. Incorporating Concept Ontology for Hierarchical. 9(5):939–957, 2007.

[5] Ieee International Conference. WHICH THOUSAND WORDS ARE WORTH A PICTURE ? EXPERIMENTS ON VIDEO RETRIEVAL USING A THOUSAND CONCEPTS Wei-Hao Lin and Alexander Hauptmann Language Technologies Institute School of Computer Science Carnegie Mellon University. 2006.

[6] Nizar Elleuch, Mohamed Zarka, Anis Ben Ammar, and Adel M. Alimi. A fuzzy ontology: Based framework for reasoning in visual video content analysis and indexing. In Proceedings of the Eleventh International Workshop on Multimedia Data Mining, MDMKDD ’11, pages 1:1–1:8, New York, NY, USA, 2011. ACM.

[7] Jie Geng, Zhenjiang Miao, and Hai Chi. Temporal-Spatial Refinements for Video Concept Fusion, pages 547–559. Springer Berlin Heidelberg, Berlin, Heidelberg, 2013.

[8] Fei Guo. Indoor Outdoor Image Classification. Computer, 2011.

[9] A Hauptmann, R Yan, W-H Lin, M Christel, and H Wactlar. Can high level concepts fill the semantic gap in video retrieval? A case study with broadcast news. IEEE Transactions on Multimedia, 9(5):958–966, 2007.

[10] Alexander Hauptmann and Wei-hao Lin. How many highlevel concepts will fill the semantic gap in video retrieval ? pages 627–634, 2007.

[11] W Hu, N Xie, L Li, X Zeng, and S Maybank. A Survey on Visual Content-Based Video Indexing and Retrieval. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 41(6):797–819, nov 2011.

[12] Chung-Lin Huang, Huang-Chia Shih, and Chung-Yuan Chao. Semantic analysis of soccer video using dynamic Bayesian network. IEEE Transactions on Multimedia, 8(4):749–760, aug 2006.

[13] Yg Jiang, Yg Jiang, a Yanagawa, a Yanagawa, Sf Chang, Sf Chang, Cw Ngo, and Cw Ngo. CU-VIREO374: fusing Columbia374 and VIREO374 for large scale semantic concept . . . . Columbia University ADVENT Technical Report, 2008.

[14] Yu-gang Jiang, Qi Dai, Jun Wang, and Chong-wah Ngo. Fast Semantic Diffusion for Large-Scale Context-Based Image and Video Annotation. 21(6):3080–3091, 2012.

[15] Ken Hao Liu, Ming Fang Weng, Chi Yao Tseng, Yung Yu Chuang, and Ming Syan Chen. Association and temporal rule mining for postfiltering of semantic concept detection in video. IEEE Transactions on Multimedia, 10(2):240–251, 2008.

[16] M Naphade, I Kozintsev, T Huang, and K Ramchandran. A factor graph framework for semantic indexing and retrieval in video. In 2000 Proceedings Workshop on Content-based Access of Image and Video Libraries, pages 35–39, 2000.

[17] Milind Naphade, John R. Smith, Jelena Tesic, Shih Fu Chang, Winston Hsu, Lyndon Kennedy, Alexander Hauptmann, and Jon Curtis. Largescale concept ontology for multimedia. IEEE Multimedia, 13(3):86–91, 2006.

[18] J R Smith and Shih-Fu Chang. Visually searching the Web for content. IEEE MultiMedia, 4(3):12–20, 1997.

[19] Cees G. M. Snoek and Marcel Worring. Concept-Based Video Retrieval. Foundations and Trends® in Information Retrieval, 2(4):215–322, 2007.

[20] D W Tjondronegoro and Y P P Chen. Knowledge-Discounted Event Detection in Sports Video. IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans, 40(5):1009–1024, sep 2010.

[21] Xiao-Yong Wei, Yu-Gang Jiang, and Chong-Wah Ngo. Exploring interconcept relationship with context space for semantic video indexing. Proceeding of the ACM International Conference on Image and Video Retrieval, pages 15:1—-15:8, 2009.

[22] Xiao Yong Wei, Yu Gang Jiang, and Chong Wah Ngo. Concept-driven multi-modality fusion for video search. IEEE Transactions on Circuits and Systems for Video Technology, 21(1):62–73, 2011.

[23] Xiao-yong Wei, Chong-wah Ngo, and Yu-gang Jiang. Selection of Concept Detectors for Video Search by Ontology-Enriched Semantic Spaces. pages 1–12.

[24] L Xie, S F Chang, A Divakaran, and H Sun. Structure analysis of soccer video with hidden Markov models. In 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing, volume 4, pages IV–4096–IV–4099, may 2002.

[25] Jun Yang and Alexander G Hauptmann. Cross-Domain Video Concept Detection Using Adaptive SVMs.

[26] Ting Yao, Chong-wah Ngo, and Shiai Zhu. Predicting Domain Adaptivity : Redo or Recycle ? pages 5–8, 2012.

[27] Zheng-Jun Zha, Tao Mei, Zengfu Wang, and Xian-Sheng Hua. Building a comprehensive ontology to refine video concept detection. In Proceedings of the International Workshop on Workshop on Multimedia Information Retrieval, MIR ’07, pages 227–236, New York, NY, USA, 2007. ACM.

Cite this paper

Shaimaa Toriah, Atef Ghalwash, Aliaa Youssif. (2018) Semantic-Based Video Retrieval Survey. International Journal of Computers, 3, 85-90


Copyright © 2018 Author(s) retain the copyright of this article.
This article is published under the terms of the Creative Commons Attribution License 4.0