1. School of Automation and Electrical Engineering, Shenyang Ligong University, Shenyang 110159, China;2. Innovation Center for Smart Medical Technologies & Devices, Binjiang Institute of Zhejiang University, Hangzhou 310053, China;3.School of Computer and Mathematical Sciences, University of Adelaide, Adelaide 5000, Australia;4. College of Computer Science and Technology, Zhejiang University, Hangzhou 310058, China;5.College of Biomedical Engineering & Instrument Science, Zhejiang University, Hangzhou 310058, China
GUO Yuping, GAO Hongwei, YU Jiahui, GE Jinchao, HAN Meng, JU Zhaojie. Video action recognition meets vision-language models exploring human factors in scene interaction:a review[J]. Optoelectronics Letters,2025,(10):626-640
Copy
