Wenhao Chai is currently a graduate student at University of Washington, with Information Processing Lab advised by Prof. Jenq-Neng Hwang. Previously, he was an undergradate student at Zhejiang University, with CVNext Lab advised by Prof. Gaoang Wang. He is fortunate to work with Prof. Christopher D Manning at Stanford University, and have worked with Prof. Saining Xie and Prof. Yilun Du. He has internship at Pika Labs and Microsoft Research Asia. His research primarily in large multimodal models (LMMs) for video understanding, embodied agent, and generative models. He has published related papers in top-tier conferences and journals such as CVPR, ICCV, ECCV, and AAAI. He has also organized workshops and tutorials at CVPR and AAAI, and served as a reviewer for NeurIPS, ICLR, ICML, CVPR, ECCV, and AISTATS. My current research focus on developing embodied AI agents inspired by cognitive science principles to interact with the physical world, building upon video understanding as a core perceptual mechanism. I propose a long-short term memory framework modeled after the human memory system, enabling pre-trained video LMMs to comprehend multi-hour video content without additional fine-tuning. To enhance efficiency, I introduce token merging, significantly reducing visual tokens with minimal performance degradation. I also demonstrate step-by-step agent system development in Minecraft, showcasing cognitive-inspired agent capabilities in virtual environments.
Florin-Alexandru Vasluianu, †. TimSeizinger, †. ZhuyunZhou, Zongwei Wu, †. CailianChen, R. Timofte, Wei Dong, Han Zhou, Yuqiong Tian, Jun Chen, Xueyang Fu, Xin Lu, Yurui Zhu, Xi Wang, Dong Li, Jie Xiao, Yunpeng Zhang, Zheng Zha, Zhao Zhang, Suiyi Zhao, Bomin Wang, Yan Luo, Yanyan Wei, Zhihao Zhao, Long Sun, Tingting Yang, Jin-Mei Pan, Jian-Ping Dong, Jinhui Tang, Bilel Benjdira, Mohammed Nassif, A. Koubâa, Ahmed Elhayek, Anas M. Ali, Kyotaro Tokoro, Kento Kawai, Kaname Yokoyama, Takuya Seno, Yuki Kondo, N. Ukita, LI, Bo Yang, Zhiqi Wu Gao, Chen Yihan, Sixiang Yu, Chen Kai Zhang, Tian Ye, Wenbin Zou, Yunlong Lin, Zhaohu Xing, Jinbin Bai, Wenhao Chai, Lei Zhu, Ritik Maheshwari, Rakshank Verma, Rahul Tekchandani Praful, Hambarde Satya, Narayan Tazi, Santosh Kumar, Vipparthi Subrahmanyam, Murala Jaeho, Lee Seongwan, Kim Sharif, Nodirkhuja Khujaev, Roman A. Tsoy, Fan Gao, Weidan Yan, Wenze Shao, Dengyin Zhang Bin, Chen Siqi, Yanxin Zhang, Yu Qian, Yuanbo Chen, Zhou Tong, Rongfeng Tong, Wei Ruiqi, Sun Yue Liu, Nikhil Akalwadi, Amogh Joshi, Sampada Malagi, Chaitra Desai, R. Tabib, U. Mudenagudi, Ali Murtaza, U. Khairuddin, Ahmad Athif, Mohd Faudzi, Adinath Dukre, Vivek Deshmukh, Shruti S. Phutke, Ashutosh Kulkarni, Vipparthi Anil Gonde, Subrahmanyam Murala, Arun karthik K Manasa, N. Shri, Hari Priya, Wei Hao, X. Yan, Minghan Fu, LVGroup Hfut, Ustc ShadowTitan, Zhi-Ming Wu, Gao Chen, Yi-Kang Yu, Sixiang Chen, Kai Zhang, Rahul Tekchandani, Praful Hambarde, S. Tazi, Jae-Hyeon Lee, Seongwan Kim, Finetuned MambaIR, Dengyin Zhang, Bin Chen, Siqi Zhang, Yanxin Qian, Yuanbin Chen, Yuanbo Zhou, Tong Tong, Rongfeng Wei, Ruiqi Sun, Yue Liu
2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2024
Tian Ye, Sixiang Chen, Wenhao Chai, Zhaohu Xing, Jing Qin, Ge Lin, Lei Zhu
Computer Vision and Pattern Recognition 2024
Haojia Cheng, Wenhao Chai, Jiabao Hu, Wenhao Ruan, Mingyu Shi, Hyunjun Kim, Yifan Cao, Y. Narazaki
Journal of Infrastructure Intelligence and Resilience 2024
Y. Narazaki, Wendong Pang, Gaoang Wang, Wenhao Chai
Journal of Bridge Engineering 2024
Tian Ye, Sixiang Chen, Yun Liu, Wenhao Chai, Jinbin Bai, Wenbin Zou, Yunchen Zhang, Mingchao Jiang, Erkang Chen, Chenghao Xue
ACM Multimedia 2023
Shidong Cao, Wenhao Chai, Shengyu Hao, Gaoang Wang
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2023
Yifan Cao, C. Tan, Wenzhuo Qian, Wenhao Chai, Luhang Cui, Wen-xiong Yang, Xinben Hu, Yongjian Zhu, Wenhui Zhou, Xingfa Shen
2022 IEEE International Conference on Unmanned Systems (ICUS) 2022
Zhongyu Jiang, Zhuoran Zhou, Lei Li, Wenhao Chai, Cheng-Yen Yang, Jenq-Neng Hwang
Wenhao Chai, Zhongyu Jiang, Jenq-Neng Hwang, Gaoang Wang
Ruizhe Chen, Xiaotian Zhang, Meng Luo, Wenhao Chai, Zuozhu Liu
arXiv.org 2024