Augmenting Human Cognition with Al Agents that Use Computers, Prof. Yu Su, LOVE Workshop @ CVPR'25
Joya Chen
Augmenting Human Cognition with Al Agents that Use Computers, Prof. Yu Su, LOVE Workshop @ CVPR'25
28:59
Training Multimodal Agents for Real Robot Manipulation at Scale, Dr. Karl Pertsch, LOVE@CVPR'25
Joya Chen
Training Multimodal Agents for Real Robot Manipulation at Scale, Dr. Karl Pertsch, LOVE@CVPR'25
27:00
Towards Grounded Reasoning in Multimodal Agents, Prof. Katerina Fragkiadaki, LOVE Workshop @ CVPR'25
Joya Chen
Towards Grounded Reasoning in Multimodal Agents, Prof. Katerina Fragkiadaki, LOVE Workshop @ CVPR'25
20:27
Multimodal Video Models for Robot Learning, Prof. Michael S. Ryoo, Multimodal Video Agent @ CVPR'25
Joya Chen
Multimodal Video Models for Robot Learning, Prof. Michael S. Ryoo, Multimodal Video Agent @ CVPR'25
24:33
Accelerating LLM and Generative AI | Prof. Song Han | Multimodal Video Agent Workshop @ CVPR'25
Joya Chen
Accelerating LLM and Generative AI | Prof. Song Han | Multimodal Video Agent Workshop @ CVPR'25
27:45
LiveCC Real-Time Video Commentary
Joya Chen
LiveCC Real-Time Video Commentary
2:08
Dr. Chunyuan Li's Talk on LOng-form VidEo Understanding (LOVEU) Workshop @ CVPR'24
Joya Chen
Dr. Chunyuan Li's Talk on LOng-form VidEo Understanding (LOVEU) Workshop @ CVPR'24
54:00
Prof. Dima Damen's Talk on on LOng-form VidEo Understanding (LOVEU) Workshop @ CVPR'24
Joya Chen
Prof. Dima Damen's Talk on on LOng-form VidEo Understanding (LOVEU) Workshop @ CVPR'24
45:45
Track 2A: Text-Guided Video Editing & Track 2B: Text-to-Video Generation on LOVEU Workshop @ CVPR'24
Joya Chen
Track 2A: Text-Guided Video Editing & Track 2B: Text-to-Video Generation on LOVEU Workshop @ CVPR'24
40:28
Track1: Long-Term Video QA on LOng-form VidEo Understanding (LOVEU) Workshop @ CVPR'24
Joya Chen
Track1: Long-Term Video QA on LOng-form VidEo Understanding (LOVEU) Workshop @ CVPR'24
35:38
Prof. Marc Pollefeys' Talk on LOng-form VidEo Understanding (LOVEU) Workshop @ CVPR'24
Joya Chen
Prof. Marc Pollefeys' Talk on LOng-form VidEo Understanding (LOVEU) Workshop @ CVPR'24
34:37
VideoLLM-online: Online Video Large Language Model for Streaming Video (with English ChatTTS)
Joya Chen
VideoLLM-online: Online Video Large Language Model for Streaming Video (with English ChatTTS)
1:13
Affordance Grounding from Demonstration Video to Target Image | CVPR 2023
Joya Chen
Affordance Grounding from Demonstration Video to Target Image | CVPR 2023
8:02
LOVEU@CVPR'22 Track3 Winner Talk
Joya Chen
LOVEU@CVPR'22 Track3 Winner Talk
3:57