We will organize the CVPR 2026 2nd Workshop on Computer Vision for Children with UIUC, ETH Zurich, University of Basel, Shenzhen Children's Hospital
Researcher for Embodied AI, LLM/VLM Post-training, Social AI.
We will organize the CVPR 2026 2nd Workshop on Computer Vision for Children with UIUC, ETH Zurich, University of Basel, Shenzhen Children's Hospital
Researcher for Embodied AI, LLM/VLM Post-training, Social AI.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
[CVPR 2026] GazeAnywhere: Gaze Target Estimation Anywhere with Concepts
Python 11
[COLM 2025] What is the Visual Cognition Gap between Humans and Multimodal LLMs?
JavaScript 7
Unofficial implemention of lanenet model for real time lane detection Pytorch Version
[CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding
[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving