IrohXu/README.md

Hi there 👋

📫 Research Update

We are organizing the 2nd Workshop on Computer Vision for Children at CVPR 2026, together with UIUC, ETH Zurich, the University of Basel, and Shenzhen Children's Hospital.

⚡️ A quick introduction

Researcher in embodied AI, LLM/VLM post-training, and social AI.

🤝🏻 Connect, Follow, Subscribe

Twitter
LinkedIn
Email: xucao [at] pediamed [dot] ai

Pinned

  1. huggingface/diffusers Public

    🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

    Python · 33k stars · 6.8k forks

  2. GazeAnywhere Public

    [CVPR 2026] GazeAnywhere: Gaze Target Estimation Anywhere with Concepts

    Python 11

  3. PediaMedAI/Cognition-MLLM Public

    [COLM 2025] What is the Visual Cognition Gap between Humans and Multimodal LLMs?

    JavaScript 7

  4. lanenet-lane-detection-pytorch Public

    Unofficial PyTorch implementation of the LaneNet model for real-time lane detection

    Python · 170 stars · 43 forks

  5. LLVM-AD/MAPLM Public

    [CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding

    Python · 164 stars · 3 forks

  6. Awesome-Multimodal-LLM-Autonomous-Driving Public

    [WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving

    308 stars · 13 forks