  • 0.0.0.0/0
  • Shanghai China
  • UTC +08:00

Highlights

  • Pro

CSfufu/README.md
  • 👋 Hi, I’m @CSfufu
  • I am currently focused on VLM agentic reasoning and reinforcement learning.

Pinned

  1. hiyouga/EasyR1 Public

    EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

    Python 5k 374

  2. Revisual-R1 Public

    [ICLR 2026]🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement learning, and text-only reinforcement l…

    Python 213 3

  3. verl-project/verl Public

    verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

    Python 21.5k 3.9k

  4. Osilly/Vision-DeepResearch Public

    [ICML 2026] Multimodal deep-research MLLM and benchmark. The first long-horizon multimodal deep-research MLLM, extending the number of reasoning turns to dozens and the number of search-engine inte…

    Python 635 56

  5. rllm-org/rllm Public

    Democratizing Reinforcement Learning for LLMs

    Python 5.6k 568

  6. shawn0728/OpenSearch-VL Public

    🔍 OpenSearch-VL provides a fully open recipe for training strong multimodal deep search agents through high-quality data curation, diverse visual/search tools, and fatal-aware agentic reinforcement…

    Python 194 17