CSfufu

Follow

凪 CSfufu

Follow

neither here nor there

60 followers · 51 following

0.0.0.0/0
Shanghai China
21:05 (UTC +08:00)

Achievements

Achievements

Highlights

Pro

CSfufu/README.md

👋 Hi, I’m @CSfufu
I am currently focus on VLM Agentic reasoning and Reinforcement Learning.

Pinned Loading

hiyouga/EasyR1 hiyouga/EasyR1 Public

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 5k 374
Revisual-R1 Revisual-R1 Public

[ICLR 2026]🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement learning, and text-only reinforcement l…

Python 213 3
verl-project/verl verl-project/verl Public

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21.5k 3.9k
Osilly/Vision-DeepResearch Osilly/Vision-DeepResearch Public

[ICML 2026] Multimodal deep-research MLLM and benchmark. The first long-horizon multimodal deep-research MLLM, extending the number of reasoning turns to dozens and the number of search-engine inte…

Python 635 56
rllm-org/rllm rllm-org/rllm Public

Democratizing Reinforcement Learning for LLMs

Python 5.6k 568
shawn0728/OpenSearch-VL shawn0728/OpenSearch-VL Public

🔍 OpenSearch-VL provides a fully open recipe for training strong multimodal deep search agents through high-quality data curation, diverse visual/search tools, and fatal-aware agentic reinforcement…

Python 194 17