- 👋 Hi, I’m @CSfufu
- I am currently focus on VLM Agentic reasoning and Reinforcement Learning.
【次の交差点でお会いします、よろしくお願いします】
-
Zhejiang University
- Shanghai China
-
19:53
(UTC +08:00)
Highlights
- Pro
Pinned Loading
-
XiaoYee/Awesome_Efficient_LRM_Reasoning
XiaoYee/Awesome_Efficient_LRM_Reasoning Public😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond
-
Revisual-R1
Revisual-R1 Public🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement learning, and text-only reinforcement learning—to …
-
hiyouga/EasyR1
hiyouga/EasyR1 PublicEasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
-
shawn0728/ARES
shawn0728/ARES Public[ICLR 2026]🌴 ARES is an open-source framework for adaptive multimodal reasoning, featuring a two-stage pipeline—Adaptive Cold-Start and Entropy-Shaped Policy Optimization—to balance reasoning depth…
-
verl-project/verl
verl-project/verl Publicverl: Volcano Engine Reinforcement Learning for LLMs
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

