Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining Paper • 2605.14747 • Published 23 days ago • 145
OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models Paper • 2604.10866 • Published Apr 13 • 67
P-Aligner: Enabling Pre-Alignment of Language Models via Principled Instruction Synthesis Paper • 2508.04626 • Published Aug 6, 2025
Mitigating Overthinking through Reasoning Shaping Paper • 2510.09535 • Published Oct 10, 2025 • 5 • 3