Rohan Arora's picture

Rohan Arora

rohan-arora

·

AI & ML interests

None yet

Recent Activity

published an article 8 days ago

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

upvoted a paper 23 days ago

MCP-Cosmos: World Model-Augmented Agents for Complex Task Execution in MCP Environments

updated a dataset about 1 month ago

ibm-research/ITBench-Lite

View all activity

Organizations

published an article 8 days ago

Article

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

ibm-research

•

8 days ago

• 14

published an article 4 months ago

Article

IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST

ibm-research

•

Feb 18

• 19