Agentic Reasoning for Robust Vision Systems via Increased Test-Time Compute

Jan 15, 2025

Chung-en (Johnny) Yu

Brian Jalaian

Nathaniel D Bastian


Abstract

We propose the Visual Reasoning Agent (VRA), a training-free, agentic reasoning framework that achieves up to 40% absolute accuracy gains on challenging visual reasoning benchmarks. VRA leverages increased test-time compute through multi-step reasoning and tool use to enhance vision system robustness.

Type

Preprint

Publication

arXiv preprint arXiv:2509.16343

Last updated on Jan 23, 2026

Agentic AI, Deep Learning, AI Optimization
