
Benchmarking Voice and Text Agents for Enterprise Workflows
Authors are listed in alphabetical order. Introduction As enterprises adopt AI assistants, evaluating how well these agents handle real-world tasks — especially through voice interfaces — has become crucial. Traditional benchmarks largely focus on general conversational abilities or narrow tool-use scenarios, leaving a gap in














