Amazon launches Nova Act: the new AI agent that autonomously navigates the web
The new technology is a core feature of the upcoming Alexa+ update
3 min read
Amazon enters the arena of agent-based artificial intelligence with the launch of Nova Act, a sophisticated multipurpose AI agent designed to autonomously interact with web browsers in performing everyday tasks. The innovative technology represents a significant step forward in the Seattle-based company's strategy to gain market share in this emerging and competitive technology sector where OpenAI's Operator and Anthropic's Computer Use are already present.
The Nova Act project and the AGI Lab
Nova Act is the first public product of the AGI Lab in San Francisco, an initiative led by former OpenAI researchers David Luan and Pieter Abbeel. Both had already founded their own startups - Luan created Adept, while Abbeel co-founded Covariant - before being hired by Amazon last year to lead its efforts in the field of AI agents.
David Luan explained that he designed the Nova Act SDK, a toolkit for prototyping agents, with the aim of reliably automating short and simple tasks. The tools provided allow developers to precisely define when human intervention is required within an agent workflow. The hope is that this will enable the realisation of more reliable, though not completely autonomous, agentic applications. Developers can access these tools through the nova.amazon.com portal, which also serves as a showcase for Amazon's entire range of Nova models.
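The idea of declaring where a human must step in can be illustrated with a minimal Python sketch. It does not use the Nova Act SDK or reproduce its API: the `Step` class, the `needs_human` flag, and the `run_workflow`, `act` and `approve` names are all hypothetical, chosen only to show a workflow of short tasks with an explicit approval checkpoint.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Step:
    """One short task in an agent workflow (hypothetical structure)."""
    description: str
    needs_human: bool = False  # hypothetical flag: pause for approval before acting


def run_workflow(
    steps: List[Step],
    act: Callable[[Step], str],
    approve: Callable[[Step], bool],
) -> List[str]:
    """Run steps in order; stop at the first flagged step a human rejects."""
    log: List[str] = []
    for step in steps:
        # Ask for approval only where the developer marked the step.
        if step.needs_human and not approve(step):
            log.append(f"stopped: {step.description}")
            break
        log.append(act(step))
    return log


if __name__ == "__main__":
    steps = [
        Step("search for flights"),
        Step("pay for ticket", needs_human=True),
        Step("download receipt"),
    ]
    # Stand-ins for a real browser action and a real approval prompt.
    for line in run_workflow(
        steps,
        act=lambda s: f"done: {s.description}",
        approve=lambda s: False,
    ):
        print(line)
```

In this sketch the agent completes the search on its own, then halts at the payment step because the stand-in reviewer declines, which matches the goal of reliable rather than fully autonomous behaviour.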
Advanced features and performance
Nova Act's suite of tools includes advanced capabilities that enable AI agents to navigate complex web pages, fill out forms with relevant data, operate interactive elements such as calendars or date pickers, and even understand the visual context of website user interfaces, a task that has historically hampered the effectiveness of automated agents.
Amazon claims that Nova Act outperformed competing OpenAI and Anthropic agents in various internal tests. For example, in the ScreenSpot Web Text test - which evaluates the interaction of an AI agent with text displayed on the screen - Nova Act scored 94 per cent, outperforming OpenAI's CUA (88 per cent) and Anthropic's Claude 3.7 Sonnet (90 per cent). It should be noted, however, that Amazon did not use more popular benchmarks, such as WebVoyager, for evaluating agents.
