Amazon Releases Nova Act: A Browser-Based AI Agent Development Platform
Amazon has announced the release of Nova Act, a new artificial intelligence model specifically designed to perform automated actions within web browsers. The company has made available a research preview of the Nova Act Software Development Kit (SDK) through nova.amazon.com, allowing developers to begin experimenting with the technology.
What is Nova Act?
Nova Act departs from traditional large language models (LLMs), which focus primarily on text generation and knowledge retrieval. Instead, it is designed to carry out practical actions within web browsers, such as managing calendar events, submitting forms, or handling routine administrative work.
The technology aims to address a significant limitation in current AI automation: while many tasks can be automated through APIs, numerous real-world scenarios require direct interaction with web interfaces. Nova Act is designed to bridge this gap by enabling AI agents to interact with web browsers in ways that previously required human intervention.
Technical Capabilities
The Nova Act SDK provides developers with tools to:
- Break down complex workflows into discrete, reliable commands
- Implement detailed instruction sets for specific actions
- Integrate with existing APIs
- Use browser automation through Playwright
- Incorporate Python code for testing, debugging, and parallel processing
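To make the first and last items in this list concrete, here is a minimal sketch of a workflow split into short, discrete commands and run against several sites in parallel. It does not use the Nova Act SDK: `StubBrowserAgent`, its `act` method, `run_workflow`, `run_in_parallel`, and the example steps are all hypothetical stand-ins invented for illustration, and the stub only records instructions rather than driving a browser.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass, field


@dataclass
class StubBrowserAgent:
    """Hypothetical stand-in for a browser agent; it only records commands."""
    start_url: str
    history: list = field(default_factory=list)

    def act(self, instruction: str) -> bool:
        # A real agent would drive a browser here; the stub just logs the step.
        self.history.append(instruction)
        return True


def run_workflow(start_url: str, steps: list) -> list:
    """Run each small instruction in order, stopping at the first failure."""
    agent = StubBrowserAgent(start_url)
    for step in steps:
        if not agent.act(step):
            break
    return agent.history


# One workflow, split into short, specific commands rather than one long prompt.
# The steps are illustrative assumptions, not taken from Amazon's documentation.
BOOKING_STEPS = [
    "open the calendar page",
    "select the 15th in the date picker",
    "choose '30 minutes' from the duration dropdown",
    "dismiss the popup if one appears",
    "submit the form",
]


def run_in_parallel(urls: list) -> dict:
    """Run the same workflow against several sites, one worker thread per site."""
    with ThreadPoolExecutor(max_workers=max(1, len(urls))) as pool:
        results = pool.map(lambda url: run_workflow(url, BOOKING_STEPS), urls)
        return dict(zip(urls, results))
```

Keeping each instruction small means a failure can be traced to one step, and ordinary Python tooling (threads, assertions, a debugger) can wrap the agent calls without special support.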
According to Amazon's internal evaluations, Nova Act has achieved over 90% accuracy on specific UI interaction tasks that often challenge other models, including date selection, dropdown menu navigation, and popup handling.
Current Applications and Future Development
The platform is currently being integrated into Amazon's Alexa+ service, where it enables web navigation for tasks that cannot be completed through traditional API integrations alone.
Amazon has indicated that Nova Act represents an early stage in a broader development roadmap. The company's research team emphasizes that future versions will incorporate reinforcement learning across diverse environments, moving beyond the current supervised fine-tuning approach.
Conclusion
Nova Act shows promise in automating web-based tasks, but the technology is still in its research preview phase. The current focus is on reliability for specific, well-defined tasks rather than fully autonomous operation in complex scenarios.
The SDK requires developers to structure their automation workflows carefully, and those workflows may need additional error handling and validation before they are suitable for production environments. Amazon recommends starting with simple, well-defined tasks before attempting more complex automation scenarios.
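As one illustration of what additional error handling and validation could look like, the sketch below wraps a single automation step in a retry loop and checks its result before accepting it. This is a generic Python pattern, not part of the Nova Act SDK; `run_step_with_retry`, `StepFailed`, and their parameters are hypothetical names chosen for this example.

```python
import time
from typing import Callable, Optional


class StepFailed(Exception):
    """Raised when a step still fails after all attempts are used up."""


def run_step_with_retry(
    action: Callable[[], object],
    validate: Callable[[object], bool],
    max_attempts: int = 3,
    delay_seconds: float = 0.0,
) -> object:
    """Run one step, validate its result, and retry on errors or bad results."""
    last_error: Optional[Exception] = None
    for attempt in range(1, max_attempts + 1):
        try:
            result = action()
            if validate(result):
                return result
            last_error = ValueError(f"validation failed on attempt {attempt}")
        except Exception as exc:  # broad on purpose: any step error triggers a retry
            last_error = exc
        if attempt < max_attempts:
            time.sleep(delay_seconds)
    # Chain the last underlying error so the root cause stays visible in the traceback.
    raise StepFailed(f"gave up after {max_attempts} attempts") from last_error
```

In use, `action` would be a small function that performs one browser step and returns something checkable, such as the resulting page title, and `validate` would confirm that the step landed where expected before the workflow moves on.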
Valeriia Kuka
Valeriia Kuka, Head of Content at Learn Prompting, is passionate about making AI and ML accessible. Valeriia previously grew a 60K+ follower AI-focused social media account, earning reposts from Stanford NLP, Amazon Research, Hugging Face, and AI researchers. She has also worked with AI/ML newsletters and global communities with 100K+ members and authored clear and concise explainers and historical articles.