Opera has unveiled Browser Operator, an integrated AI agent that carries out tasks for users directly inside the browser.
Rather than being a separate tool, Browser Operator works as an extension of the browser itself. It lets users automate repetitive tasks such as purchasing items, filling out online forms, and compiling web content.
Unlike AI integrations that send sensitive data to third-party servers, Browser Operator performs all of its tasks locally within the Opera browser.
Opera showed a demo video in which Browser Operator helps a user buy socks, a task with many steps. Instead of browsing product pages or manually typing in bank or credit card details, the user simply hands this part of the buying process over to Browser Operator and is free to focus on more important aspects of life, such as family.
Using natural language processing through Opera’s AI Composer Engine, Browser Operator interprets a user’s written command and carries out the corresponding set of actions within the browser. All activity takes place locally on the user’s device, through the browser’s own engine, so that commands are executed securely, completely, and quickly.
Whenever the process requires a sensitive action, such as entering payment information or approving an order, Browser Operator pauses and requests input from the user. The user can also step in and take over the process at any time.
Every step that Browser Operator takes is recorded and open to inspection, so users always know how their tasks are being carried out. If something goes wrong, for example an incorrect order is placed, the user can give the AI agent further commands to fix the problem, such as cancelling the order or modifying a form.
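The pause-and-approve behaviour and the inspectable step log described above can be pictured with a small sketch. This is not Opera's implementation: the names `Step`, `TaskRun`, `SENSITIVE_KINDS`, and the `approve` callback are hypothetical, chosen only to illustrate how an agent might stop before a sensitive action and keep a record of each step.

```python
from dataclasses import dataclass, field
from typing import Callable, List

# Hypothetical action kinds that should never run without the user's approval.
SENSITIVE_KINDS = {"enter_payment", "confirm_order"}


@dataclass
class Step:
    kind: str    # e.g. "click", "fill_form", "enter_payment"
    detail: str  # human-readable description recorded in the log


@dataclass
class TaskRun:
    log: List[str] = field(default_factory=list)

    def execute(self, steps: List[Step], approve: Callable[[Step], bool]) -> bool:
        """Run steps in order, pausing for approval on sensitive ones.

        Returns True if every step ran, False if the user declined one.
        """
        for step in steps:
            if step.kind in SENSITIVE_KINDS and not approve(step):
                self.log.append(f"STOPPED before {step.kind}: {step.detail}")
                return False
            # A real agent would act on the page here; this sketch only records the step.
            self.log.append(f"DONE {step.kind}: {step.detail}")
        return True


steps = [
    Step("click", "open product page"),
    Step("fill_form", "select size and quantity"),
    Step("enter_payment", "fill in card details"),
]

run = TaskRun()
# Simulate a user who declines the payment step.
completed = run.execute(steps, approve=lambda step: False)
for entry in run.log:
    print(entry)
```

In this sketch the first two steps run unattended, while the payment step is handed to the `approve` callback; declining it stops the task and leaves a log entry explaining where it stopped.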
The key differentiators: Privacy, performance, and precision
What sets Browser Operator apart is a localized, privacy-first architecture. Competing tools, by contrast, typically need screenshots or video recordings to understand the content of a web page.
This design gives it several advantages:
- Faster completion of tasks: rather than relying on screenshots or video of the visible view, Browser Operator accesses the page’s HTML elements directly through the DOM. This avoids unnecessary overhead and lets it see the whole page at once.
- Greater privacy: since all operations take place within the browser, user data (e.g. logins, cookies, and browsing history) never leaves the local device. There are no screenshots, keystrokes, or personal information sent back to Opera’s servers.
- Greater precision: the AI can interact with page elements even when they are hidden from the user’s view, for example behind cookie popups or verification dialogs, so it can reach web content without interruption.
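To illustrate why reading page structure differs from reading pixels, here is a minimal sketch using Python's standard `html.parser` module. It is only an analogy for DOM-based access, not Browser Operator's actual mechanism; the sample markup in `PAGE` and the class `InteractiveElementFinder` are made up for this example.

```python
from html.parser import HTMLParser

# Illustrative page: a cookie banner would visually cover the checkout button,
# but both elements are still present in the markup.
PAGE = """
<div id="cookie-banner" style="position:fixed; z-index:999">
  <button id="accept-cookies">Accept</button>
</div>
<form id="checkout">
  <input name="email" type="email">
  <button id="buy-now" type="submit">Buy now</button>
</form>
"""


class InteractiveElementFinder(HTMLParser):
    """Collect buttons and inputs from the markup, regardless of what is drawn on top."""

    def __init__(self):
        super().__init__()
        self.found = []

    def handle_starttag(self, tag, attrs):
        if tag in ("button", "input"):
            attributes = dict(attrs)
            # Label the element by its id, falling back to its name.
            self.found.append((tag, attributes.get("id") or attributes.get("name")))


finder = InteractiveElementFinder()
finder.feed(PAGE)
print(finder.found)
```

A screenshot of this page would show the banner sitting over the form, whereas the structural view lists the banner button, the email field, and the buy button side by side.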
By allowing the browser to perform tasks autonomously, Opera is taking the next big step towards “agentic” browsers, which serve not only as tools for accessing the web but also as facilitators that enhance productivity.