OpenAI launches new AI agent 'Operator', which can carry out tasks on its own
Obnews Tech Desk: OpenAI has recently launched its advanced AI agent 'Operator', which can carry out various tasks on the web on behalf of users. The agent views webpages through a browser and interacts with them by typing, clicking and scrolling. According to OpenAI, this is its first AI agent that can work independently, without needing a specific command for each step. It has been launched as a research preview and will be refined based on user feedback. For now, it is available only to ChatGPT Pro users in the US.
Where can 'Operator' be used?
'Operator' can handle many routine browser tasks, such as filling out forms, ordering groceries and creating memes. It uses the same interfaces and tools that people use every day, which makes it more broadly useful. This not only saves time but also gives businesses new opportunities to connect with customers.
Future plans
OpenAI aims to make 'Operator' available to Plus, Team and Enterprise users soon, and plans to integrate it into ChatGPT in the future. According to the company's blog, 'Operator' is based on a new model, Computer-Using Agent (CUA), which combines the vision capabilities of GPT-4o with advanced reasoning. It learns to interact effectively with graphical user interface (GUI) elements such as buttons, menus and text fields.
What makes 'Operator' special
'Operator' can "see" the browser through screenshots and "interact" with it through mouse and keyboard actions. Notably, it does not require any custom API integration. If it makes a mistake, it can correct itself using its reasoning abilities.
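For readers curious how such a "see, act, correct" cycle might look in code, here is a minimal sketch in Python. It is only an illustration, not OpenAI's implementation: the FakeBrowser class, the Action type and the rule-based choose_action function are all hypothetical stand-ins invented for this example, and the "screenshot" is a simple state summary rather than real pixels. The sketch shows an agent that types before focusing a field, notices the resulting error on its next observation, and then corrects itself.

```python
from dataclasses import dataclass, field


@dataclass
class Action:
    kind: str          # "click", "type", or "done"
    target: str = ""   # element name for clicks
    text: str = ""     # text for keyboard input


@dataclass
class FakeBrowser:
    """Hypothetical toy page: one 'name' text field and a 'submit' button."""
    fields: dict = field(default_factory=lambda: {"name": ""})
    focused: str = ""
    submitted: bool = False
    error: str = ""

    def screenshot(self) -> dict:
        # A real agent would receive pixels; this toy returns a state summary.
        return {"fields": dict(self.fields), "focused": self.focused,
                "submitted": self.submitted, "error": self.error}

    def apply(self, action: Action) -> None:
        self.error = ""
        if action.kind == "click":
            if action.target == "submit":
                if self.fields["name"]:
                    self.submitted = True
                else:
                    self.error = "name is required"
            elif action.target in self.fields:
                self.focused = action.target
            else:
                self.error = f"no such element: {action.target}"
        elif action.kind == "type":
            if self.focused:
                self.fields[self.focused] += action.text
            else:
                self.error = "nothing is focused"


def choose_action(obs: dict, name: str) -> Action:
    """Rule-based placeholder for the model's decision step (an assumption)."""
    if obs["submitted"]:
        return Action("done")
    if obs["fields"]["name"] == name:
        return Action("click", target="submit")
    if obs["error"] == "nothing is focused":
        # Self-correction: the last attempt failed, so focus the field first.
        return Action("click", target="name")
    return Action("type", text=name)


def run_agent(browser: FakeBrowser, name: str, max_steps: int = 10) -> list:
    """Observe, decide, act; repeat until done or the step budget runs out."""
    log = []
    for _ in range(max_steps):
        obs = browser.screenshot()
        action = choose_action(obs, name)
        if action.kind == "done":
            break
        browser.apply(action)
        log.append(action.kind + (":" + action.target if action.target else ""))
    return log


if __name__ == "__main__":
    browser = FakeBrowser()
    print(run_agent(browser, "Ada"), browser.submitted)
```

The point of the design is that the agent never calls a site-specific API: everything it knows comes from the observation, and everything it does is a click or a keystroke, which is why an error on one step can be noticed and fixed on the next.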