OpenAI introduces Operator, their first web-based AI agent

0
16
OpenAI introduces Operator, their first web-based AI agent
OpenAI introduces Operator, their first web-based AI agent

Operator, a new AI agent from OpenAI, can browse the web and carry out tasks for the user. According to the business, “it can view a webpage and interact with it by typing, clicking, and scrolling using its own browser.” It’s interesting to note that this is the first autonomous AI agent from OpenAI. In this instance, it functions effectively as an intermediary that may accomplish activities while connecting with the internet. Since OpenAI has just published a research preview, it is subject to criticism and has limits. For now, ChatGPT Pro members in the US may access this version.

Operator and its operations

Numerous repetitive browser tasks, including filling out forms, placing grocery orders, or even making memes, may be assigned to the Operator. AI broadens its useful applications by utilizing the same tools and interfaces that humans use on a daily basis. This allows businesses to interact with consumers in new ways while also saving people time on repetitive chores.

In the future, OpenAI intends to include these features into ChatGPT and extend them to Plus, Team, and Enterprise users.

In the blog, OpenAI states, “Operator is powered by a new model called Computer-Using Agent (CUA). Combining GPT-4o’s vision capabilities with advanced reasoning through reinforcement learning, CUA is trained to interact with graphical user interfaces (GUIs)—the buttons, menus, and text fields people see on a screen.“

Without the requirement for specialized API integrations, the operator may “see” the browser through screenshots and “interact” with it using common mouse and keyboard commands. Is it superior, though, in terms of hallucinations? According to OpenAI, the Operator may utilize its reasoning skills to repair itself if it encounters difficulties or mistakes. It smoothly returns control to the user in the event that it hits a standstill and needs assistance, guaranteeing a smooth and cooperative experience.

By starting new conversations, users may have Operator do many jobs at once, much as when they utilize multiple tabs on a browser.

o3-mini to come to free users

CEO Sam Altman said on X (previously Twitter) that OpenAI has chosen to launch o3-mini in the free tier, just after the firm unveiled Operator, its first AI agent.

The o3-mini is a major improvement over the o1-mini because it incorporates improved reasoning skills that allow for sequential logical analysis. The day of light for this AI model has not yet arrived. Altman said it might be introduced in two weeks.

Also read: Viksit Workforce for a Viksit Bharat

Do Follow: The Mainstream formerly known as CIO News LinkedIn Account | The Mainstream formerly known as CIO News Facebook | The Mainstream formerly known as CIO News Youtube | The Mainstream formerly known as CIO News Twitter

About us:

The Mainstream formerly known as CIO News is a premier platform dedicated to delivering latest news, updates, and insights from the tech industry. With its strong foundation of intellectual property and thought leadership, the platform is well-positioned to stay ahead of the curve and lead conversations about how technology shapes our world. From its early days as CIO News to its rebranding as The Mainstream on November 28, 2024, it has been expanding its global reach, targeting key markets in the Middle East & Africa, ASEAN, the USA, and the UK. The Mainstream is a vision to put technology at the center of every conversation, inspiring professionals and organizations to embrace the future of tech.