TopRatedTech

Tech News, Gadget Reviews, and Product Analysis for Affiliate Marketing

TopRatedTech

Tech News, Gadget Reviews, and Product Analysis for Affiliate Marketing

OpenAI pushes AI agent capabilities with new developer API

Builders utilizing the Responses API can entry the identical fashions that energy ChatGPT Search: GPT-4o search and GPT-4o mini search. These fashions can browse the web to reply questions and cite sources of their responses.

That is notable as a result of OpenAI says the added net search means dramatically improves the factual accuracy of its AI fashions. On OpenAI’s SimpleQA benchmark, which goals to measure confabulation fee, GPT-4o search scored 90 %, whereas GPT-4o mini search achieved 88 %—each considerably outperforming the bigger GPT-4.5 mannequin with out search, which scored 63 %.

Regardless of these enhancements, the know-how nonetheless has vital limitations. Apart from points with CUA correctly navigating web sites, the improved search functionality does not utterly clear up the issue of AI confabulations, with GPT-4o search nonetheless making factual errors 10 % of the time.

Alongside the Responses API, OpenAI launched the open supply Agents SDK, offering builders free instruments to combine fashions with inside programs, implement safeguards, and monitor agent actions. This toolkit follows OpenAI’s earlier launch of Swarm, a framework for orchestrating a number of brokers.

These are nonetheless early days within the AI agent area, and issues will possible enhance quickly. Nonetheless, in the meanwhile, the AI agent motion stays weak to unrealistic claims, as demonstrated earlier this week when users discovered that Chinese language startup Butterfly Impact’s Manus AI agent platform did not ship on a lot of its guarantees, highlighting the persistent hole between promotional claims and sensible performance on this rising know-how class.

Source link

OpenAI pushes AI agent capabilities with new developer API

Leave a Reply

Your email address will not be published. Required fields are marked *

Scroll to top