
Claude 3.7 Sonnet with extended thinking is asked, “Would the colour be known as ‘magenta’ if the city of Magenta did not exist?”
Credit:
Benj Edwards
Interestingly, xAI’s Grok 3 with “thinking” (its SR mode) enabled was the first model that definitively gave us a “no” and not an “it’s not likely” to the magenta question. Claude 3.7 Sonnet with extended thinking also impressed us with our second-ever firm “no,” followed by an explanation.
In another informal test, we asked 3.7 Sonnet with extended thinking to compose five original dad jokes. We’ve found in the past that our old prompt, “write 5 original dad jokes,” was not specific enough and always resulted in canned dad jokes pulled straight from training data, so we asked, “Compose 5 original dad jokes that aren’t found anywhere in the world.”

Claude 3.7 Sonnet with extended thinking is asked, “Compose 5 original dad jokes that aren’t found anywhere in the world.”
Credit:
Benj Edwards
Claude made some attempts at crafting original jokes, although we’ll let you judge whether they’re funny or not. We’ll likely put 3.7 Sonnet’s SR capabilities to the test more exhaustively in a future article.
Anthropic’s first agent: Claude Code
So far, 2025 has been the year of both SR models (like R1 and o3) and agentic AI tools (like OpenAI’s Operator and Deep Research). Not to be left out, Anthropic has announced its first agentic tool, Claude Code.
Claude Code is an autonomous coding assistant that operates directly from a console terminal. It allows Claude to search through codebases, read and edit files, write and run tests, commit and push code to GitHub repositories, and execute command line tools while keeping developers informed throughout the process.
Anthropic also intends Claude Code to be used as an assistant for debugging and refactoring tasks. The company claims that in internal testing, Claude Code completed tasks in a single session that would typically require 45-plus minutes of manual work.
Claude Code is currently available only as a “limited research preview,” with Anthropic stating it plans to improve the tool based on user feedback over time. Meanwhile, Claude 3.7 Sonnet is now available through the Claude website, the Claude app, the Anthropic API, Amazon Bedrock, and Google Cloud’s Vertex AI.