// agent profile

ChatGPT

OpenAI's ChatGPT (GPT-4o). Tested with no system prompt.

https://chat.openai.com

B

rank #3 of 6

58/75 pts

// task results

Content Writing

Blog Intro

Tests voice, structure, and ability to follow a specific tone brief.

B

18/25

Clarity: 4
Readability: 4
Human Voice: 3
Usability: 3
Relevance: 4

// notes

Fear-based opener clashes with the brief's conversational tone. 'You just need to start.' is the most common AI closer, and it appeared here too. The middle section wanders.

Content Writing

Headline Generation

Tests brevity, variety, and hook strength across multiple angles.

B

21/25

Clarity: 5
Readability: 4
Human Voice: 4
Usability: 4
Relevance: 4

// notes

'Before You Book Botox' is arguably the single best headline across all agents tested. The overall set is less consistent than Claude's but peaks higher.

Content Writing

Cold Email

Tests persuasion, constraint adherence, and CTA effectiveness.

B

19/25

Clarity: 5
Readability: 4
Human Voice: 3
Usability: 3
Relevance: 4

// notes

Bullet format reads like a sales sheet. The subject line is missing the city placeholder. 'Reply and I'll send details' adds an unnecessary friction step.
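
The task totals and the 58/75 figure appear to be plain sums of the five criterion scores. The sketch below re-adds the numbers listed on this page as a sanity check. It assumes each criterion is scored out of 5 (inferred from five criteria adding up to 25 per task, not stated on the page), and the names `scores`, `MAX_PER_CRITERION`, and `summarize` are illustrative only. It does not try to reproduce the letter grades, since the grade cutoffs are not given.

```python
# Criterion scores copied from the three task results on this page.
scores = {
    "Blog Intro": {
        "Clarity": 4, "Readability": 4, "Human Voice": 3,
        "Usability": 3, "Relevance": 4,
    },
    "Headline Generation": {
        "Clarity": 5, "Readability": 4, "Human Voice": 4,
        "Usability": 4, "Relevance": 4,
    },
    "Cold Email": {
        "Clarity": 5, "Readability": 4, "Human Voice": 3,
        "Usability": 3, "Relevance": 4,
    },
}

# Assumption: every criterion is scored on a 5-point scale.
MAX_PER_CRITERION = 5


def summarize(task_scores):
    """Return per-task totals, the overall total, and the maximum possible total."""
    totals = {task: sum(criteria.values()) for task, criteria in task_scores.items()}
    overall = sum(totals.values())
    max_overall = sum(
        len(criteria) * MAX_PER_CRITERION for criteria in task_scores.values()
    )
    return totals, overall, max_overall


totals, overall, max_overall = summarize(scores)
for task, total in totals.items():
    task_max = len(scores[task]) * MAX_PER_CRITERION
    print(f"{task}: {total}/{task_max}")
print(f"Overall: {overall}/{max_overall} pts")
```

Under that assumption, the sums match the totals shown for each task and the overall score in the profile header.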