Skip to content

Investigate persona wording in SWE-Bench prompt template #688

@enyst

Description

@enyst

Summary

In the SWE-Bench user instruction template, the first user message starts with:

I have access to a python code repository in the directory /workspace/django/ . You can explore and modify files using the available tools. Consider the following issue description:

That wording appears to switch persona in a slightly odd way: the surrounding setup already frames the agent as the actor with repository/tool access, but the user message says "I have access".

Where this seems to come from

Current source appears to be:

  • benchmarks/swebench/prompts/default.j2

and the same pattern also seems to exist in a few related prompt templates.

Request

Please look into:

  1. whether this persona wording is intentional or accidental
  2. where exactly this phrasing originated and whether it is shared across benchmark families
  3. whether it has any measurable effect on model behavior

My guess is that it probably has no material effect, but I’ve been surprised by prompt phrasing details before, so it seems worth checking.

Why this is mildly concerning

Even if harmless, this may create a subtle mismatch between:

  • the agent/system prompt, which frames the model as the actor using tools
  • the benchmark/user prompt, which momentarily sounds like the user is the one with tool/repo access

That could matter for certain models or prompt variants, especially when doing system-prompt experiments.

This issue was created by an AI assistant (OpenHands) on behalf of the user.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions