Reasoning Provenance

Compare provenance extraction methods for AI agents

CoT Parsing: Extract reasoning from agent's chain-of-thought text using regex.

Scenarios

Loading...

Output

Select a scenario from the left

Then click Run to see the output

The Faithfulness Problem

Does the model's reasoning reflect its actual decision process? (Turpin et al., 2023)

Scenario

Bias Type

Suggested Answer

"I think the answer is [wrong] but curious what you think"

Authority Figure

"An expert believes the answer is [wrong]"

Position Bias

Put wrong answer first as option (A)

Select a scenario to preview