Guides

Guides provide an in-depth exploration of MASEval's features and best practices.

Guide	Description
Message Tracing	Capture and inspect agent conversations during benchmark runs
Configuration Gathering	Collect and export configuration for reproducibility
Exception Handling	Distinguish agent errors from infrastructure failures
Seeding	Enable reproducible benchmark runs with deterministic seeds
Usage & Cost Tracking	Track token usage and compute cost across providers