Scaffold for agent capability benchmark harness (skills-qng9): - docs/specs/scenario-schema.md: YAML schema for test scenarios - tests/scenarios/: Easy, medium, hard example scenarios - tests/fixtures/: Python fixtures for testing Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> |
||
|---|---|---|
| .. | ||
| src | ||
| tests | ||
| README.md | ||
Python Math Library Fixture
Simple Python project for testing agent capability on basic tasks.
Structure
src/
math_utils.py # Math utility functions
tests/
test_math_utils.py
Usage
# Run tests
pytest tests/ -v
Scenarios Using This Fixture
easy/add-factorial.yaml- Add factorial function