Scaffold for agent capability benchmark harness (skills-qng9): - docs/specs/scenario-schema.md: YAML schema for test scenarios - tests/scenarios/: Easy, medium, hard example scenarios - tests/fixtures/: Python fixtures for testing Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
24 lines
337 B
Markdown
24 lines
337 B
Markdown
# Python Math Library Fixture
|
|
|
|
Simple Python project for testing agent capability on basic tasks.
|
|
|
|
## Structure
|
|
|
|
```
|
|
src/
|
|
math_utils.py # Math utility functions
|
|
tests/
|
|
test_math_utils.py
|
|
```
|
|
|
|
## Usage
|
|
|
|
```bash
|
|
# Run tests
|
|
pytest tests/ -v
|
|
```
|
|
|
|
## Scenarios Using This Fixture
|
|
|
|
- `easy/add-factorial.yaml` - Add factorial function
|