# Flask User API Fixture

Flask application with user management for testing agent capability on medium/hard tasks.

## Structure

```
src/
  __init__.py            # App factory
  cache.py               # Cache utilities
  routes/
    users.py             # User endpoints
  models/
    user.py              # User model
tests/
  test_users.py          # User endpoint tests
  stress_test_cache.py   # Concurrency stress test
```

## Scenarios Using This Fixture

- `medium/add-caching-to-api.yaml` - Add caching to user lookup
- `hard/fix-race-condition.yaml` - Debug cache race condition

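As a rough idea of what "add caching to user lookup" can involve, here is a minimal, standard-library-only sketch of a time-to-live cache placed in front of a slow lookup. It is not taken from this fixture: `USERS`, `load_user`, `TTLCache`, and `get_or_load` are hypothetical names chosen for illustration, and the real `src/cache.py` may look quite different.

```python
import time
from typing import Callable, Dict, Optional, Tuple

# Hypothetical stand-in for the fixture's user store; not the real model.
USERS = {1: {"id": 1, "name": "alice"}, 2: {"id": 2, "name": "bob"}}
db_calls = 0  # counts how often the slow path runs


def load_user(user_id: int) -> Optional[dict]:
    """Pretend database lookup (hypothetical)."""
    global db_calls
    db_calls += 1
    return USERS.get(user_id)


class TTLCache:
    """Tiny single-threaded time-to-live cache (illustrative only)."""

    def __init__(self, ttl_seconds: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._ttl = ttl_seconds
        self._clock = clock
        # Maps key -> (expiry timestamp, cached value).
        self._entries: Dict[int, Tuple[float, Optional[dict]]] = {}

    def get_or_load(self, key: int,
                    loader: Callable[[int], Optional[dict]]) -> Optional[dict]:
        now = self._clock()
        hit = self._entries.get(key)
        if hit is not None and hit[0] > now:
            return hit[1]  # fresh entry: skip the loader
        value = loader(key)  # miss or expired: take the slow path
        self._entries[key] = (now + self._ttl, value)
        return value


cache = TTLCache(ttl_seconds=60.0)
first = cache.get_or_load(1, load_user)   # miss: calls load_user
second = cache.get_or_load(1, load_user)  # hit: served from the cache
```

This sketch is deliberately not thread-safe: the lookup and the store in `get_or_load` are separate steps, which is fine in a single thread but matters once requests are served concurrently.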
## Notes

This fixture intentionally contains a subtle race condition for the hard scenario.
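This README does not describe the fixture's actual bug, so the sketch below only illustrates one common kind of cache race (check-then-act) and a lock-based guard against it. All names here (`LockedCache`, `get_or_compute`, `stress`) are hypothetical and are not the fixture's code or its stress test.

```python
import threading
from typing import Callable, Dict


class LockedCache:
    """Get-or-compute cache guarded by a single lock (illustrative only)."""

    def __init__(self) -> None:
        self._data: Dict[str, int] = {}
        self._lock = threading.Lock()

    def get_or_compute(self, key: str, compute: Callable[[], int]) -> int:
        # Without the lock, two threads can both observe a miss and both
        # run compute(); holding the lock makes check and store one step.
        with self._lock:
            if key not in self._data:
                self._data[key] = compute()
            return self._data[key]


def stress(cache: LockedCache, workers: int = 16) -> int:
    """Hit one key from many threads; return how often compute() ran."""
    calls = []
    calls_lock = threading.Lock()
    barrier = threading.Barrier(workers)  # release all workers together

    def compute() -> int:
        with calls_lock:
            calls.append(1)
        return 42

    def worker() -> None:
        barrier.wait()
        cache.get_or_compute("user:1", compute)

    threads = [threading.Thread(target=worker) for _ in range(workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return len(calls)


compute_calls = stress(LockedCache())
```

With the lock in place, `compute_calls` is 1 regardless of thread scheduling. Holding the lock while computing serializes all misses, which is a simple but coarse choice; per-key locks are a common refinement when computations are slow.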