Scaffold for agent capability benchmark harness (skills-qng9):
- docs/specs/scenario-schema.md: YAML schema for test scenarios
- tests/scenarios/: Easy, medium, hard example scenarios
- tests/fixtures/: Python fixtures for testing

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Flask User API Fixture
Flask application with user management for testing agent capability on medium/hard tasks.
Structure
src/
    __init__.py             # App factory
    cache.py                # Cache utilities
    routes/
        users.py            # User endpoints
    models/
        user.py             # User model
tests/
    test_users.py           # User endpoint tests
    stress_test_cache.py    # Concurrency stress test
Scenarios Using This Fixture
- medium/add-caching-to-api.yaml - Add caching to user lookup
- hard/fix-race-condition.yaml - Debug cache race condition
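For orientation, here is a minimal sketch of the kind of change the medium scenario asks for: a read-through cache in front of a user lookup. Everything in it is an assumption made for illustration. `TTLCache`, `get_or_load`, `invalidate`, and `load_user` are hypothetical names, not the contents of this fixture's cache.py, and the scenario may expect a different design.

```python
import time
from typing import Callable, Dict, Tuple


class TTLCache:
    """Tiny read-through cache with per-entry expiry (hypothetical, illustrative only)."""

    def __init__(self, ttl_seconds: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._ttl = ttl_seconds
        self._clock = clock
        # Maps user_id -> (expiry timestamp, cached value).
        self._entries: Dict[int, Tuple[float, dict]] = {}

    def get_or_load(self, user_id: int, loader: Callable[[int], dict]) -> dict:
        now = self._clock()
        entry = self._entries.get(user_id)
        if entry is not None and entry[0] > now:
            return entry[1]  # fresh hit: skip the loader
        value = loader(user_id)  # miss or expired: fetch and store
        self._entries[user_id] = (now + self._ttl, value)
        return value

    def invalidate(self, user_id: int) -> None:
        # Call after an update or delete so stale data is not served.
        self._entries.pop(user_id, None)


calls = []


def load_user(user_id: int) -> dict:
    """Stand-in for a database lookup; records each call it receives."""
    calls.append(user_id)
    return {"id": user_id, "name": f"user-{user_id}"}


cache = TTLCache(ttl_seconds=30.0)
first = cache.get_or_load(1, load_user)
second = cache.get_or_load(1, load_user)  # served from the cache
```

This sketch does no locking, so it is only suitable as a single-threaded illustration of the read-through pattern.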
Notes
This fixture intentionally contains a subtle race condition for the hard scenario.
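
As background, the sketch below shows one common kind of cache race, a check-then-act race in which two threads both miss and both load the same key. It is a generic illustration under assumed names (`UnsafeCache`, `LockedCache`, `count_loads` are all hypothetical) and is not a description of the specific defect planted in this fixture. A `threading.Barrier` inside the loader forces the bad interleaving so the result is repeatable.

```python
import threading
import time
from typing import Callable, Dict


class UnsafeCache:
    """Check and store are separate steps, so two threads can both miss."""

    def __init__(self) -> None:
        self._data: Dict[int, dict] = {}

    def get_or_load(self, key: int, loader: Callable[[int], dict]) -> dict:
        if key not in self._data:          # check
            self._data[key] = loader(key)  # act (another thread may be here too)
        return self._data[key]


class LockedCache:
    """Same logic, but the check and the store happen under one lock."""

    def __init__(self) -> None:
        self._data: Dict[int, dict] = {}
        self._lock = threading.Lock()

    def get_or_load(self, key: int, loader: Callable[[int], dict]) -> dict:
        with self._lock:
            if key not in self._data:
                self._data[key] = loader(key)
            return self._data[key]


def count_loads(cache, loader_hook: Callable[[], object]) -> int:
    """Run two concurrent lookups of key 1 and return how often the loader ran."""
    loads = []

    def loader(key: int) -> dict:
        loads.append(key)
        loader_hook()  # simulates slow I/O inside the loader
        return {"id": key}

    threads = [threading.Thread(target=cache.get_or_load, args=(1, loader))
               for _ in range(2)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return len(loads)


# The barrier releases only once both threads are inside the loader,
# which means both of them must have seen a cache miss.
barrier = threading.Barrier(2)
unsafe_loads = count_loads(UnsafeCache(), barrier.wait)

# With the lock, the second thread waits and then finds the cached value.
locked_loads = count_loads(LockedCache(), lambda: time.sleep(0.05))
```

Holding one lock across the load is the simplest fix but serializes every lookup; per-key locks are a common refinement when loads are slow.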