bd sync: 2026-01-15 14:54:20

2026-01-15 14:54:20 -08:00 · 2026-01-15 14:54:20 -08:00 · 5a80055ed8
parent bccf7514ce
commit 5a80055ed8
2 changed files with 2 additions and 2 deletions
--- a/.beads/issues.jsonl
+++ b/.beads/issues.jsonl
@ -145,7 +145,7 @@
 {"id":"skills-dtk","title":"Consolidate isStale() and staleLevel() logic","description":"[SMELL] MED state.nim:214-223,225-241 - Both compute heartbeat age with overlapping checks. Have isStale() call staleLevel() \\!= 'ok', or extract shared ageStatus() helper.","status":"closed","priority":2,"issue_type":"task","created_at":"2026-01-10T19:49:51.730374503-08:00","created_by":"dan","updated_at":"2026-01-11T15:38:00.466085606-08:00","closed_at":"2026-01-11T15:38:00.466085606-08:00","close_reason":"Closed"}
 {"id":"skills-du0a","title":"HQ: Structured communication templates","description":"**Raised by:** gpt (primary), gemini\n\n**Problem:**\nBD comments as message bus lacks structure. Questions/answers get buried. No threading. No way to signal \"requires HQ decision\" vs \"FYI\". No attachments/artifacts.\n\n**gpt:**\n\u003e \"Require periodic worker 'state summaries' in a fixed schema (Summary / Risks / Next / Questions / Diffstat). Have HQ post a running 'Decision Log' comment for traceability.\"\n\n**Suggested templates:**\n\n**HQ kickoff comment:**\n- Scope (in/out)\n- Acceptance criteria bullets\n- Required checks (tests/lint/typecheck)\n- Risk areas / files\n- \"Ask before changing X\"\n\n**Worker update (every heartbeat):**\n- What changed\n- What's next\n- Blockers/questions\n- Commands run + results\n- Link to diff/commits\n\n**Suggested fixes:**\n1. Kickoff template posted by HQ\n2. Periodic state summary schema for workers\n3. Decision log comment (edited/superseding)\n4. \"Latest instruction\" pinning","status":"open","priority":2,"issue_type":"task","created_at":"2026-01-12T09:23:26.910596266-08:00","created_by":"dan","updated_at":"2026-01-12T09:23:26.910596266-08:00"}
 {"id":"skills-e8h","title":"Investigate waybar + niri integration improvements","description":"Look into waybar configuration and niri compositor integration.\n\nPotential areas:\n- Waybar modules for niri workspaces\n- Status indicators\n- Integration with existing niri-window-capture skill\n- Custom scripts in pkgs/waybar-scripts\n\nRelated: dotfiles has home/waybar.nix (196 lines) and pkgs/waybar-scripts/","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-28T20:11:23.115445797-05:00","created_by":"dan","updated_at":"2025-12-28T20:37:16.465731945-05:00","closed_at":"2025-12-28T20:37:16.465731945-05:00","close_reason":"Moved to dotfiles repo - waybar config lives there"}
-{"id":"skills-e94u","title":"ui-query: extract shared find_windows() to common module","status":"open","priority":2,"issue_type":"task","owner":"dan@delpad","created_at":"2026-01-15T14:22:03.79081457-08:00","created_by":"dan","updated_at":"2026-01-15T14:22:03.79081457-08:00"}
+{"id":"skills-e94u","title":"ui-query: extract shared find_windows() to common module","status":"closed","priority":2,"issue_type":"task","owner":"dan@delpad","created_at":"2026-01-15T14:22:03.79081457-08:00","created_by":"dan","updated_at":"2026-01-15T14:54:16.135426883-08:00","closed_at":"2026-01-15T14:54:16.135426883-08:00","close_reason":"Extracted to common.py, imported in get-text.py and find-element.py"}
 {"id":"skills-e96","title":"skill: semantic-grep using LSP","description":"Use workspace/symbol, documentSymbol, and references instead of ripgrep.\n\nExample: 'Find all places where we handle User objects but only where we modify the email field directly'\n- LSP references finds all User usages\n- Filter by AST analysis for .email assignments\n- Return hit list for bead or further processing\n\nBetter than regex for Go interfaces, Rust traits, TS types.","status":"open","priority":3,"issue_type":"task","created_at":"2025-12-24T02:29:57.119983837-05:00","updated_at":"2025-12-24T02:29:57.119983837-05:00","dependencies":[{"issue_id":"skills-e96","depends_on_id":"skills-gga","type":"blocks","created_at":"2025-12-24T02:30:06.632906383-05:00","created_by":"daemon"}]}
 {"id":"skills-ebh","title":"Compare bd-issue-tracking skill files with upstream","description":"Fetch upstream beads skill files and compare with our condensed versions to identify differences","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-03T20:14:07.886535859-08:00","updated_at":"2025-12-03T20:19:37.579815337-08:00","closed_at":"2025-12-03T20:19:37.579815337-08:00"}
 {"id":"skills-ebl","title":"Benchmark vision model UI understanding","description":"## Goal\nMeasure how well vision models can answer UI questions from screenshots.\n\n## Test cases\n1. **Element location**: \"Where is the Save button?\" → coordinates\n2. **Element identification**: \"What buttons are visible?\" → list\n3. **State detection**: \"Is the checkbox checked?\" → boolean\n4. **Text extraction**: \"What does the error message say?\" → text\n5. **Layout understanding**: \"What's in the sidebar?\" → structure\n\n## Metrics\n- Accuracy: Does the answer match ground truth?\n- Precision: How close are coordinates to actual element centers?\n- Latency: Time from query to response\n- Cost: Tokens consumed per query\n\n## Prompt engineering questions\n- Does adding a grid overlay help coordinate precision?\n- What prompt format gives most actionable coordinates?\n- Can we get bounding boxes vs point coordinates?\n\n## Comparison baseline\n- Manual annotation of test screenshots\n- AT-SPI data (once enabled) as ground truth\n\n## Depends on\n- Test screenshots from real apps\n- Ground truth annotations","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-17T14:13:10.038933798-08:00","updated_at":"2025-12-29T15:26:19.822655148-05:00","closed_at":"2025-12-29T15:26:19.822655148-05:00","close_reason":"Benchmark complete. Vision models excellent for semantic understanding, approximate for coordinates. Recommend hybrid AT-SPI + vision. See docs/research/vision-ui-benchmark-2025-12-29.md"}
--- a/.beads/sync_base.jsonl
+++ b/.beads/sync_base.jsonl
@ -67,7 +67,7 @@
 {"id":"skills-6jw","title":"spec-review: Add severity labeling to prompts and reviews","description":"Reviews produce flat lists mixing blockers with minor nits. Hard to make decisions.\n\nAdd to prompts:\n- Require severity labels: Blocker / High / Medium / Low\n- Sort output by severity\n- Include impact and likelihood for each issue","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-15T00:23:23.334156366-08:00","updated_at":"2025-12-15T13:00:32.678573181-08:00","closed_at":"2025-12-15T13:00:32.678573181-08:00"}
 {"id":"skills-6n4u","title":"Spike: manually test Claude in worker context","description":"Simple experiment - no harness, no automation.\n\n## Goal\nLearn what happens when Claude tries to complete a task using the worker system.\n\n## Steps\n1. Set up a simple fixture (python-math-lib exists)\n2. worker spawn test-spike\n3. cd into worktree\n4. Invoke Claude with task: \"Add a factorial function to src/math_utils.py\"\n5. Include system context about worker commands (start, done, heartbeat)\n6. Watch what happens\n\n## Observe\n- Does Claude understand the worker flow?\n- Does it run worker start / worker done correctly?\n- Does it complete the task?\n- How many turns? What goes wrong?\n- What prompting helps?\n\n## Output\nNotes on what worked, what didn't, what's needed.\nInform whether we need the full harness or something simpler.","status":"closed","priority":2,"issue_type":"task","created_at":"2026-01-11T16:38:49.022489161-08:00","created_by":"dan","updated_at":"2026-01-11T19:15:47.037731412-08:00","closed_at":"2026-01-11T19:15:47.037731412-08:00","close_reason":"Spike completed successfully.\n\nObservations:\n1. Worker flow works: spawn → start → work → done → approve → merge\n2. Found and fixed bug: relative worktree paths (skills-5ry7)\n3. Claude (me) completed the task using worker CLI without special prompting\n4. ~7 tool calls to complete simple task\n5. Test script needed sqlite3 in PATH - added check\n\nThe worker system is functional for basic agent tasks."}
 {"id":"skills-6ppz","title":"HQ: Dependency deadlocks and task scoping","description":"**Raised by:** flash-or, gpt\n\n**Problem:**\nReal-world coding is rarely perfectly modular. Worker A adds DB field, Worker B needs that field. bd ready is too simple - doesn't detect partial dependencies or cross-cutting changes.\n\n**flash-or:**\n> \"Deadlocks. 'bd ready' is too simple a check. Implement a 'Blocks/Blocked By' metadata in bd. HQ needs to be able to pause Worker B's spawn until Worker A's merge is complete.\"\n\n**gpt:**\n> \"How to split a too-big issue into smaller ones (and who does it). How to handle cross-cutting changes that touch multiple tasks (and merge ordering). Add a 'triage & slicing' step before spawning.\"\n\n**Suggested fixes:**\n1. Better dependency modeling in bd (partial deps)\n2. Triage & slicing step before spawn\n3. \"mutex areas\" for shared modules/config\n4. Conflict prediction (recent churn/hot files)\n5. Architectural guardrails for risky areas","status":"open","priority":3,"issue_type":"task","created_at":"2026-01-12T09:25:49.315978432-08:00","created_by":"dan","updated_at":"2026-01-12T09:25:49.315978432-08:00"}
-{"id":"skills-6qfe","title":"ui-query: fix argument quoting in wrapper script","status":"open","priority":3,"issue_type":"bug","owner":"dan@delpad","created_at":"2026-01-15T14:22:04.428356834-08:00","created_by":"dan","updated_at":"2026-01-15T14:22:04.428356834-08:00"}
+{"id":"skills-6qfe","title":"ui-query: fix argument quoting in wrapper script","status":"closed","priority":3,"issue_type":"bug","owner":"dan@delpad","created_at":"2026-01-15T14:22:04.428356834-08:00","created_by":"dan","updated_at":"2026-01-15T14:23:42.834190743-08:00","closed_at":"2026-01-15T14:23:42.834190743-08:00","close_reason":"Fixed: use printf %q for proper argument escaping"}
 {"id":"skills-6x1","title":"Epic: emes plugin architecture alignment","description":"Convert skills to emes-style plugin architecture for portability across Claude Code, Gemini, and VPS deployment (ops-jrz1).\n\n**emes tools (evil-mind-evil-sword org):**\n- tissue: Git-native issue tracking (machine-first)\n- idle: Quality gate (blocks exit until reviewer approves)\n- jwz: Async messaging with identity/git context\n- marketplace: Plugin distribution registry\n\n**Conversion work:**\n1. Add .claude-plugin/plugin.json to each skill\n2. Restructure: SKILL.md → skills/<name>.md (auto-discovery)\n3. Add hooks/ where applicable (quality gates)\n4. Create marketplace.json registry\n5. Test with ops-jrz1 deployment\n\n**Key principles from emes:**\n- Pull context on-demand (not big upfront injections)\n- Mechanical enforcement via hooks (not prompts)\n- References over inline content\n- Machine-first interfaces (JSON output)\n\n**Candidates for conversion:**\n- orch (simple CLI wrapper)\n- worklog (scripts + templates)\n- code-review (has lenses, might want hooks)\n- ops-review (same pattern)","status":"closed","priority":2,"issue_type":"epic","created_at":"2026-01-09T10:59:12.291560832-08:00","created_by":"dan","updated_at":"2026-01-09T17:14:41.429380141-08:00","closed_at":"2026-01-09T17:14:41.429380141-08:00","close_reason":"Dual-publish complete. Ongoing cross-agent work continues under skills-hf1"}
 {"id":"skills-73yu","title":"Validate taskId to prevent path traversal and command injection","description":"[SECURITY] HIGH git.nim:36,37,53,59,89 - taskId used unsanitized in branch names and file paths. If taskId contains '../' or shell metacharacters, could escape worktree dir. Validate taskId matches safe pattern (alphanumeric + dash/underscore only).","status":"closed","priority":1,"issue_type":"bug","created_at":"2026-01-10T19:52:13.24918965-08:00","created_by":"dan","updated_at":"2026-01-10T20:32:28.374723485-08:00","closed_at":"2026-01-10T20:32:28.374723485-08:00","close_reason":"Created utils.nim with common helpers"}
 {"id":"skills-7a00","title":"Add LLM-as-judge verification","description":"Use an LLM to evaluate task completion quality.\n\nComponents:\n- Judge prompt template\n- Rubric format (list of criteria)\n- Scoring mechanism (0-1 per criterion, aggregate)\n- Model selection (haiku for cost, sonnet for quality)\n\nShould take: task description, rubric, code diff/result\nShould output: score, reasoning, pass/fail per criterion","status":"closed","priority":3,"issue_type":"task","created_at":"2026-01-11T16:19:45.933759107-08:00","created_by":"dan","updated_at":"2026-01-11T16:38:26.564395054-08:00","closed_at":"2026-01-11T16:38:26.564395054-08:00","close_reason":"Pausing - need to validate approach with simpler spike first","dependencies":[{"issue_id":"skills-7a00","depends_on_id":"skills-y0p0","type":"blocks","created_at":"2026-01-11T16:20:20.700175136-08:00","created_by":"dan"}]}