refactor(orch): replace Anthropic models with Qwen/DeepSeek

- Remove claude/sonnet/opus/haiku aliases
- Add qwen (qwen3-8b), deepseek (v3), r1 (deepseek-r1)
- Add Model Selection guidance section
- Update GPT aliases to gpt-5.1, flash to preview version

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
This commit is contained in:
dan 2025-12-01 19:08:32 -08:00
parent 24679f6cd6
commit 54a51ecdea
2 changed files with 43 additions and 22 deletions

View file

@ -15,13 +15,13 @@ Enables agents to query multiple AI models simultaneously and aggregate their pe
```bash
# Get consensus on a decision
orch consensus "Should we use SQLite or PostgreSQL for this use case?" flash gemini claude
cd ~/proj/orch && uv run orch consensus "Should we use SQLite or PostgreSQL for this use case?" flash gemini claude
# Devil's advocate
orch consensus "Should we adopt microservices?" gpt5:for claude:against flash:neutral
cd ~/proj/orch && uv run orch consensus "Should we adopt microservices?" gpt5:for claude:against flash:neutral
# Code review with file context
orch consensus "Is this approach correct?" flash gemini --file src/handler.py --mode critique
cd ~/proj/orch && uv run orch consensus "Is this approach correct?" flash gemini --file src/handler.py --mode critique
```
## Files

View file

@ -20,6 +20,16 @@ Invoke this skill when:
- You want to stress-test an approach with opposing viewpoints
- Complex tradeoffs where different perspectives would help
## Invocation
The `orch` CLI must be run from its project directory:
```bash
cd ~/proj/orch && uv run orch <command> [args...]
```
All examples below use this pattern.
## Commands
### orch consensus
@ -27,18 +37,30 @@ Invoke this skill when:
Query multiple models for their verdict on a question.
```bash
orch consensus "PROMPT" MODEL1 MODEL2 [MODEL3...]
cd ~/proj/orch && uv run orch consensus "PROMPT" MODEL1 MODEL2 [MODEL3...]
```
**Model Aliases** (use these):
- `flash` → gemini-2.5-flash (fast, cheap)
- `flash` → gemini-2.5-flash-preview (fast, cheap)
- `gemini` → gemini-3-pro-preview (strong reasoning)
- `claude` → claude-sonnet-4.5 (balanced)
- `haiku` → claude-haiku-4.5 (fast, cheap)
- `opus` → claude-opus-4.1 (strongest)
- `gpt` / `gpt5` → gpt-5 (strong reasoning)
- `qwen` → qwen3-8b (fast, cheap)
- `deepseek` → deepseek-v3 (balanced)
- `r1` → deepseek-r1 (strongest reasoning)
- `gpt` / `gpt5` → gpt-5.1 (strong reasoning)
- `gpt4` → gpt-4o (legacy)
## Model Selection
**Quick sanity check**: Use `flash qwen` for fast, cheap validation. Good for "am I missing something obvious?" checks.
**Standard consensus**: Use `flash gemini deepseek` for balanced perspectives across providers. Default for most decisions.
**Deep analysis**: Include `r1` or `gpt` when stakes are high or reasoning is complex. These models think longer but cost more.
**Diverse viewpoints**: Mix providers (Google + DeepSeek + OpenAI) rather than multiple models from one provider. Different training leads to genuinely different perspectives.
**Cost-conscious**: `flash` and `qwen` are 10-20x cheaper than premium models. Start cheap, escalate if needed.
**Options**:
- `--mode vote` (default) - Models give Support/Oppose/Neutral verdict
- `--mode brainstorm` - Generate ideas without judgment
@ -51,14 +73,14 @@ orch consensus "PROMPT" MODEL1 MODEL2 [MODEL3...]
**Stances** (devil's advocate):
Append `:for`, `:against`, or `:neutral` to bias a model's perspective:
```bash
orch consensus "Should we rewrite in Rust?" gpt5:for claude:against gemini:neutral
cd ~/proj/orch && uv run orch consensus "Should we rewrite in Rust?" gpt5:for deepseek:against gemini:neutral
```
### orch chat
Single-model conversation (when you don't need consensus):
```bash
orch chat "MESSAGE" --model gemini
cd ~/proj/orch && uv run orch chat "MESSAGE" --model gemini
```
## Usage Patterns
@ -66,37 +88,37 @@ orch chat "MESSAGE" --model gemini
### Quick Second Opinion
When you've reasoned through something and want validation:
```bash
orch consensus "I think we should use SQLite for this because [reasons]. Is this sound?" flash gemini
cd ~/proj/orch && uv run orch consensus "I think we should use SQLite for this because [reasons]. Is this sound?" flash gemini
```
### Architecture Decision
When facing a tradeoff:
```bash
orch consensus "Microservices vs monolith for a 3-person team building an e-commerce site?" flash gemini claude --mode vote
cd ~/proj/orch && uv run orch consensus "Microservices vs monolith for a 3-person team building an e-commerce site?" flash gemini deepseek --mode vote
```
### Code Review
Include the code as context:
```bash
orch consensus "Is this error handling approach correct and complete?" flash gemini --file src/handler.py
cd ~/proj/orch && uv run orch consensus "Is this error handling approach correct and complete?" flash gemini --file src/handler.py
```
### Devil's Advocate
Get opposing viewpoints deliberately:
```bash
orch consensus "Should we adopt Kubernetes?" gpt5:for claude:against flash:neutral
cd ~/proj/orch && uv run orch consensus "Should we adopt Kubernetes?" gpt5:for deepseek:against flash:neutral
```
### Brainstorm
Generate diverse ideas:
```bash
orch consensus "How could we improve the CI/CD pipeline?" flash gemini claude --mode brainstorm
cd ~/proj/orch && uv run orch consensus "How could we improve the CI/CD pipeline?" flash gemini deepseek --mode brainstorm
```
### Critique Your Work
Find weaknesses before presenting:
```bash
orch consensus "What are the flaws in this API design?" flash gemini --file api-spec.yaml --mode critique
cd ~/proj/orch && uv run orch consensus "What are the flaws in this API design?" flash gemini --file api-spec.yaml --mode critique
```
## Output Format
@ -114,7 +136,7 @@ Reasoning: ...
[gemini] gemini-3-pro-preview - SUPPORT
Reasoning: ...
[claude] claude-sonnet-4.5 - OPPOSE
[deepseek] deepseek-v3 - OPPOSE
Reasoning: ...
```
@ -122,15 +144,14 @@ Reasoning: ...
1. **Use for genuine uncertainty** - Don't use orch for trivial decisions or to avoid thinking
2. **Provide context** - Better prompts get better consensus; use `--file` when relevant
3. **Choose models wisely** - flash/haiku for quick checks, opus/gpt5 for complex reasoning
3. **Choose models wisely** - flash/qwen for quick checks, r1/gpt for complex reasoning
4. **Consider stances** - Devil's advocate is powerful for stress-testing ideas
5. **Parse the reasoning** - The verdict matters less than understanding the reasoning
## Requirements
- `orch` CLI installed and in PATH (from ~/proj/orch)
- API keys configured for llm library (ANTHROPIC_API_KEY, GOOGLE_API_KEY, OPENAI_API_KEY)
- Run via: `cd ~/proj/orch && uv run orch ...` if not globally installed
- `orch` CLI source at ~/proj/orch
- API keys configured for llm library (OPENROUTER_KEY, GOOGLE_API_KEY, OPENAI_API_KEY)
## Examples