Model & CLI Commands
Commands for switching AI models, switching between CLI providers, configuring fallback chains, forwarding requests across providers, toggling Gemini thinking mode, and flushing model assignments. These commands let you dynamically reconfigure your AI backend mid-session without restarting.
Command Summary
| Command | Arguments | Description |
|---|---|---|
/model | [name] | Switch the active AI model |
/cli | [name] | Switch the active CLI provider |
/fallback | [cli1 cli2 ...|off] | Set or disable the fallback order |
/forward | [on|off] | Enable or disable request forwarding to another provider |
/thought | [on|off] | Toggle Gemini thinking (extended reasoning) mode |
/flush | [cli] [model] | off | Pin a model to a CLI, or clear the pin |
/model [name]
Switch the active AI model within the current CLI provider. The model name must be a valid identifier recognized by the active provider. When called without arguments, displays the currently active model.
Arguments
| Argument | Required | Description |
|---|---|---|
name | No | The model identifier to switch to. Omit to display the current model. |
Supported Models (examples)
| Provider | Model Identifiers |
|---|---|
| Claude | claude-opus-4-8, claude-opus-4-8[1m], claude-sonnet-4-6, claude-haiku-4-5 |
| Gemini | gemini-3-flash-preview, gemini-3.0-pro-preview, gemini-3.1-pro-preview, gemini-2.5-pro, gemini-2.5-flash |
| OpenAI | gpt-5.5, gpt-5.4, gpt-5.4-mini, gpt-5.3-codex |
| Grok | grok-3, grok-3-mini |
claude-e when you want to use the Claude allowance bundled with the subscription plan.Usage Examples
# Show current model
/model
# Switch to Claude Opus 4.8
/model claude-opus-4-8
# Switch to Sonnet for lighter tasks
/model claude-sonnet-4-6
# Switch to Gemini 2.5 Pro
/model gemini-3-flash-preview
/model call. You can also say "더 똑똑한 모델로 바꿔줘" and the orchestrator picks the strongest available model./cli [name]
Switch the active CLI provider. This changes the entire backend — the API endpoint, authentication, available models, and tool capabilities all change with the provider. When called without arguments, displays the current CLI.
Arguments
| Argument | Required | Description |
|---|---|---|
name | No | The CLI provider to switch to. Omit to display the current CLI. |
Supported CLIs
| CLI Name | Provider | Default Model |
|---|---|---|
claude | Anthropic Claude Code | claude-opus-4-8 |
claude-e | Claude E interactive wrapper | claude-opus-4-8 |
codex | OpenAI Codex CLI | gpt-5.5 |
codex-app | OpenAI Codex app server | gpt-5.5 |
gemini | Google Gemini CLI | gemini-3-flash-preview |
grok | xAI Grok | grok-3 |
opencode | OpenCode (multi-provider) | Varies |
copilot | GitHub Copilot | Varies |
Usage Examples
# Show current CLI
/cli
# Switch to Claude
/cli claude
# Switch to Gemini
/cli gemini
# Switch to Codex
/cli codex
# Switch to Grok
/cli grok
What Happens on Switch
When you switch CLIs, CLI-JAW:
- Gracefully disconnects from the current provider
- Initializes the new provider with its default model (unless a
/flushpin exists) - Carries over the conversation context where supported
- Updates the status bar and dashboard to reflect the new provider
/fallback [cli1 cli2 ...|off]
Configure a fallback chain of CLI providers. If the primary CLI fails (rate limit, network error, API outage), CLI-JAW automatically tries the next provider in the chain. This ensures uninterrupted operation even when a single provider is down.
Arguments
| Argument | Required | Description |
|---|---|---|
cli1 cli2 ... | No | Space-separated list of CLI names in priority order |
off | No | Disable the fallback chain entirely |
Usage Examples
# Set fallback: try Gemini first, then Codex, then Grok
/fallback gemini codex grok
# Set Claude as the only fallback
/fallback claude
# Disable fallback (only use the active CLI)
/fallback off
# Show current fallback chain
/fallback
How Fallback Works
- A request is sent to the active CLI
- If the active CLI returns an error (rate limit, timeout, 5xx), the request is retried on the first fallback CLI
- If that also fails, it moves to the next CLI in the chain
- If all fallbacks are exhausted, the error is surfaced to the user
/forward [on|off]
Enable or disable request forwarding to another provider. When forwarding is on, certain requests can be routed to a secondary provider for specialized handling — for example, sending a code generation request to Claude while using Gemini as the primary for conversation, or routing image analysis to a vision-capable model.
Arguments
| Argument | Required | Description |
|---|---|---|
on | No | Enable forwarding |
off | No | Disable forwarding |
Usage Examples
# Enable forwarding
/forward on
# Disable forwarding
/forward off
# Show current forwarding status
/forward
When to Use Forwarding
| Scenario | Primary CLI | Forward To |
|---|---|---|
| Code generation while chatting | gemini | claude |
| Vision tasks from a text-only CLI | codex | gemini |
| Cost optimization (light tasks local, heavy tasks forwarded) | grok | claude |
/thought [on|off]
Toggle Gemini's thinking (extended reasoning) mode. When enabled, Gemini uses additional compute to reason step-by-step before responding, similar to Claude's extended thinking. This is useful for complex tasks like multi-step math, logic puzzles, or architectural decisions. Only applies when Gemini is the active CLI.
Arguments
| Argument | Required | Description |
|---|---|---|
on | No | Enable thinking mode |
off | No | Disable thinking mode |
Usage Examples
# Enable Gemini thinking
/thought on
# Disable thinking (faster responses, lower cost)
/thought off
# Show current thinking status
/thought
When to Use Thinking Mode
- Enable for: complex reasoning, multi-step problems, architecture decisions, debugging intricate issues, math/logic
- Disable for: simple questions, quick edits, file operations, routine tasks (saves time and tokens)
/flush [cli] [model] | off
Pin a specific model to a CLI provider, or clear an existing pin. When a flush pin is set, switching to that CLI automatically loads the pinned model instead of the provider's default. Use off to clear all pins or a specific pin.
Arguments
| Argument | Required | Description |
|---|---|---|
cli | No | The CLI provider to pin a model to |
model | No | The model to pin to that CLI |
off | No | Clear all flush pins, or clear a specific CLI's pin when preceded by a CLI name |
Usage Examples
# Pin Claude to always use Opus 4.8
/flush claude claude-opus-4-8
# Pin Gemini to always use 2.5 Pro
/flush gemini gemini-3-flash-preview
# Clear the Claude pin (reverts to default model on next switch)
/flush claude off
# Clear all pins
/flush off
# Show current flush pins
/flush
How Flush Pins Work
Flush pins are stored in the session and persist until explicitly cleared or the session ends. They override the provider's default model selection:
- You set
/flush claude claude-opus-4-8 - Later, you switch to Gemini with
/cli gemini - When you switch back with
/cli claude, it automatically usesclaude-opus-4-8.
Natural Language Mapping
CLI-JAW's command parser understands natural language in both Korean and English. Here are common phrases and their command mappings:
| Natural Language | Mapped Command |
|---|---|
| "클로드로 바꿔줘" | /cli claude |
| "제미나이로 전환해줘" | /cli gemini |
| "오퍼스 모델로 바꿔줘" | /model claude-opus-4-8 |
| "소넷으로 내려줘" | /model claude-sonnet-4-6 |
| "더 똑똑한 모델 써줘" | /model (upgrades to strongest available) |
| "더 빠른 모델로" | /model (downgrades to fastest available) |
| "클로드 안 되면 제미나이 써줘" | /fallback gemini |
| "폴백 꺼줘" | /fallback off |
| "포워딩 켜줘" | /forward on |
| "생각 모드 켜줘" | /thought on |
| "빠르게 답해줘" | /thought off |
| "클로드는 항상 오퍼스 써줘" | /flush claude claude-opus-4-8 |
| "플러시 초기화해줘" | /flush off |
| "Switch to Claude" | /cli claude |
| "Use the smartest model" | /model (upgrades to strongest) |
| "Set fallback to Gemini then Grok" | /fallback gemini grok |
Common Workflows
Cost-Optimized Setup
Use a lightweight model for routine tasks and pin a powerful model for when you need it:
# Default to Opus 4.8 for strongest Claude work
/model claude-opus-4-8
# Pin Opus for when you switch to heavy tasks
/flush claude claude-opus-4-8
# Set Gemini Flash as fallback for rate limits
/fallback gemini
Multi-Provider Resilience
Ensure uninterrupted service by chaining fallbacks:
# Primary: Claude, Fallback: Gemini -> Grok -> Codex
/cli claude
/fallback gemini grok codex
# Pin each provider to its best model
/flush claude claude-opus-4-8
/flush gemini gemini-3-flash-preview
/flush grok grok-3
Gemini Deep Reasoning Session
Switch to Gemini with thinking enabled for complex analysis:
# Switch to Gemini Pro with thinking
/cli gemini
/model gemini-3-flash-preview
/thought on
# When done, switch back to Claude
/cli claude
/thought off