Model & CLI Commands

Commands for switching AI models, switching between CLI providers, configuring fallback chains, forwarding requests across providers, toggling Gemini thinking mode, and pinning models to CLI providers. These commands let you reconfigure your AI backend mid-session without restarting.

All commands on this page can be invoked as slash commands in the chat input or through natural language in Korean or English. CLI-JAW understands phrases like "클로드로 바꿔줘" ("switch to Claude") and maps them to the appropriate command automatically.

Command Summary

Command	Arguments	Description
`/model`	`[name]`	Switch the active AI model
`/cli`	`[name]`	Switch the active CLI provider
`/fallback`	`[cli1 cli2 ...|off]`	Set or disable the fallback order
`/forward`	`[on|off]`	Enable or disable request forwarding to another provider
`/thought`	`[on|off]`	Toggle Gemini thinking (extended reasoning) mode
`/flush`	`[cli] [model] | off`	Pin a model to a CLI, or clear the pin
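
To make the argument shapes in the summary table concrete, here is a minimal Python sketch of how a slash command line could be split into a command name and its arguments. The function `parse_slash_command` and the set `KNOWN_COMMANDS` are hypothetical names introduced for illustration; this is not CLI-JAW's actual parser.

```python
from typing import List, Tuple

# Command names taken from the summary table; the parser itself is hypothetical.
KNOWN_COMMANDS = {"model", "cli", "fallback", "forward", "thought", "flush"}


def parse_slash_command(line: str) -> Tuple[str, List[str]]:
    """Split '/name arg1 arg2' into ('name', ['arg1', 'arg2'])."""
    tokens = line.split()
    if not tokens or not tokens[0].startswith("/"):
        raise ValueError(f"not a slash command: {line!r}")
    name = tokens[0][1:]
    if name not in KNOWN_COMMANDS:
        raise ValueError(f"unknown command: /{name}")
    # Everything after the command name is passed through as arguments.
    return name, tokens[1:]


print(parse_slash_command("/fallback gemini codex grok"))
print(parse_slash_command("/flush claude off"))
```

A command with no arguments, such as `/model`, yields an empty argument list, which matches the "omit to display the current value" behavior described for each command.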

/model [name]

Switch the active AI model within the current CLI provider. The model name must be a valid identifier recognized by the active provider. When called without arguments, displays the currently active model.

Arguments

Argument	Required	Description
`name`	No	The model identifier to switch to. Omit to display the current model.

Supported Models (examples)

Provider	Model Identifiers
Claude	`claude-opus-4-8`, `claude-opus-4-8[1m]`, `claude-sonnet-4-6`, `claude-haiku-4-5`
Gemini	`gemini-3-flash-preview`, `gemini-3.0-pro-preview`, `gemini-3.1-pro-preview`, `gemini-2.5-pro`, `gemini-2.5-flash`
OpenAI	`gpt-5.5`, `gpt-5.4`, `gpt-5.4-mini`, `gpt-5.3-codex`
Grok	`grok-3`, `grok-3-mini`

Subscription note: GPT 5.5 and Claude Opus 4.8 require a Pro-tier subscription or higher. Starting in June, choose claude-e when you want to use the Claude allowance bundled with the subscription plan.

Usage Examples

# Show current model
/model

# Switch to Claude Opus 4.8
/model claude-opus-4-8

# Switch to Sonnet for lighter tasks
/model claude-sonnet-4-6

# Switch to Gemini 3 Flash (preview)
/model gemini-3-flash-preview

"~해줘" tip: Say "오퍼스로 바꿔줘" or "소넷으로 전환해줘" and CLI-JAW maps it to the right /model call. You can also say "더 똑똑한 모델로 바꿔줘" and the orchestrator picks the strongest available model.

/cli [name]

Switch the active CLI provider. This changes the entire backend — the API endpoint, authentication, available models, and tool capabilities all change with the provider. When called without arguments, displays the current CLI.

Arguments

Argument	Required	Description
`name`	No	The CLI provider to switch to. Omit to display the current CLI.

Supported CLIs

CLI Name	Provider	Default Model
`claude`	Anthropic Claude Code	`claude-opus-4-8`
`claude-e`	Claude E interactive wrapper	`claude-opus-4-8`
`codex`	OpenAI Codex CLI	`gpt-5.5`
`codex-app`	OpenAI Codex app server	`gpt-5.5`
`gemini`	Google Gemini CLI	`gemini-3-flash-preview`
`grok`	xAI Grok	`grok-3`
`opencode`	OpenCode (multi-provider)	Varies
`copilot`	GitHub Copilot	Varies

Usage Examples

# Show current CLI
/cli

# Switch to Claude
/cli claude

# Switch to Gemini
/cli gemini

# Switch to Codex
/cli codex

# Switch to Grok
/cli grok

"~해줘" tip: Say "클로드로 바꿔줘" to switch to Claude, "제미나이로 전환해줘" for Gemini, or "코덱스 써줘" for Codex. You can also say "다른 AI로 바꿔줘" and the orchestrator will ask which provider you prefer.

What Happens on Switch

When you switch CLIs, CLI-JAW:

Gracefully disconnects from the current provider
Initializes the new provider with its default model (unless a /flush pin exists)
Carries over the conversation context where supported
Updates the status bar and dashboard to reflect the new provider

/fallback [cli1 cli2 ...|off]

Configure a fallback chain of CLI providers. If the primary CLI fails (rate limit, network error, API outage), CLI-JAW automatically tries the next provider in the chain, so an outage at a single provider does not interrupt the session.

Arguments

Argument	Required	Description
`cli1 cli2 ...`	No	Space-separated list of CLI names in priority order
`off`	No	Disable the fallback chain entirely

Usage Examples

# Set fallback: try Gemini first, then Codex, then Grok
/fallback gemini codex grok

# Set Claude as the only fallback
/fallback claude

# Disable fallback (only use the active CLI)
/fallback off

# Show current fallback chain
/fallback

How Fallback Works

A request is sent to the active CLI
If the active CLI returns an error (rate limit, timeout, 5xx), the request is retried on the first fallback CLI
If that also fails, it moves to the next CLI in the chain
If all fallbacks are exhausted, the error is surfaced to the user
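
The steps above can be sketched in Python as a loop over the active CLI followed by the fallback chain. This is an illustration under stated assumptions, not CLI-JAW's implementation: `ProviderError`, `send_with_fallback`, and the `send` callback are hypothetical names, and the sketch assumes every retryable failure (rate limit, timeout, 5xx) is raised as one exception type.

```python
from typing import Callable, Optional, Sequence


class ProviderError(Exception):
    """Hypothetical stand-in for a rate limit, timeout, or 5xx failure."""


def send_with_fallback(
    prompt: str,
    active: str,
    fallbacks: Sequence[str],
    send: Callable[[str, str], str],
) -> str:
    """Try the active CLI first, then each fallback CLI in priority order."""
    last_error: Optional[ProviderError] = None
    for cli in [active, *fallbacks]:
        try:
            return send(cli, prompt)
        except ProviderError as err:
            last_error = err  # remember the failure and move down the chain
    # Every CLI in the chain failed: surface the last error to the caller.
    assert last_error is not None
    raise last_error


def fake_send(cli: str, prompt: str) -> str:
    # Simulate Claude being rate limited while the other CLIs answer.
    if cli == "claude":
        raise ProviderError("rate limit")
    return f"{cli}: ok"


print(send_with_fallback("hello", "claude", ["gemini", "codex"], fake_send))
```

With `/fallback off`, the `fallbacks` sequence would be empty and the first failure would be surfaced directly.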

"~해줘" tip: Say "폴백 설정해줘 제미나이 코덱스" to set a fallback chain, or "폴백 꺼줘" to disable it. You can also say "클로드 안 되면 제미나이 써줘" and the orchestrator configures the chain for you.

/forward [on|off]

Enable or disable request forwarding to another provider. When forwarding is on, certain requests can be routed to a secondary provider for specialized handling — for example, sending a code generation request to Claude while using Gemini as the primary for conversation, or routing image analysis to a vision-capable model.

Arguments

Argument	Required	Description
`on`	No	Enable forwarding
`off`	No	Disable forwarding

Usage Examples

# Enable forwarding
/forward on

# Disable forwarding
/forward off

# Show current forwarding status
/forward

When to Use Forwarding

Scenario	Primary CLI	Forward To
Code generation while chatting	gemini	claude
Vision tasks from a text-only CLI	codex	gemini
Cost optimization (light tasks local, heavy tasks forwarded)	grok	claude
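
One way to picture the scenarios in this table is a lookup from request type to target provider that only applies while forwarding is on. The Python sketch below is purely illustrative: the source does not describe how CLI-JAW classifies requests or chooses a target, so `FORWARD_RULES`, `pick_provider`, and the `task_kind` labels are all assumptions.

```python
from typing import Dict

# Hypothetical routing rules modeled on the first two rows of the table.
FORWARD_RULES: Dict[str, str] = {
    "code": "claude",    # code generation while chatting
    "vision": "gemini",  # vision tasks from a text-only CLI
}


def pick_provider(task_kind: str, primary: str, forwarding: bool) -> str:
    """Return the CLI that should handle a request of the given kind."""
    if not forwarding:
        return primary  # /forward off: everything stays on the primary CLI
    # Unlisted task kinds fall through to the primary CLI.
    return FORWARD_RULES.get(task_kind, primary)


print(pick_provider("code", primary="gemini", forwarding=True))
print(pick_provider("code", primary="gemini", forwarding=False))
```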

"~해줘" tip: Say "포워딩 켜줘" to enable, "포워딩 꺼줘" to disable. You can also say "코드는 클로드한테 보내줘" and the orchestrator enables forwarding with the right target.

/thought [on|off]

Toggle Gemini's thinking (extended reasoning) mode. When enabled, Gemini uses additional compute to reason step-by-step before responding, similar to Claude's extended thinking. This is useful for complex tasks like multi-step math, logic puzzles, or architectural decisions. Only applies when Gemini is the active CLI.

Arguments

Argument	Required	Description
`on`	No	Enable thinking mode
`off`	No	Disable thinking mode

Usage Examples

# Enable Gemini thinking
/thought on

# Disable thinking (faster responses, lower cost)
/thought off

# Show current thinking status
/thought

When to Use Thinking Mode

Enable for: complex reasoning, multi-step problems, architecture decisions, debugging intricate issues, math/logic
Disable for: simple questions, quick edits, file operations, routine tasks (saves time and tokens)
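
Because thinking mode only applies while Gemini is the active CLI, the toggle can be thought of as a small state update guarded by the active provider. The sketch below is a hypothetical Python model of that rule (`apply_thought` is an illustrative name); it assumes that `/thought` issued under any other provider leaves the state unchanged.

```python
from typing import Optional


def apply_thought(active_cli: str, enabled: bool, arg: Optional[str]) -> bool:
    """Return the thinking-mode state after '/thought [on|off]'."""
    if arg is None:
        return enabled  # no argument: only report the current state
    if active_cli != "gemini":
        return enabled  # assumed no-op when Gemini is not the active CLI
    if arg not in ("on", "off"):
        raise ValueError(f"expected 'on' or 'off', got {arg!r}")
    return arg == "on"


print(apply_thought("gemini", False, "on"))
print(apply_thought("claude", True, "off"))
```

Under this model, turning thinking off has to happen while Gemini is still active; the same command issued after switching to another CLI would not change the stored state.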

"~해줘" tip: Say "생각 모드 켜줘" or "thinking 켜줘" to enable. Say "생각 모드 꺼줘" or "빠르게 답해줘" to disable. Works only when Gemini is the active CLI — the command is silently ignored for other providers.

/flush [cli] [model] | off

Pin a specific model to a CLI provider, or clear an existing pin. When a flush pin is set, switching to that CLI automatically loads the pinned model instead of the provider's default. Use off to clear all pins or a specific pin.

Arguments

Argument	Required	Description
`cli`	No	The CLI provider to pin a model to
`model`	No	The model to pin to that CLI
`off`	No	Clear all flush pins, or clear a specific CLI's pin when preceded by a CLI name

Usage Examples

# Pin Claude to always use Opus 4.8
/flush claude claude-opus-4-8

# Pin Gemini to always use Gemini 3 Flash (preview)
/flush gemini gemini-3-flash-preview

# Clear the Claude pin (reverts to default model on next switch)
/flush claude off

# Clear all pins
/flush off

# Show current flush pins
/flush

How Flush Pins Work

Flush pins are stored in the session and persist until explicitly cleared or the session ends. They override the provider's default model selection:

You set /flush claude claude-opus-4-8
Later, you switch to Gemini with /cli gemini
When you switch back with /cli claude, it automatically uses claude-opus-4-8.
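
The pin lookup described above can be modeled as a per-session dictionary that is consulted before the provider default. The following Python sketch is an assumption-laden illustration, not the real implementation: the `Session` class and its methods are hypothetical, and `DEFAULT_MODELS` copies a subset of the Supported CLIs table. The walk-through pins `claude-sonnet-4-6` rather than Opus so that the effect of the pin is visible against Claude's default.

```python
from typing import Dict

# Default models for a subset of CLIs, copied from the Supported CLIs table.
DEFAULT_MODELS: Dict[str, str] = {
    "claude": "claude-opus-4-8",
    "gemini": "gemini-3-flash-preview",
    "codex": "gpt-5.5",
    "grok": "grok-3",
}


class Session:
    """Hypothetical in-memory session that holds flush pins."""

    def __init__(self) -> None:
        self.pins: Dict[str, str] = {}
        self.cli = "claude"
        self.model = DEFAULT_MODELS["claude"]

    def flush(self, *args: str) -> None:
        if args == ("off",):
            self.pins.clear()             # /flush off
        elif len(args) == 2 and args[1] == "off":
            self.pins.pop(args[0], None)  # /flush <cli> off
        elif len(args) == 2:
            self.pins[args[0]] = args[1]  # /flush <cli> <model>
        else:
            raise ValueError("usage: /flush [cli] [model] | off")

    def switch_cli(self, cli: str) -> None:
        self.cli = cli
        # A pin, when present, wins over the provider's default model.
        self.model = self.pins.get(cli, DEFAULT_MODELS[cli])


s = Session()
s.flush("claude", "claude-sonnet-4-6")
s.switch_cli("gemini")
s.switch_cli("claude")
print(s.model)  # claude-sonnet-4-6

s.flush("claude", "off")
s.switch_cli("claude")
print(s.model)  # claude-opus-4-8
```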

"~해줘" tip: Say "클로드는 항상 오퍼스 써줘" to pin Opus to Claude, or "플러시 초기화해줘" to clear all pins. You can also say "제미나이 모델 고정해줘 프로로" to pin Gemini to Pro.

Natural Language Mapping

CLI-JAW's command parser understands natural language in both Korean and English. Here are common phrases and their command mappings:

Natural Language	Mapped Command
"클로드로 바꿔줘"	`/cli claude`
"제미나이로 전환해줘"	`/cli gemini`
"오퍼스 모델로 바꿔줘"	`/model claude-opus-4-8`
"소넷으로 내려줘"	`/model claude-sonnet-4-6`
"더 똑똑한 모델 써줘"	`/model` (upgrades to strongest available)
"더 빠른 모델로"	`/model` (downgrades to fastest available)
"클로드 안 되면 제미나이 써줘"	`/fallback gemini`
"폴백 꺼줘"	`/fallback off`
"포워딩 켜줘"	`/forward on`
"생각 모드 켜줘"	`/thought on`
"빠르게 답해줘"	`/thought off`
"클로드는 항상 오퍼스 써줘"	`/flush claude claude-opus-4-8`
"플러시 초기화해줘"	`/flush off`
"Switch to Claude"	`/cli claude`
"Use the smartest model"	`/model` (upgrades to strongest)
"Set fallback to Gemini then Grok"	`/fallback gemini grok`

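As a rough illustration of the mapping table above, the Python sketch below resolves a handful of the listed phrases with a normalized dictionary lookup. This is a deliberate simplification: the source does not say how CLI-JAW's parser works, and open-ended requests such as "더 똑똑한 모델 써줘" clearly need more than exact matching. `PHRASES` and `map_phrase` are hypothetical names.

```python
import re
from typing import Optional

# A few rows copied from the mapping table; a real parser would cover far more.
PHRASES = {
    "클로드로 바꿔줘": "/cli claude",
    "제미나이로 전환해줘": "/cli gemini",
    "폴백 꺼줘": "/fallback off",
    "포워딩 켜줘": "/forward on",
    "생각 모드 켜줘": "/thought on",
    "switch to claude": "/cli claude",
    "set fallback to gemini then grok": "/fallback gemini grok",
}


def map_phrase(text: str) -> Optional[str]:
    """Return the mapped slash command, or None when no phrase matches."""
    if text.lstrip().startswith("/"):
        return text.strip()  # already a slash command, pass it through
    # Collapse whitespace, drop trailing punctuation, and ignore case.
    key = re.sub(r"\s+", " ", text).strip().rstrip(".!?").casefold()
    return PHRASES.get(key)


print(map_phrase("Switch to Claude!"))
print(map_phrase("폴백 꺼줘"))
print(map_phrase("hello"))
```
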
Common Workflows

Cost-Optimized Setup

Use a lightweight model for routine tasks and pin a powerful model for when you need it:

# Default to Sonnet for routine tasks
/model claude-sonnet-4-6

# Pin Opus for when you switch to heavy tasks
/flush claude claude-opus-4-8

# Set Gemini Flash as fallback for rate limits
/fallback gemini

Multi-Provider Resilience

Ensure uninterrupted service by chaining fallbacks:

# Primary: Claude, Fallback: Gemini -> Grok -> Codex
/cli claude
/fallback gemini grok codex

# Pin each provider to a preferred model
/flush claude claude-opus-4-8
/flush gemini gemini-3-flash-preview
/flush grok grok-3

Gemini Deep Reasoning Session

Switch to Gemini with thinking enabled for complex analysis:

# Switch to Gemini 3 Flash (preview) with thinking
/cli gemini
/model gemini-3-flash-preview
/thought on

# When done, turn thinking off while Gemini is still active, then switch back to Claude
/thought off
/cli claude
