Model & CLI Commands

Commands for switching AI models and CLI providers, configuring fallback chains, forwarding requests across providers, toggling Gemini thinking mode, and pinning models to CLIs with /flush. These commands let you reconfigure your AI backend mid-session without restarting.

All commands on this page can be invoked as slash commands in the chat input, or in natural language in Korean or English. CLI-JAW understands phrases like "클로드로 바꿔줘" ("switch to Claude") and maps them to the appropriate command automatically.

Command Summary

Command    Arguments              Description
/model     [name]                 Switch the active AI model
/cli       [name]                 Switch the active CLI provider
/fallback  [cli1 cli2 ...|off]    Set or disable the fallback order
/forward   [on|off]               Enable or disable request forwarding to another provider
/thought   [on|off]               Toggle Gemini thinking (extended reasoning) mode
/flush     [cli] [model] | off    Pin a model to a CLI, or clear the pin

/model [name]

Switch the active AI model within the current CLI provider. The model name must be a valid identifier recognized by the active provider. When called without arguments, displays the currently active model.

Arguments

Argument  Required  Description
name      No        The model identifier to switch to. Omit to display the current model.

Supported Models (examples)

Provider  Model Identifiers
Claude    claude-opus-4-8, claude-opus-4-8[1m], claude-sonnet-4-6, claude-haiku-4-5
Gemini    gemini-3-flash-preview, gemini-3.0-pro-preview, gemini-3.1-pro-preview, gemini-2.5-pro, gemini-2.5-flash
OpenAI    gpt-5.5, gpt-5.4, gpt-5.4-mini, gpt-5.3-codex
Grok      grok-3, grok-3-mini

Subscription note: GPT 5.5 and Claude Opus 4.8 require a Pro-tier subscription or higher. Starting in June, choose claude-e when you want to use the Claude allowance bundled with the subscription plan.

Usage Examples

# Show current model
/model

# Switch to Claude Opus 4.8
/model claude-opus-4-8

# Switch to Sonnet for lighter tasks
/model claude-sonnet-4-6

# Switch to Gemini 3 Flash (preview)
/model gemini-3-flash-preview

"~해줘" tip: Say "오퍼스로 바꿔줘" ("switch to Opus") or "소넷으로 전환해줘" ("switch to Sonnet") and CLI-JAW maps it to the right /model call. You can also say "더 똑똑한 모델로 바꿔줘" ("switch to a smarter model") and the orchestrator picks the strongest available model.

/cli [name]

Switch the active CLI provider. This changes the entire backend — the API endpoint, authentication, available models, and tool capabilities all change with the provider. When called without arguments, displays the current CLI.

Arguments

Argument  Required  Description
name      No        The CLI provider to switch to. Omit to display the current CLI.

Supported CLIs

CLI Name   Provider                      Default Model
claude     Anthropic Claude Code         claude-opus-4-8
claude-e   Claude E interactive wrapper  claude-opus-4-8
codex      OpenAI Codex CLI              gpt-5.5
codex-app  OpenAI Codex app server       gpt-5.5
gemini     Google Gemini CLI             gemini-3-flash-preview
grok       xAI Grok                      grok-3
opencode   OpenCode (multi-provider)     Varies
copilot    GitHub Copilot                Varies

Usage Examples

# Show current CLI
/cli

# Switch to Claude
/cli claude

# Switch to Gemini
/cli gemini

# Switch to Codex
/cli codex

# Switch to Grok
/cli grok

"~해줘" tip: Say "클로드로 바꿔줘" ("switch to Claude") to switch to Claude, "제미나이로 전환해줘" ("switch to Gemini") for Gemini, or "코덱스 써줘" ("use Codex") for Codex. You can also say "다른 AI로 바꿔줘" ("switch to a different AI") and the orchestrator will ask which provider you prefer.

What Happens on Switch

When you switch CLIs, CLI-JAW:

/fallback [cli1 cli2 ...|off]

Configure a fallback chain of CLI providers. If the primary CLI fails (rate limit, network error, API outage), CLI-JAW automatically tries the next provider in the chain. This ensures uninterrupted operation even when a single provider is down.

Arguments

Argument       Required  Description
cli1 cli2 ...  No        Space-separated list of CLI names in priority order
off            No        Disable the fallback chain entirely

Usage Examples

# Set fallback: try Gemini first, then Codex, then Grok
/fallback gemini codex grok

# Set Claude as the only fallback
/fallback claude

# Disable fallback (only use the active CLI)
/fallback off

# Show current fallback chain
/fallback

How Fallback Works

  1. A request is sent to the active CLI
  2. If the active CLI returns an error (rate limit, timeout, 5xx), the request is retried on the first fallback CLI
  3. If that also fails, it moves to the next CLI in the chain
  4. If all fallbacks are exhausted, the error is surfaced to the user

"~해줘" tip: Say "폴백 설정해줘 제미나이 코덱스" ("set fallback Gemini Codex") to set a fallback chain, or "폴백 꺼줘" ("turn off fallback") to disable it. You can also say "클로드 안 되면 제미나이 써줘" ("if Claude doesn't work, use Gemini") and the orchestrator configures the chain for you.
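The four steps under How Fallback Works can be sketched as a simple loop. This is only an illustration under stated assumptions, not CLI-JAW's actual implementation: ProviderError, send_with_fallback, and the providers mapping are hypothetical names, and each provider is modeled as a plain function that either returns a reply or raises.

```python
from typing import Callable, Dict, List, Optional


class ProviderError(Exception):
    """Hypothetical error standing in for a rate limit, timeout, or 5xx."""


def send_with_fallback(
    prompt: str,
    active_cli: str,
    fallback_chain: List[str],
    providers: Dict[str, Callable[[str], str]],
) -> str:
    """Try the active CLI first, then each fallback CLI in priority order."""
    last_error: Optional[ProviderError] = None
    for cli in [active_cli, *fallback_chain]:
        try:
            return providers[cli](prompt)
        except ProviderError as err:
            last_error = err  # remember the failure, move to the next CLI
    # Every CLI in the chain failed: surface the error to the caller.
    raise ProviderError(f"all providers failed; last error: {last_error}")


def claude_down(prompt: str) -> str:
    raise ProviderError("rate limit")


def gemini_up(prompt: str) -> str:
    return f"gemini: {prompt}"


# Active CLI is claude; the chain mirrors "/fallback gemini".
reply = send_with_fallback(
    "hi", "claude", ["gemini"], {"claude": claude_down, "gemini": gemini_up}
)
print(reply)  # gemini: hi
```

With an empty chain (the equivalent of /fallback off), a failure of the active CLI goes straight to the final raise, which matches step 4.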

/forward [on|off]

Enable or disable request forwarding to another provider. When forwarding is on, certain requests can be routed to a secondary provider for specialized handling — for example, sending a code generation request to Claude while using Gemini as the primary for conversation, or routing image analysis to a vision-capable model.

Arguments

Argument  Required  Description
on        No        Enable forwarding
off       No        Disable forwarding

Usage Examples

# Enable forwarding
/forward on

# Disable forwarding
/forward off

# Show current forwarding status
/forward

When to Use Forwarding

Scenario                                                      Primary CLI  Forward To
Code generation while chatting                                gemini       claude
Vision tasks from a text-only CLI                             codex        gemini
Cost optimization (light tasks local, heavy tasks forwarded)  grok         claude

"~해줘" tip: Say "포워딩 켜줘" ("turn on forwarding") to enable, "포워딩 꺼줘" ("turn off forwarding") to disable. You can also say "코드는 클로드한테 보내줘" ("send code to Claude") and the orchestrator enables forwarding with the right target.
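One way to picture forwarding is as a routing decision made per request. The sketch below is an assumption-heavy illustration: this page does not describe how CLI-JAW classifies requests or stores forward targets, so ForwardConfig, the task_kind labels, and pick_cli are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class ForwardConfig:
    enabled: bool = False  # toggled by "/forward on" and "/forward off"
    # Hypothetical mapping from a task kind to the CLI that should handle it.
    targets: Dict[str, str] = field(default_factory=dict)


def pick_cli(task_kind: str, primary_cli: str, config: ForwardConfig) -> str:
    """Return the CLI that should handle a request of the given kind."""
    if not config.enabled:
        return primary_cli  # forwarding off: everything stays on the primary
    # Forwarding on: use the target for this kind, else fall back to primary.
    return config.targets.get(task_kind, primary_cli)


# First row of the scenario table: chat on gemini, code forwarded to claude.
config = ForwardConfig(enabled=True, targets={"code": "claude"})
print(pick_cli("code", "gemini", config))  # claude
print(pick_cli("chat", "gemini", config))  # gemini
```

Keeping the on/off flag separate from the target mapping means turning forwarding off does not lose the configured targets.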

/thought [on|off]

Toggle Gemini's thinking (extended reasoning) mode. When enabled, Gemini uses additional compute to reason step-by-step before responding, similar to Claude's extended thinking. This is useful for complex tasks like multi-step math, logic puzzles, or architectural decisions. Only applies when Gemini is the active CLI.

Arguments

Argument  Required  Description
on        No        Enable thinking mode
off       No        Disable thinking mode

Usage Examples

# Enable Gemini thinking
/thought on

# Disable thinking (faster responses, lower cost)
/thought off

# Show current thinking status
/thought

When to Use Thinking Mode

"~해줘" tip: Say "생각 모드 켜줘" ("turn on thinking mode") or "thinking 켜줘" ("turn on thinking") to enable. Say "생각 모드 꺼줘" ("turn off thinking mode") or "빠르게 답해줘" ("answer quickly") to disable. Works only when Gemini is the active CLI; for other providers the command is silently ignored.

/flush [cli] [model] | off

Pin a specific model to a CLI provider, or clear an existing pin. When a flush pin is set, switching to that CLI automatically loads the pinned model instead of the provider's default. Use off to clear all pins or a specific pin.

Arguments

Argument  Required  Description
cli       No        The CLI provider to pin a model to
model     No        The model to pin to that CLI
off       No        Clear all flush pins, or clear a specific CLI's pin when preceded by a CLI name

Usage Examples

# Pin Claude to always use Opus 4.8
/flush claude claude-opus-4-8

# Pin Gemini to always use Gemini 3 Flash (preview)
/flush gemini gemini-3-flash-preview

# Clear the Claude pin (reverts to default model on next switch)
/flush claude off

# Clear all pins
/flush off

# Show current flush pins
/flush

How Flush Pins Work

Flush pins are stored in the session and persist until explicitly cleared or the session ends. They override the provider's default model selection:

  1. You set /flush claude claude-opus-4-8
  2. Later, you switch to Gemini with /cli gemini
  3. When you switch back with /cli claude, it automatically uses claude-opus-4-8.

"~해줘" tip: Say "클로드는 항상 오퍼스 써줘" ("always use Opus for Claude") to pin Opus to Claude, or "플러시 초기화해줘" ("reset flush") to clear all pins. You can also say "제미나이 모델 고정해줘 프로로" ("pin the Gemini model to Pro") to pin Gemini to Pro.
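The pin behavior described under How Flush Pins Work can be modeled as a small piece of session state. This is a minimal sketch, not CLI-JAW's code: the Session class and its method names are hypothetical. The default models come from the Supported CLIs table, and the example pins Sonnet rather than Opus so the override is visible.

```python
from typing import Dict, List

# Default model per CLI, copied from the Supported CLIs table.
DEFAULT_MODELS = {
    "claude": "claude-opus-4-8",
    "gemini": "gemini-3-flash-preview",
    "grok": "grok-3",
}


class Session:
    """Hypothetical session state holding the active CLI, model, and pins."""

    def __init__(self, cli: str = "claude") -> None:
        self.pins: Dict[str, str] = {}
        self.active_cli = cli
        self.active_model = DEFAULT_MODELS[cli]

    def flush(self, args: List[str]) -> Dict[str, str]:
        """Handle the argument forms of /flush and return the current pins."""
        if args == ["off"]:
            self.pins.clear()              # /flush off
        elif len(args) == 2 and args[1] == "off":
            self.pins.pop(args[0], None)   # /flush <cli> off
        elif len(args) == 2:
            self.pins[args[0]] = args[1]   # /flush <cli> <model>
        elif args:
            raise ValueError(f"unsupported /flush arguments: {args}")
        return dict(self.pins)             # bare /flush only shows the pins

    def switch_cli(self, cli: str) -> None:
        """A pinned model wins over the provider default on switch."""
        self.active_cli = cli
        self.active_model = self.pins.get(cli, DEFAULT_MODELS[cli])


session = Session("claude")
session.flush(["claude", "claude-sonnet-4-6"])
session.switch_cli("gemini")
session.switch_cli("claude")
print(session.active_model)  # claude-sonnet-4-6
```

Because the pins live on the session object, they disappear when the session ends, consistent with the persistence rule stated above.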

Natural Language Mapping

CLI-JAW's command parser understands natural language in both Korean and English. Here are common phrases and their command mappings:

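Natural Language                                                   Mapped Command
"클로드로 바꿔줘" ("switch to Claude")                              /cli claude
"제미나이로 전환해줘" ("switch to Gemini")                          /cli gemini
"오퍼스 모델로 바꿔줘" ("switch to the Opus model")                 /model claude-opus-4-8
"소넷으로 내려줘" ("drop down to Sonnet")                           /model claude-sonnet-4-6
"더 똑똑한 모델 써줘" ("use a smarter model")                       /model (upgrades to strongest available)
"더 빠른 모델로" ("to a faster model")                              /model (downgrades to fastest available)
"클로드 안 되면 제미나이 써줘" ("if Claude doesn't work, use Gemini")  /fallback gemini
"폴백 꺼줘" ("turn off fallback")                                   /fallback off
"포워딩 켜줘" ("turn on forwarding")                                /forward on
"생각 모드 켜줘" ("turn on thinking mode")                          /thought on
"빠르게 답해줘" ("answer quickly")                                  /thought off
"클로드는 항상 오퍼스 써줘" ("always use Opus for Claude")          /flush claude claude-opus-4-8
"플러시 초기화해줘" ("reset flush")                                 /flush off
"Switch to Claude"                                                 /cli claude
"Use the smartest model"                                           /model (upgrades to strongest)
"Set fallback to Gemini then Grok"                                 /fallback gemini grok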
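The simplest way to read the table above is as a lookup from phrase to command. The sketch below assumes exact-phrase matching, which is almost certainly cruder than whatever the real parser does; PHRASE_TO_COMMAND and map_phrase are hypothetical names, and only a few rows of the table are included.

```python
from typing import Optional

# A few rows from the mapping table; English keys are stored case-folded.
PHRASE_TO_COMMAND = {
    "클로드로 바꿔줘": "/cli claude",
    "제미나이로 전환해줘": "/cli gemini",
    "폴백 꺼줘": "/fallback off",
    "포워딩 켜줘": "/forward on",
    "생각 모드 켜줘": "/thought on",
    "switch to claude": "/cli claude",
    "set fallback to gemini then grok": "/fallback gemini grok",
}


def map_phrase(text: str) -> Optional[str]:
    """Map chat input to a slash command, or None if nothing matches."""
    stripped = text.strip()
    if stripped.startswith("/"):
        return stripped  # already a slash command: pass it through unchanged
    # casefold() normalizes English casing and leaves Korean text as-is.
    return PHRASE_TO_COMMAND.get(stripped.casefold())


print(map_phrase("클로드로 바꿔줘"))      # /cli claude
print(map_phrase("Switch to Claude"))  # /cli claude
print(map_phrase("hello"))             # None
```

Phrases such as "더 똑똑한 모델 써줘" ("use a smarter model") do not fit a static table, since the target depends on which models are available; those would need a separate resolution step.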

Common Workflows

Cost-Optimized Setup

Use a lightweight model for routine tasks and pin a powerful model for when you need it:

# Default to Sonnet for routine, lighter tasks
/model claude-sonnet-4-6

# Pin Opus for when you switch to heavy tasks
/flush claude claude-opus-4-8

# Set Gemini Flash as fallback for rate limits
/fallback gemini

Multi-Provider Resilience

Ensure uninterrupted service by chaining fallbacks:

# Primary: Claude, Fallback: Gemini -> Grok -> Codex
/cli claude
/fallback gemini grok codex

# Pin each provider to a preferred model
/flush claude claude-opus-4-8
/flush gemini gemini-3-flash-preview
/flush grok grok-3

Gemini Deep Reasoning Session

Switch to Gemini with thinking enabled for complex analysis:

# Switch to Gemini 3 Flash (preview) with thinking
/cli gemini
/model gemini-3-flash-preview
/thought on

# When done, turn thinking off while Gemini is still active
# (/thought is ignored for other providers), then switch back to Claude
/thought off
/cli claude