Configuration & Auth

Chord separates behavior configuration and credentials:

~/.config/chord/config.yaml: providers, models, extensions, defaults
~/.config/chord/auth.yaml: API keys / OAuth credentials
.chord/config.yaml: project-level overrides
~/.config/chord/agents/ and .chord/agents/: agent role definitions

Configuration layers

A practical precedence model is:

Built-in defaults
Global config
Project config
Agent-level config

This lets you keep personal defaults, project-specific behavior, and per-agent capabilities separate.

Project config is loaded from .chord/config.yaml without injecting built-in defaults first, then merged onto the already-loaded global config. Runtime commands treat the current working directory as the project root, so the project-layer config is read from ./.chord/config.yaml under the startup cwd rather than by searching parent directories. That means:

omitted project fields stay truly unset instead of silently shadowing global defaults;
malformed project config is treated as a startup error, not ignored;
global-only keys such as paths.* and maintenance.* are ignored in project config;
most scalar and object values override the global value at the same key;
model_pools merge by pool name, with same-name project pools overriding the global definition;
append-style extension points keep global entries and add project entries: currently skills.paths and per-trigger hook arrays under hooks.* append rather than replace.

On the first run, if you launch chord in an interactive terminal and config.yaml is missing, Chord starts a one-time setup wizard. It writes a minimal config.yaml and, when needed, auth.yaml, reuses matching existing credentials from auth.yaml when possible, then prints the resolved file locations. Redirected stdin does not by itself make startup non-interactive; if Chord can still open the controlling TTY, the wizard uses that TTY. If no controlling TTY is available, it exits instead of waiting for input.

Streaming tool execution (early execution)

Chord executes a small safe subset of tools speculatively while the model response is still streaming (as soon as tool arguments are complete), instead of waiting for the provider to fully finalize the response. This reduces the “finalize gap”.

Always enabled; there is no early_tool_execution toggle.
Eligible tools: Read, Grep, Glob, rollback-safe file mutation tools (Write, Edit, Delete), plus a conservative read-only subset of Shell (single command only; no pipes/redirects/&&/;):
- pwd, ls, cat, which
- git status|log|diff|show|branch|rev-parse
Not eligible: non-read-only Shell, interactive/control tools, or any call that requires permission action ask.
Speculative file mutations are real on-disk writes/deletes, but the runtime captures pre-state first and rolls them back if finalize discards the call. Within a turn, conflicting speculative file mutations for the same path are skipped and left to the finalized execution path. Read-like speculative calls are also skipped while any unpromoted speculative file mutation exists in the same turn (regardless of path), so they never observe uncommitted file state — at the cost of serializing read speculation behind in-flight writes.
Speculative results may be shown early in the UI, but they are only appended to the conversation context after finalize validation.

Minimal provider config

OpenRouter

providers:
  openrouter:
    type: chat-completions
    api_url: https://openrouter.ai/api/v1/chat/completions
    models:
      openai/gpt-5.5:
        limit:
          context: 400000
          input: 272000
          output: 128000
        modalities:
          input: [text, image]

OpenAI Chat Completions

providers:
  bigmodel:
    type: chat-completions
    api_url: https://open.bigmodel.cn/api/paas/v4/chat/completions
    models:
      glm-5.1:
        limit:
          context: 200000
          output: 131072

OpenAI Responses

providers:
  openai:
    type: responses
    api_url: https://api.openai.com/v1/responses
    models:
      gpt-5.5:
        limit:
          context: 400000
          input: 272000
          output: 128000

Read model limits in this order:

limit.context is the total window. For most models, input + requested output just needs to fit inside this number.
limit.input is only needed when the provider also lists a separate input cap. Some GPT models work this way; if you omit it, Chord derives the usable input budget from limit.context after reserving effective requested output.
limit.output is the model’s own output capacity. Chord’s default requested output cap (max_output_tokens) is still 32000, so real requests use the smaller output limit unless you raise it.

Chord’s gpt-5.5 examples use context=400000, input=272000, output=128000. Provider docs sometimes call this setup split limits; see Glossary.

OpenAI Codex preset

providers:
  codex:
    preset: codex
    type: responses
    models:
      gpt-5.5:
        limit:
          context: 400000
          input: 272000
          output: 128000

Google Gemini

providers:
  gemini:
    api_url: https://generativelanguage.googleapis.com/v1beta/models
    models:
      gemini-3.1-pro-preview:
        limit:
          context: 1048576
          output: 65536

For Gemini, set api_url to the /models base path. Chord detects type: generate-content from the /models suffix, so type can be omitted. Do not include the model name or :streamGenerateContent?alt=sse; Chord appends /{model}:streamGenerateContent?alt=sse automatically. The model map key, such as gemini-3.1-pro-preview, is the model ID sent to Gemini.

Gemini thinking options use the same unified thinking object as other providers (no separate gemini_thinking key):

thinking.budget → generationConfig.thinkingConfig.thinkingBudget
- Gemini: ✅ used
- Anthropic: ⚠️ only when thinking.type: enabled (mapped to Anthropic budget mode)
- OpenAI: ❌ ignored
thinking.include_thoughts → generationConfig.thinkingConfig.includeThoughts
- Gemini: ✅ used
- Anthropic / OpenAI: ❌ ignored
thinking.level → generationConfig.thinkingConfig.thinkingLevel (minimal|low|medium|high, Gemini 3+; not all models support minimal)
- Gemini (3+): ✅ used
- Gemini 2.x / Anthropic / OpenAI: ❌ ignored

Example:

providers:
  gemini:
    api_url: https://generativelanguage.googleapis.com/v1beta/models
    models:
      gemini-2.5-flash:
        limit:
          context: 1048576
          output: 65536
        thinking:
          budget: -1
          include_thoughts: true
      gemini-3-pro:
        limit:
          context: 1048576
          output: 65536
        thinking:
          budget: -1
          level: high

If type is omitted, Chord auto-detects it from provider config:

preset: codex → responses
api_url ending in /responses → responses
api_url ending in /chat/completions → chat-completions
api_url ending in /messages → messages
api_url ending in /models → generate-content

If none of these rules match, set type explicitly.

Thinking bilingual appended translation

If your model outputs English thinking / reasoning and you want an appended translation (for example, Chinese) in the TUI, you can enable thinking_translation:

model_pools:
  translation:
    - openai/gpt-5.4-mini

thinking_translation:
  target_language: zh-Hans
  model_pool: translation

Notes:

This feature only translates thinking / reasoning. It does not translate the assistant final answer.
Translation uses an LLM provider you already configured. thinking_translation.model_pool must point to a top-level model_pools entry.
target_language and model_pool are both required. If either is missing, the feature is disabled.
Use a separate low-cost translation pool when possible. The pool can contain multiple provider/model[@variant] refs; translation runs a single fallback round across pool entries in order: if one candidate fails (including network/5xx/timeout), it moves to the next candidate, and an empty translation result also triggers trying the next candidate.
The thinking-translation layer does not impose its own whole-translation timeout and does not use a circuit breaker. A temporary failure for one thinking block only skips that block; it does not block later thinking translations or the main response.
Per-provider request/header/stream idle timeouts still apply at the LLM transport layer. With the default auxiliary client settings, these are one-minute-class timeouts, so a stalled model/key can fail over while the pool still gets a chance to run.
The translated content is appended under the corresponding thinking card, separated by a neutral header like Translated · <target_language>. The translation is rendered with the same Markdown / code-highlighting pipeline and is not written back into model context.
Translations are reused only within the current process (in-memory). They are not persisted to disk and are not replayed across sessions.

More detailed fields are described in the config reference below.

auth.yaml

Provider keys must match the provider name in config.yaml:

The first-run wizard can create this file for you. It supports either literal API keys or $ENV_VAR placeholders.

anthropic:
  - "$ANTHROPIC_API_KEY"

openai:
  - "$OPENAI_API_KEY"

You can list multiple keys for rotation or backup.

For preset: codex OAuth providers, Chord now keeps frequently changing runtime status (quota snapshots, reset times, last warm-up timestamps, shared OAuth status cache) in auth.state.yaml, not in auth.yaml.

That split is intentional:

auth.yaml remains the user-edited source of truth for credentials and stable OAuth fields such as refresh, access, expires, account_id, and email;
auth.state.yaml is machine-managed shared runtime state, so quota / reset updates do not constantly rewrite auth.yaml while the user may also be editing it.

A typical auth.state.yaml entry looks like:

openai:
  openai:account_id:acc-1:
    account_id: acc-1
    email: user@example.com
    access: "..."
    status: normal
    updated_at: 1774009702606
    last_warmup_at: 1774009702606
    codex_primary_used_pct: 12.5
    codex_primary_window_minutes: 60
    codex_primary_reset_at: 1774013302000
    codex_secondary_used_pct: 40
    codex_secondary_window_minutes: 10080
    codex_secondary_reset_at: 1774600000000

These cached Codex quota/reset fields are restart-stable scheduling and display hints, not hard blocks by themselves:

they help startup / first-pick ordering choose accounts that are more likely to still have quota;
they let key switches immediately show the last cached snapshot before a fresh warm-up completes;
they do not by themselves make the account absolutely unselectable;
real hard blocking still comes from confirmed request failures and runtime cooldown state.

Environment variables in auth.yaml

Provider credentials in auth.yaml support environment-variable expansion for scalar API-key values:

anthropic:
  - "$ANTHROPIC_API_KEY"

openai:
  - "${OPENAI_API_KEY}"

Expansion is applied when the scalar starts with $. Unset variables expand to an empty string and are filtered out, unless the YAML value is a literal empty string. This expansion applies to auth.yaml credentials, not generally to every field in config.yaml.

If you intentionally need an empty API key, write a literal empty string:

local-provider:
  - ""

Do not rely on an unset environment variable for this case. An unset $ENV_VAR is treated as a missing credential and is filtered out.

OAuth

Only providers with preset: codex are treated as OAuth providers.

For Codex providers, key_order supports an additional value:

sequential
random
smart

preset: codex defaults to key_order: smart when key_order is omitted. Other providers still default to sequential.

smart keeps existing key_rotation behavior, but ranks selectable Codex OAuth accounts by:

preferring accounts whose shared cached snapshot shows both tracked windows still have remaining quota;
then preferring accounts where at least one tracked window still has remaining quota;
then preferring higher remaining headroom when rate-limit snapshots are available;
then preferring the nearer known reset time when candidates are otherwise similar;
still falling back to unknown / stale candidates when no better option exists.

When a Codex client becomes active, Chord may also background-probe additional OAuth slots to refresh cached headroom snapshots. That warm-up is best-effort, low-concurrency, cancels when the active client is replaced, and may update persisted OAuth credential status when an account becomes unusable.

Warm-up priority is also state-aware:

OAuth slots that have never been warmed up in shared state are probed first;
older cached entries are refreshed before recently refreshed ones;
after warm-up or polling returns a newer snapshot, Chord writes it to auth.state.yaml and other processes adopt it lazily when they next read key-selection or rate-limit state.

# auto-select a configured codex provider
chord auth

# explicitly choose a provider
chord auth codex

# headless / SSH environments
chord auth codex --device-code

Model pools (selecting provider/model)

Chord selects the active model via named model pools. Each pool entry should be a full provider/model[@variant] reference so the provider endpoint, auth, protocol, and variant tuning are unambiguous.

Pool definitions live in config.yaml (global or project-level), and agent configs only reference pool names. Agent configs cannot define inline pools.

Define model pools in config.yaml

# ~/.config/chord/config.yaml or .chord/config.yaml
model_pools:
  thinking:
    - anthropic/claude-opus-4.7
    - openai/gpt-5.5
  non-thinking:
    - anthropic/claude-sonnet-4

Project-level .chord/config.yaml model_pools are merged into the global config (same-name pools override).

Reference pools from agents

# ~/.config/chord/agents/builder.yaml or .chord/agents/builder.yaml
name: builder
mode: main
model_pools: [thinking, non-thinking]

name: reviewer
mode: subagent
model_pools: [thinking]

When no pool is explicitly selected, Chord falls back to the agent’s first pool in the model_pools: [...] list.

At runtime, use /models to switch the pool for the current view (per project, persisted across restarts). In the main view this means the current main role; in a SubAgent view it means that SubAgent’s agent pool selection. Switching pools updates the full fallback chain for subsequent LLM calls, even if the currently selected provider/model exists in both pools (in-flight requests keep using their starting snapshot). You can also set a named agent directly with /models --agent <name> <pool>. For SubAgents, the default behavior is simply to use the first pool listed in model_pools: [...]; switching back to that first pool restores the default behavior.

Reusing model templates with YAML anchors

Chord does not have a special model_templates schema field. You can still use YAML anchors and merge keys to avoid repeating model limits and variants. The top-level model_templates key below is just an ignored anchor container.

model_templates:
  gpt-400k: &gpt-400k
    limit:
      context: 400000
      input: 272000
      output: 128000
    reasoning:
      summary: auto
    text:
      verbosity: medium
    variants:
      high:
        reasoning:
          effort: high
      xhigh:
        reasoning:
          effort: xhigh
    modalities:
      input: [text, image]

  gpt-1m: &gpt-1m
    limit:
      context: 1050000
      input: 922000
      output: 128000
    reasoning:
      summary: auto
    text:
      verbosity: medium
    variants:
      high:
        reasoning:
          effort: high
      xhigh:
        reasoning:
          effort: xhigh
    modalities:
      input: [text, image]

  claude-1m: &claude-1m
    limit:
      context: 1000000
      output: 65536
    thinking:
      type: adaptive
      effort: medium
    variants:
      high:
        thinking:
          type: adaptive
          effort: high
      xhigh:
        thinking:
          type: adaptive
          effort: xhigh
    modalities:
      input: [text, image]

providers:
  codex:
    preset: codex
    models:
      gpt-5.5: *gpt-400k

  openai:
    api_url: https://api.openai.com/v1/responses
    models:
      gpt-5.4: *gpt-1m
      gpt-5.5: *gpt-400k

  anthropic:
    type: messages
    api_url: https://api.anthropic.com/v1/messages
    models:
      claude-opus-4.7: *claude-1m

Model fields used in the example:

limit.context: total request window in tokens when the provider exposes it.
limit.input: use this only when the provider also publishes a separate input cap. Chord uses it to decide when to compact before the prompt is too large and how to retry after a provider rejects a too-large request. If omitted, Chord derives the input budget from limit.context minus the effective requested output (max_output_tokens, capped by limit.output). It does not by itself reduce requested output tokens; output clamping follows limit.output, max_output_tokens, and any total-context (limit.context) remainder.
limit.output: model maximum output token capacity. Runtime requests are also capped by max_output_tokens, so the effective request uses the smaller value.
reasoning: OpenAI reasoning options, mainly for Responses-style reasoning models. summary controls reasoning summary output; variants commonly override reasoning.effort.
text.verbosity: OpenAI text verbosity hint, where supported.
thinking: Anthropic extended-thinking options. type: adaptive lets Chord derive an appropriate thinking budget from effort; variants can override thinking.effort.
variants: named model parameter presets. Use a model ref like openai/gpt-5.5@high or anthropic/claude-opus-4.7@xhigh to select one.
modalities.input: supported input modalities. Supported values are text, image, and pdf. When omitted, Chord defaults to text and image for backward compatibility.

Only fields defined by Chord’s model schema are used. modalities.output is not currently interpreted, so it is intentionally omitted from the example.

Project-level config

If a project needs local defaults, create this file at the project root:

.chord/config.yaml

Common uses include:

Project-specific permission rules
Project-specific LSP / MCP / Hooks / Skills settings

Provider request compression

Provider-level compress controls gzip compression for upstream request bodies. It is different from context compaction: it only changes HTTP request transfer encoding and does not summarize or remove conversation history.

providers:
  openai:
    compress: true

When enabled, Chord gzip-compresses the request body only if compression reduces the payload size; otherwise it sends the request uncompressed. Leave this unset unless your provider or gateway benefits from compressed request bodies.

Output token cap

Use max_output_tokens to set a global cap on requested output tokens. The effective request limit is still clamped by each model’s limit.output and available total context (limit.context when known), so runtime uses the smallest applicable value.

limit.input is separate: use it only for models whose providers publish an extra input cap beyond the total context window. Lowering max_output_tokens can reduce cost and long-response failure risk, but it does not increase a provider’s input allowance or replace limit.input.

max_output_tokens: 32000

Stream retry cap

Use stream_retry_rounds to put a hard ceiling on public LLM retry rounds. Each round can still walk the current model pool and each provider key in the normal order; this setting limits how many full rounds CompleteStream will make before giving up.

A “round” here means the whole public retry pass, not a single provider/model attempt. For example, stream_retry_rounds: 2 allows at most two full passes through the active routing chain. Once the cap is reached, Chord stops even for retry classes that would normally wait and continue, such as all-keys-cooling or concurrent-request 429 responses.

0 keeps the default behavior: retry until success, cancellation, or a terminal failure.
Positive values stop after that many rounds, even for cooling / concurrent-request retry classes.
This is mainly useful for automation or headless environments that prefer bounded latency over maximum persistence.

stream_retry_rounds: 3

Local TUI options

These options affect the local TUI. They can be set in the global config and can also be overridden by project-level .chord/config.yaml when appropriate.

desktop_notification: true
ime_switch_target: com.apple.keylayout.ABC
prevent_sleep: true

desktop_notification: enables terminal notifications in local TUI mode, mainly when the terminal is unfocused. Chord auto-selects a notification escape sequence by terminal (OSC 9 or OSC 777) and sends notifications for events such as permission confirmations, questions waiting for input, and agents returning to idle.
ime_switch_target: uses im-select (im-select.exe on Windows) to switch to the specified input method when entering Normal mode, and restore the previous input method when returning to Insert mode. This is useful when you want command keys to use an English keyboard layout.
prevent_sleep: prevents macOS idle sleep while any agent is active. It is only effective in local TUI mode.

WebFetch

WebFetch uses a built-in browser-like User-Agent by default. You can override it in config when a site needs a different header:

web_fetch:
  user_agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/136.0.0.0 Safari/537.36

This setting works in both global config and project-level .chord/config.yaml; project config overrides the global value.

You can also configure a proxy for WebFetch requests:

web_fetch:
  proxy: socks5://127.0.0.1:1080  # http, https, socks5 supported

proxy: nil (default) — inherits the global proxy setting
proxy: "" (empty string) — explicitly disables proxy (“direct” mode)
proxy: "http://...", "https://...", "socks5://..." — uses specified proxy

WebFetch intentionally remains a lightweight static HTTP reader. It does not run a local browser; JS-heavy pages may be marked as Content-Quality: suspect-shell when the returned HTML looks like an application shell rather than readable content.

MCP

MCP servers can expose many tools. Use allowed_tools to expose only selected remote tool names and avoid sending unused tool schemas to the model:

mcp:
  search:
    url: https://mcp.exa.ai/mcp
    allowed_tools:
      - web_search_exa
      - web_fetch_exa

The server name (search above) is user-defined. With this example, Chord registers only mcp_search_web_search_exa and mcp_search_web_fetch_exa. Filtered tools are not registered and do not enter the LLM tool surface.

Manual (on-demand) MCP servers

By default, configured MCP servers auto-start and become part of the default LLM tool context. For an MCP server you do not need in every conversation, set manual: true: it stays disabled at startup, Chord normally does not connect to it, and its tool descriptions are not added to the default context, reducing context overhead. Enable it manually only when you need it:

mcp:
  exa:
    url: https://mcp.exa.ai/mcp
    manual: true

When manual: true, the server starts in a disabled (gray) state and does not connect until you enable it.
Only servers configured with manual: true can be changed at runtime with /mcp. Auto-start servers are read-only in the MCP selector and are not affected by /mcp enable|disable.
Enable/disable at runtime with /mcp (menu in TUI) or with explicit commands:
- /mcp enable <server>
- /mcp disable <server>
- /mcp status

Startup consistency

Auto-start MCP servers still connect asynchronously after the TUI starts, but the first LLM request waits until each auto-start server has either connected successfully or reached a terminal failure state. This avoids tool-surface inconsistency between the agent and the model.

Agent config

Built-in roles include builder and planner. You can also add custom agents or override built-ins. Agent files can live in:

~/.config/chord/agents/
.chord/agents/

Supported file formats:

.md: YAML frontmatter plus a Markdown body. The body becomes the system prompt.
.yaml / .yml: plain YAML. Use prompt or system_prompt for the system prompt.

Markdown agent example:

---
name: backend-coder
description: Backend developer
mode: subagent
model_pools: [default]
permission:
  Write: ask
  Edit: ask
---

You are an agent focused on backend development.

Equivalent YAML agent example:

name: backend-coder
description: Backend developer
mode: subagent
model_pools: [default]
permission:
  Write: ask
  Edit: ask
prompt: |
  You are an agent focused on backend development.

Common fields include:

name: agent name. If omitted, Chord uses the filename without extension.
description: short description shown to the main agent when delegation is available.
mode: main for a MainAgent role, or subagent for a SubAgent. Empty and unknown values behave as main; sub_agent and sub are accepted as SubAgent aliases.
model_pools: list of pool names this agent can use (ordered). Pool definitions live in config.yaml top-level model_pools. Inline variants such as openai/gpt-5.5@high are specified in the pool definitions.
variant: default variant when a model ref does not include @variant.
permission: per-tool permission policy for this agent.
mcp: MCP config scoped to this agent.
delegation: limits such as max_children, max_depth, and child_join.
prompt / system_prompt: system prompt for plain YAML files.

Example:

name: builder
mode: main
model_pools: [default]
permission:
  "*": deny
  Read: allow
  Grep: allow
  Glob: allow
  Shell: allow
  Edit: ask
  Write: ask

Context compaction

When the main conversation approaches the model context limit, Chord can run durable compaction automatically. Common settings:

context:
  compact_threshold: 0.8
  compaction:
    model_pool: compact
    reserved: 16000

Set context.compaction.model_pool to pick a dedicated compaction model pool. If it is omitted, compaction clones the current agent model pool and starts from the current sticky cursor instead of falling back to a single model.

Automatic compaction is enabled when context.compact_threshold > 0; set context.compact_threshold: 0 to disable it.

Automatic compaction is driven by the usable input-side budget. If a model config sets limit.input, Chord starts from that value; otherwise it starts from limit.context - effective_max_output, where effective output is max_output_tokens (or the runtime default) capped by the model’s limit.output. If context.compaction.reserved is set, Chord subtracts it before applying compact_threshold. The TUI Context indicator in the info panel and footer uses this same input-budget calculation, so its percentage matches auto-compaction thresholds instead of the model’s total context window.

You can reserve headroom for tokenizer drift, tool-schema overhead, and compaction/recovery safety margin:

context:
  compact_threshold: 0.8
  compaction:
    reserved: 16000

With that setting, a model configured as input: 272000 will trigger automatic compaction based on a usable input budget of 256000 tokens.

When a provider publishes both a total context window and a separate input cap, use all three fields when you know them:

providers:
  openai:
    models:
      gpt-5.5:
        limit:
          context: 400000
          input: 272000
          output: 128000

This matters because reducing output does not increase a provider’s hard input allowance. Keeping automatic compaction enabled is recommended when your selected models have smaller input budgets or split input/output limits.

Provider/model diagnostics

# smoke-test all providers with representative models
chord doctor models

# test one provider's representative model
chord doctor models --provider openai

# test an exact model or variant
chord doctor models --model openai/gpt-5.5@high
chord doctor models --provider openai --model gpt-5.5@high

# audit each entry in a model pool independently
chord doctor models --pool thinking

Use this command as an auth, endpoint, transport, model, and variant tuning smoke test. It uses the same merged global + project config view as normal runtime startup, so project-level provider/proxy/model overrides are included. Pool diagnostics request each pool entry independently rather than following the normal fallback chain.

Configuration cheatsheet

The full top-level keys of config.yaml (both global ~/.config/chord/config.yaml and project-level .chord/config.yaml). All keys are optional unless noted.

Key	Type	Default	Scope	Summary
`providers`	`map[name]Provider`	—	global / project	Per-provider config (`type`, `api_url`, `preset`, `models`, `compress`). See Minimal provider config.
`model_pools`	`map[name][]ref`	—	global / project	Reusable named pools of full `provider/model[@variant]` refs. See Model pools.
`thinking_translation`	object	disabled	global / project	Optional appended translation for thinking / reasoning cards. Requires `target_language` and `model_pool`; failures only skip the affected thinking block.
`context`	object	see below	global / project	`compact_threshold`, `compaction.model_pool`, `compaction.reserved`. See Context compaction.
`skills`	object	empty	global / project	`paths: [...]` — additional skill directories beyond the defaults.
`confirm_timeout`	int (seconds)	`0` (no timeout)	global / project	Timeout for confirmation dialogs in TUI; `0` means wait forever.
`diff`	object	`{inline_max_columns: 200}`	global / project	TUI diff rendering. `inline_max_columns` caps one-line inline diff width.
`desktop_notification`	bool	`false`	global / project	Enable local-TUI terminal notifications when the terminal is unfocused; Chord auto-selects OSC 9 or OSC 777 by terminal. (Unsupported terminals ignore the sequence.)
`prevent_sleep`	bool	`false`	global / project	Prevent macOS idle sleep while any agent is active. macOS-only; no-op elsewhere.
`keymap`	`map[action][]key`	see Keybindings	global / project	Override key bindings. Action names use lower snake_case.
`commands`	`map[/cmd]text`	empty	global / project	Custom slash commands; `"/cmd"` → text inserted as a user message. See Customization — Custom slash commands.
`ime_switch_target`	string	empty	global / project	IM identifier passed to `im-select` / `im-select.exe` when entering Normal mode. Linux/macOS/Windows.
`log_level`	string	`info`	global / project	`debug` / `info` / `warn` / `error`. `debug` is verbose.
`paths`	object	XDG defaults	global only	`state_dir`, `cache_dir`, `sessions_dir`, `logs_dir`. CLI flags and `CHORD_*` env vars override.
`maintenance`	object	disabled	global only	`size_check_on_startup`, `size_check_interval_hours`, `warn_state_bytes`, `warn_cache_bytes`.
`lsp`	`map[name]Server`	empty	global / project	Per-language-server config. See Customization — LSP.
`mcp`	`map[name]MCP`	empty	global / project / agent	Per-MCP-server config. See MCP.
`hooks`	object	empty	global / project / agent	Hooks per trigger point. See Hooks.
`max_output_tokens`	int	model-default	global / project	Global cap on requested output tokens. Effective limit is also clamped by each model’s `limit.output`; reasoning requests also respect it.
`stream_retry_rounds`	int	`0` (retry until success/cancel)	global / project	Hard cap on public LLM full-round retries. `0` keeps retrying until success, cancellation, or terminal failure.
`proxy`	string	empty (use env / direct)	global / project	Global proxy URL. Per-tool override via `web_fetch.proxy`.
`web_fetch`	object	empty	global / project	`user_agent`, `proxy` (inherits global if nil; empty string = direct). See WebFetch.
`worktree`	object	empty	global / project	Defaults for `chord --worktree` and `chord worktree …` subcommands.

Provider field reference

Field	Type	Description
`type`	string	`messages` / `chat-completions` / `responses` / `generate-content`. Auto-detected from `api_url` or `preset` when omitted.
`api_url`	string	Endpoint URL. For Gemini, the `/models` base path; Chord appends `/{model}:streamGenerateContent?alt=sse`.
`preset`	string	Currently `codex` (OpenAI Codex / ChatGPT OAuth).
`compress`	bool	gzip request bodies when compression saves bytes. Off by default.
`models`	map	Map of model id → model config.

Model field reference

Field	Type	Description
`limit.context`	int	Total request window in tokens when known. If `limit.input` is omitted, Chord derives the input budget from this minus effective requested output.
`limit.input`	int	Separate input cap when a provider publishes one. Chord uses it to compact or retry before the prompt is too large.
`limit.output`	int	Maximum output tokens; runtime is also clamped by `max_output_tokens`.
`context.compaction.reserved`	int	Optional input-budget headroom reserved before `compact_threshold` is applied. Useful for tokenizer drift, tool overhead, and safer overflow recovery.
`reasoning`	object	OpenAI reasoning options (`summary`, `effort`). Variants commonly override `reasoning.effort`.
`text.verbosity`	string	OpenAI text verbosity hint where supported.
`thinking`	object	Anthropic extended-thinking options. `type: adaptive` lets Chord derive a budget from `effort`.
`variants`	map	Named parameter presets. Reference with `provider/model@variant`.
`modalities.input`	array	Subset of `text` / `image` / `pdf`. Defaults to `[text, image]`.

For more on agents, see Customization — Agents; for the full agent schema, see Agent config.