Skip to content

Advanced Configuration

The AI Peon plugin provides advanced configuration options accessible via Window > Preferences > Peon AI > AI Peon Advanced.

AI Peon Advanced

Per-Agent Model Selection

Different agents can use different models to optimize for cost, speed, or capability:

AgentPurposeRecommended Model Type
Dev (default)Creating task plans and strategiesReasoning-capable models (e.g., Sonnet)
PlanCreating task plans and strategiesReasoning-capable models (e.g., Opus)
SearchFinding relevant context and informationFast, smaller models (e.g., Haiku)
CompactConversation compression for context managementFast, smaller models (e.g., Haiku)

How It Works

  1. The Dev agent always uses the base model you configure — this is your primary coding model
  2. Leave a per-agent field empty to use the provider's default for that agent
  3. Enter a specific model name to override only that agent's model
  4. Models are validated against your provider's available models when you click "Check Host and Port..."

Temperature Settings

Temperature controls the randomness of model outputs:

SettingRangeEffect
Plan Temperature0.6 - 1.0Higher = more creative plans; Lower = more deterministic
Dev Temperature0.2 - 0.6Controls code generation creativity (uses base model)
  • Claude and some other models only accept 1.0.

Debug Mode

When enabled, logs all requests and responses to the Eclipse console.

Use cases:

  • Troubleshooting connection issues
  • Understanding what context is being sent to the model
  • Debugging prompt template issues if you create an issue

Query Parameters

Add custom query parameters to API requests (format: key=value,key2=value2):

Example: stream=false,timeout=30

Useful for:

  • Provider-specific options not exposed in the UI
  • Testing different API behaviors
  • Adding custom headers through query strings

Header Parameters

Add custom HTTP headers to requests (format: key=value,key2=value2):

Example: X-Custom-Header=myvalue,Authorization=Bearer token123

Useful for:

  • Custom authentication requirements
  • Provider-specific features via headers
  • Adding tracking or debugging information

Max Output Tokens

Controls the maximum number of tokens in model responses (0 = disable limit): langchain4j and some LLMs default to 1024 -- if you have odd behaviors increase this.

SettingEffect
Low values (1024)Short, concise responses; faster generation - may break
Recommended low (2048)Have a short value, limits also the think budget where possible
Default nowerdays (4000)Usually a a good default or with Opus around 8000
Disabled (0)Provider's default limit applies

Troubleshooting

Models Not Being Used by Agents

Restart Eclipse after changing preferences

Released under the MIT License.