`pydantic_ai.settings`

ModelSettings

Bases: TypedDict

用于配置 LLM 的设置。

这里我们仅包含适用于多个模型/模型提供商的设置，但并非所有模型都支持所有这些设置。

源代码位于 pydantic_ai_slim/pydantic_ai/settings.py 中

class ModelSettings(TypedDict, total=False):
    """Settings to configure an LLM.

    Here we include only settings which apply to multiple models / model providers,
    though not all of these settings are supported by all models.
    """

    max_tokens: int
    """The maximum number of tokens to generate before stopping.

    Supported by:

    * Gemini
    * Anthropic
    * OpenAI
    * Groq
    * Cohere
    * Mistral
    * Bedrock
    """

    temperature: float
    """Amount of randomness injected into the response.

    Use `temperature` closer to `0.0` for analytical / multiple choice, and closer to a model's
    maximum `temperature` for creative and generative tasks.

    Note that even with `temperature` of `0.0`, the results will not be fully deterministic.

    Supported by:

    * Gemini
    * Anthropic
    * OpenAI
    * Groq
    * Cohere
    * Mistral
    * Bedrock
    """

    top_p: float
    """An alternative to sampling with temperature, called nucleus sampling, where the model considers the results of the tokens with top_p probability mass.

    So 0.1 means only the tokens comprising the top 10% probability mass are considered.

    You should either alter `temperature` or `top_p`, but not both.

    Supported by:

    * Gemini
    * Anthropic
    * OpenAI
    * Groq
    * Cohere
    * Mistral
    * Bedrock
    """

    timeout: float | Timeout
    """Override the client-level default timeout for a request, in seconds.

    Supported by:

    * Gemini
    * Anthropic
    * OpenAI
    * Groq
    * Mistral
    """

    parallel_tool_calls: bool
    """Whether to allow parallel tool calls.

    Supported by:

    * OpenAI (some models, not o1)
    * Groq
    * Anthropic
    """

    seed: int
    """The random seed to use for the model, theoretically allowing for deterministic results.

    Supported by:

    * OpenAI
    * Groq
    * Cohere
    * Mistral
    """

    presence_penalty: float
    """Penalize new tokens based on whether they have appeared in the text so far.

    Supported by:

    * OpenAI
    * Groq
    * Cohere
    * Gemini
    * Mistral
    """

    frequency_penalty: float
    """Penalize new tokens based on their existing frequency in the text so far.

    Supported by:

    * OpenAI
    * Groq
    * Cohere
    * Gemini
    * Mistral
    """

    logit_bias: dict[str, int]
    """Modify the likelihood of specified tokens appearing in the completion.

    Supported by:

    * OpenAI
    * Groq
    """

max_tokens `instance-attribute`

max_tokens: int

停止前生成的最大 token 数。

支持

Gemini
Anthropic
OpenAI
Groq
Cohere
Mistral
Bedrock

temperature `instance-attribute`

temperature: float

注入到响应中的随机量。

对于分析/多项选择，使用更接近 0.0 的 temperature，对于创造性和生成性任务，使用更接近模型最大 temperature 的值。

请注意，即使 temperature 为 0.0，结果也不会完全确定。

支持

Gemini
Anthropic
OpenAI
Groq
Cohere
Mistral
Bedrock

top_p `instance-attribute`

top_p: float

一种替代使用 temperature 进行采样的 Nucleus 采样方法，其中模型考虑具有 top_p 概率质量的 token 的结果。

因此，0.1 表示仅考虑包含前 10% 概率质量的 token。

您应该更改 temperature 或 top_p，但不能同时更改两者。

支持

Gemini
Anthropic
OpenAI
Groq
Cohere
Mistral
Bedrock

timeout `instance-attribute`

timeout: float | Timeout

以秒为单位，覆盖请求的客户端级别默认超时。

支持

Gemini
Anthropic
OpenAI
Groq
Mistral

parallel_tool_calls `instance-attribute`

parallel_tool_calls: bool

是否允许并行工具调用。

支持

OpenAI（某些模型，非 o1）
Groq
Anthropic

seed `instance-attribute`

seed: int

用于模型的随机种子，理论上允许确定性结果。

支持

OpenAI
Groq
Cohere
Mistral

presence_penalty `instance-attribute`

presence_penalty: float

根据新 token 是否已在文本中出现过对其进行惩罚。

支持

OpenAI
Groq
Cohere
Gemini
Mistral

frequency_penalty `instance-attribute`

frequency_penalty: float

根据新 token 在文本中已有的频率对其进行惩罚。

支持

OpenAI
Groq
Cohere
Gemini
Mistral

logit_bias `instance-attribute`

logit_bias: dict[str, int]

修改指定 token 出现在补全中的可能性。

支持

OpenAI
Groq

pydantic_ai.settings

ModelSettings

max_tokens instance-attribute

temperature instance-attribute

top_p instance-attribute

timeout instance-attribute

parallel_tool_calls instance-attribute

seed instance-attribute

presence_penalty instance-attribute

frequency_penalty instance-attribute

logit_bias instance-attribute