StandardAnthropicLLMService
Description
A Controller Service that provides integration with Anthropic's Claude AI models through their Messages API. Supports configurable parameters including model selection, response generation settings (temperature, top_p, top_k), token limits, and retry behavior.
Tags
AI, API, Anthropic, Claude, LLM, Language Model
Properties
In the list below required Properties are shown with an asterisk (*). Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.
Display Name | API Name | Default Value | Allowable Values | Description |
---|---|---|---|---|
Web Client Service * | Web Client Service | Controller Service: WebClientServiceProvider Implementations: StandardWebClientServiceProvider | The Web Client Service to use for communicating with the LLM provider. | |
Anthropic API Key * | Anthropic API Key | The API Key for authenticating to Anthropic | ||
Model Name * | Model Name | claude-3-5-sonnet-latest | The name of the Anthropic model Supports Expression Language, using FlowFile attributes and Environment variables. | |
Temperature | Temperature | The temperature to use for generating the response. Supports Expression Language, using FlowFile attributes and Environment variables. | ||
Top P | Top P | The top_p value for nucleus sampling. It controls the diversity of the generated responses. Supports Expression Language, using FlowFile attributes and Environment variables. | ||
Top K | Top K | The top K value to use for generating the response. Only sample from the top K options for each subsequent token. Recommended for advanced use cases only. You usually only need to use temperature. Supports Expression Language, using FlowFile attributes and Environment variables. | ||
User ID | User ID | The user id to set in the request metadata Supports Expression Language, using FlowFile attributes and Environment variables. | ||
Max Response Tokens * | Max Response Tokens | 1000 | The maximum number of tokens to generate in the response. Supports Expression Language, using FlowFile attributes and Environment variables. | |
Max Retries * | Max Retries | 3 | The maximum number of retry attempts for API calls | |
Backoff Base Delay (ms) * | Backoff Base Delay (ms) | 1000 | The base delay in milliseconds for exponential backoff between retries |
State Management
This component does not store state.
Restricted
This component is not restricted.
System Resource Considerations
This component does not specify system resource considerations.