Use `generate()` for full control over text generation with detailed performance metrics.
LLMGenerationResult
The result object contains comprehensive metrics:

| Property | Type | Description |
|---|---|---|
| `text` | `String` | Generated response text |
| `thinkingContent` | `String?` | Reasoning content (for thinking models) |
| `inputTokens` | `Int` | Number of prompt tokens |
| `tokensUsed` | `Int` | Number of output tokens |
| `modelUsed` | `String` | Model ID used for generation |
| `latencyMs` | `Double` | Total generation time in milliseconds |
| `tokensPerSecond` | `Double` | Generation speed |
| `timeToFirstTokenMs` | `Double?` | Time to first token (streaming) |
| `framework` | `String?` | Inference framework used |
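
Three of the properties above are optional, so code that reads the metrics has to handle missing values. The sketch below shows one way to turn the fields into a log line. It assumes a hypothetical stand-in struct, `GenerationResultSketch`, that mirrors the documented property names and types so the example compiles on its own. It is not the SDK's type, `metricsSummary` is an illustrative helper, and the sample values are made up.

```swift
import Foundation

// Hypothetical stand-in mirroring the documented properties of LLMGenerationResult.
// The real type comes from the SDK; this struct only makes the sketch self-contained.
struct GenerationResultSketch {
    let text: String
    let thinkingContent: String?
    let inputTokens: Int
    let tokensUsed: Int
    let modelUsed: String
    let latencyMs: Double
    let tokensPerSecond: Double
    let timeToFirstTokenMs: Double?
    let framework: String?
}

// Builds a one-line metrics summary, skipping optional fields that are nil.
func metricsSummary(_ r: GenerationResultSketch) -> String {
    var parts = [
        "model=\(r.modelUsed)",
        "in=\(r.inputTokens)",
        "out=\(r.tokensUsed)",
        String(format: "latency=%.0fms", r.latencyMs),
        String(format: "speed=%.1f tok/s", r.tokensPerSecond)
    ]
    // Time to first token is only reported in some cases (streaming), so unwrap it.
    if let ttft = r.timeToFirstTokenMs {
        parts.append(String(format: "ttft=%.0fms", ttft))
    }
    if let framework = r.framework {
        parts.append("framework=\(framework)")
    }
    return parts.joined(separator: " ")
}

// Sample values are invented for illustration only.
let sample = GenerationResultSketch(
    text: "Hello!",
    thinkingContent: nil,
    inputTokens: 12,
    tokensUsed: 48,
    modelUsed: "example-model",
    latencyMs: 1200,
    tokensPerSecond: 40,
    timeToFirstTokenMs: nil,
    framework: nil
)
print(metricsSummary(sample))
```

With a real result, the same optional-binding pattern applies to `thinkingContent`, which is only present for thinking models.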