
Connect to Azure AI Inference


🧪 Preview

This page describes how consuming apps connect to an Azure AI Inference endpoint that’s already registered as a connection string in your AppHost. For instructions on registering the connection, see Get started with the Azure AI Inference integrations.

When you reference an Azure AI Inference connection from your AppHost, Aspire injects the connection string into the consuming app as an environment variable. Your app can either read the environment variable directly — the pattern works from any language — or, in C#, use the Aspire.Azure.AI.Inference client integration for automatic dependency injection, health checks, and telemetry.

Aspire injects the full connection string into consuming apps:

  • .NET apps: available via IConfiguration under the key ConnectionStrings:{resourcename}
  • All other apps: available as the environment variable ConnectionStrings__{resourcename} (note the double underscore)

Connection string format:

Endpoint=https://{endpoint}/;Key={apikey};DeploymentId={deploymentName}
Component    | Description
Endpoint     | The Azure AI Inference service endpoint URL
Key          | The API key for authentication
DeploymentId | The model deployment name

Example:

Endpoint=https://myresource.services.ai.azure.com/models;Key=abc123;DeploymentId=gpt-4o-mini
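In a non-.NET app, you can read the injected environment variable and split the connection string yourself. A minimal sketch in Python, assuming a resource named ai-foundry (the fallback value below mirrors the example above and is only for illustration):

```python
import os

# Aspire injects the connection string as ConnectionStrings__{resourcename};
# "ai-foundry" is an example resource name, and the fallback value is
# illustrative only.
raw = os.environ.get(
    "ConnectionStrings__ai-foundry",
    "Endpoint=https://myresource.services.ai.azure.com/models;"
    "Key=abc123;DeploymentId=gpt-4o-mini",
)

# Split the "Key1=Value1;Key2=Value2" pairs into a dict.
settings = dict(
    part.split("=", 1) for part in raw.split(";") if "=" in part
)

endpoint = settings["Endpoint"]
key = settings["Key"]
deployment = settings["DeploymentId"]
```

The same split-on-`;` then split-on-first-`=` approach works in any language, since the format is a flat list of key/value pairs.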

Pick the language your consuming app is written in. Each example assumes your AppHost registers a connection string named ai-foundry and references it from the consuming app.

For C# apps, the recommended approach is the Aspire Azure AI Inference client integration. It registers a ChatCompletionsClient through dependency injection and adds health checks and telemetry automatically. If you’d rather read the connection string directly, see the Read the connection string section at the end of this tab.

Install the 📦 Aspire.Azure.AI.Inference NuGet package in the client-consuming project:

.NET CLI — Add Aspire.Azure.AI.Inference package
dotnet add package Aspire.Azure.AI.Inference

In Program.cs, call AddAzureChatCompletionsClient on your IHostApplicationBuilder to register a ChatCompletionsClient:

C# — Program.cs
builder.AddAzureChatCompletionsClient(connectionName: "ai-foundry");

Resolve the client through dependency injection:

C# — ExampleService.cs
public class ExampleService(ChatCompletionsClient client)
{
    // Use client...
}

Chain AddChatClient to also register an IChatClient from Microsoft.Extensions.AI:

C# — Program.cs
builder.AddAzureChatCompletionsClient(connectionName: "ai-foundry")
    .AddChatClient("gpt-4o-mini");

Resolve the IChatClient through dependency injection:

C# — ExampleService.cs
public class ExampleService(IChatClient chatClient)
{
    public async Task<string> GetResponseAsync(string userMessage)
    {
        var response = await chatClient.CompleteAsync(userMessage);
        return response.Message.Text ?? string.Empty;
    }
}

Call AddAzureEmbeddingsClient to register an EmbeddingsClient:

C# — Program.cs
builder.AddAzureEmbeddingsClient(connectionName: "ai-foundry");

To register multiple clients with different connection names, use the keyed variants:

C# — Program.cs
builder.AddKeyedAzureChatCompletionsClient(name: "chat");
builder.AddKeyedAzureChatCompletionsClient(name: "code");

Then resolve each instance by key:

C# — ExampleService.cs
public class ExampleService(
    [FromKeyedServices("chat")] ChatCompletionsClient chatClient,
    [FromKeyedServices("code")] ChatCompletionsClient codeClient)
{
    // Use clients...
}

For more information, see Keyed services in .NET.

The Aspire Azure AI Inference client integration offers multiple ways to provide configuration.

Connection strings. The integration reads the connection string from the ConnectionStrings configuration section automatically when you pass the connection name to AddAzureChatCompletionsClient:

JSON — appsettings.json
{
  "ConnectionStrings": {
    "ai-foundry": "Endpoint=https://{endpoint}/;Key={apikey};DeploymentId={deploymentName}"
  }
}

Configuration providers. The integration supports Microsoft.Extensions.Configuration. It loads ChatCompletionsClientSettings from configuration using the Aspire:Azure:AI:Inference key:

JSON — appsettings.json
{
  "Aspire": {
    "Azure": {
      "AI": {
        "Inference": {
          "DisableTracing": false,
          "EnableSensitiveTelemetryData": false
        }
      }
    }
  }
}

Inline delegates. Pass an Action<ChatCompletionsClientSettings> to configure settings inline:

C# — Program.cs
builder.AddAzureChatCompletionsClient(
    connectionName: "ai-foundry",
    configureSettings: static settings => settings.DisableTracing = true);

The Azure AI Inference client integration participates in Aspire health checks. The integration wires into the /health HTTP endpoint, where all registered health checks must pass before the app is considered ready to accept traffic.

The Aspire Azure AI Inference client integration automatically configures logging, tracing, and metrics through OpenTelemetry.

Logging categories:

  • Azure.Core
  • Azure.Identity

Tracing activities:

  • Experimental.Microsoft.Extensions.AI

If you prefer not to use the Aspire client integration, install the 📦 Azure.AI.Inference NuGet package and read the connection string from IConfiguration directly:

C# — Program.cs
using Azure;
using Azure.AI.Inference;

// Read the injected connection string and split it into key/value pairs.
var connectionString = builder.Configuration.GetConnectionString("ai-foundry")!;
var parts = connectionString.Split(';')
    .Select(p => p.Split('=', 2))
    .Where(p => p.Length == 2)
    .ToDictionary(p => p[0].Trim(), p => p[1].Trim());

var endpoint = new Uri(parts["Endpoint"]);
var apiKey = parts["Key"];
var deploymentId = parts["DeploymentId"];

var client = new ChatCompletionsClient(endpoint, new AzureKeyCredential(apiKey));
// The client isn't bound to a deployment; pass deploymentId as the model name
// on each request, for example via ChatCompletionsOptions.Model.