Creating an Endpoint in Generative AI Agents

  1. On the Agents list page, click the name of the agent that you want to create an endpoint for.
    If you need help finding the list page, see Listing Agents
  2. Under Resources, click Endpoints.
  3. Click Create endpoint.
  4. Enter the following values:
    • Name: A name that starts with a letter or underscore, followed by letters, numbers, hyphens, or underscores. The length can be from 1 to 255 characters.
    • Description: An optional description
  5. Enable session: Check this box to keep the context of the chat session. You can't change this option later.
  6. If you enable session, specify the idle timeout period in seconds. The default idle timeout is 3600 seconds (1 hour). This means that after the lack of activity between the user and agent for 1 hour, the session automatically ends, and the following conversations don't retain the context of the previous conversation. You can set it as low as 1 hour and as high as 7 days.
  7. For Content moderation, select whether moderation is applied to:
    • User prompt (input)
    • Generated response (output)
    • Both input and output

    Learn about content moderation.

  8. Enable trace: Check this box to track and display the conversation history, including both the original prompt and the generated response, during the chat conversation. If you don't enable this feature now, you can always add it later by editing the endpoint.
  9. Enable citation: Check this box to display three references for each chat response. If you don't enable this feature now, you can always add it later by editing the endpoint.
  10. (Optional) Click Show advanced options and add one or more tags to the endpoint. If you have permissions to create a resource, then you have permission to update its tags. If you need help, see Tags and Tag Namespace Concepts.
  11. Click Create.