If you’re new to Unstructured, read this note first.Before you can create a destination connector, you must first sign in to your Unstructured account:
- If you do not already have an Unstructured account, sign up for free. After you sign up, you are automatically signed in to your new Unstructured Let’s Go account, at https://platform.unstructured.io. To sign up for a Business account instead, contact Unstructured Sales, or learn more.
- If you already have an Unstructured Let’s Go, Pay-As-You-Go, or Business SaaS account and are not already signed in, sign in to your account at https://platform.unstructured.io. For other types of Business accounts, see your Unstructured account administrator for sign-in instructions, or email Unstructured Support at support@unstructured.io.
-
After you sign in to your Unstructured Let’s Go, Pay-As-You-Go, or Business account, click API Keys on the sidebar.
For a Business account, before you click API Keys, make sure you have selected the organizational workspace you want to create an API key for. Each API key works with one and only one organizational workspace. Learn more. -
Click Generate API Key.
-
Follow the on-screen instructions to finish generating the key.
-
Click the Copy icon next to your new key to add the key to your system’s clipboard. If you lose this key, simply return and click the Copy icon again.
-
An Amazon S3 Vectors bucket.
- Learn how to create an S3 Vectors bucket.
- Learn how to get the name of an existing S3 Vectors bucket.
-
The AWS Region (such as
us-east-1) of the target S3 Vectors bucket. Learn how to get the Region of an existing S3 Vectors bucket. -
An index for the target S3 Vectors bucket.
- Learn how to create an index.
- Learn how to get the name of an existing index.
- Vector index name can be any allowed name pattern.
- For Dimension, only specify a number that is supported by Unstructured’s available embedding models.
- For Distance metric, only specify Cosine.
-
For Metadata configuration under Additional settings, Unstructured recommends that you specify the following 10 keys for Non-filterable metadata:
textlink_urlslink_textscoordinates-pointscoordinates-systemdata_source-urldata_source-record_locatordata_source-date_createddata_source-date_modifieddata_source-date_processed
- There are no Unstructured-specific requirements for Encryption or Tags.
- For the target index, the number of dimensions that are generated. Learn how to get the index’s number of dimensions.
-
The AWS access key ID and the AWS secret access key for the target AWS IAM principal (such as an IAM user or group) that has the appropriate access to the S3 Vectors bucket.
-
If you use identity-based policies to control access, the target IAM principal must have at minimum the following access permissions. Replace the following placeholders:
- Replace
<region-short-id>with the AWS Region short ID of the target S3 Vectors bucket. - Replace
<account-id>with the AWS account ID of the target S3 Vectors bucket. - Replace
<bucket-name>with the name of the target S3 Vectors bucket. - Replace
<index-name>with the name of the target index.
Learn more about these S3 Vectors access permissions. - Replace
- Learn how to attach an access policy to an IAM user, group, or role.
- Learn how to create and manage AWS access key IDs and their related AWS secret access keys for IAM users.
- Learn how to switch from an IAM user to a role for temporary access.
-
If you use identity-based policies to control access, the target IAM principal must have at minimum the following access permissions. Replace the following placeholders:
Create the destination connector
To create an S3 Vectors destination connector, see the following examples.<name>(required) - A unique name for this connector.<region>(required): The AWS Region (such asus-east-1) of the target Amazon S3 Vectors bucket.<key>(required): The AWS access key ID for the target AWS IAM principal that has the appropriate access to the target bucket.<secret>(required): The AWS secret access key for the corresponding AWS access key ID.<vector-bucket-name>(required): The name of the target bucket.<index-name>(required): The name of the target index in the bucket.<batch-size>: The maximum number of vectors to generate a single batch. The maximum is500. The default is100if not otherwise specified.<key-prefix>: Some string to prepend to each vector key. Prepending a string to each vector key can be useful for distinguishing between different datasets in the same bucket. Learn more about vector keys. The default is to not prepend a string to each vector key, if this value is not otherwise specified.

