How to Set Up and Configure the Amazon S3 Connector

All BA Insight connectors can be downloaded from the Upland Right Answers Portal under Knowledge > BA Insight > Product Downloads > Connectors. This connector is installed with the same generic steps as any BA Insight connector. You must satisfy the Prerequisites for your connector before installing. The configuration specifics are detailed below.

200 characters limit

S3 connector only supports relative paths up to 200 characters in the S3 bucket.

Connection Configuration Specifics

Amazon S3 Connector has to option to choose between web services depending on the content you are crawling.

After installation, when configuring the Connection Connection defines the how Connectivity Hub connects to your Source System (which contains your documents, graphics, etc.,). Your Connection includes identifying elements such as: URL of the BA Insight web service connector you are using, (File Share connector, SharePoint Online connector, etc.), Authentication mode, User Accounts and Credentials, Database information (for database connectors) you see a drop-down menu with the options:

  • Basic indexing
  • Related documents indexing

Basic Indexing

Amazon S3 log in details:

  • Login
    • If using the Amazon S3 authentication:
      • Provide the Amazon S3 Access Key ID 
    • If using an external provider:
      • Check with the provider for the appropriate login/user/account data to provide.
      • Example: var username = amazonConnectionConfigurationInfo.AccessTokenInfo.AccessKeyId;
  • Password:
    • If using the Amazon S3 authentication:
      • Provide the Amazon S3 Secret Access Key
    • If using an external provider:
      • Check with the provider for the appropriate password/secret data to provide
      • Example: var pass = amazonConnectionConfigurationInfo.AccessTokenInfo.SecretAccessKey;

Parameters

Basic indexing allows the following parameters in the Amazon S3 Connection Parameters field:

Example With External Authentication Service

Example with external authentication service
Copy
<configuration>
   <Bucket>Mytestbucket1</Bucket>
   <UseExternalCredentialsProvider>true</UseExternalCredentialsProvider>
   <ExternalCredentialsProviderUrl>
      http://MyExternalAuthProviderURL
   </ExternalCredentialsProviderUrl>
</configuration>

Related Documents Indexing

This sub connector handles the scenario where multiple files stored in the same bucket should be indexed as a single item.

  • For instance, a contract and its addendums should be indexed together.

The above configuration details for Basic Indexing also apply to Related Documents Indexing. 

In order for Amazon to know that documents are related to each other, a pattern has to be provided and the documents in the AWS bucket have to respect this pattern in their naming.

Example

Example: related documents
Copy
<relateddocumentspattern>
   <binarycontent>
      <![CDATA[(?<filename>.*).(?<extension>pdf|docx|pptx)]]>
   </binarycontent>
   <documenturls>
      <![CDATA[(?<filename>.*_URLs).(?<extension>txt)]]>
   </documenturls>
   <documentmetadata>
      <![CDATA[(?<filename>.*).(?<extension>xml)]]>
   </documentmetadata>
</relateddocumentspattern>

Content Configuration Specifics

Amazon S3 has a flat structure with no hierarchy like you would see in a typical file system.

For the sake of organizational simplicity, the Amazon S3 console supports the folder concept as a means of grouping objects.

  • Amazon S3 does this by using key name prefixes for objects.

  • Amazon S3 Connector supports this structure with the use of filters on content