How to Set Up Your Content Source for Indexing

Use the instructions below to define a new content source to index. When you define a new content source, you specify the repository that your Connector crawls to extract data. You also make the content source available in the Elastic index.

Below, you use the Content Info page to do 2 things:

  • Link the content in a repository to the BA Insight Connector that you define
  • Provide basic indexing information

To setup your web service content source for indexing manually, use the following method:

  1. Open Connectivity Hub.
  2. Click Content Sources from the top horizontal menu.

  3. Click New > Advanced Web Service content.
  4. A new screen appears with the Content Info tab open.
  5. Complete the following fields:
Field Description
Target

This specifies the target where the content is pushed. For more information on setting up a target, see how to configure your target.

Connection This specifies the connection to the content source from which your Connector will pull content. Use the drop-down to change the connection. For more information on setting up a connection, see how to connect to a source system.
Title Enter a title for your web service content source.
Target Index This is a READ-ONLY field that is automatically populated from the Title field after you save the Content Source.
Crawl start date: Specify the starting date for the crawled data. You must use US format mm/dd/yyyy .
Max paging size: This specifies the number of items that can be queued at any time. BA Insight recommends 20000.
Content Localization: Refer to the Microsoft language identifiers list and enter a valid localization ID (LCID).
Max file size Enter the maximum file size to be processed in MB. Any files that are larger than the specified size are not indexed. By default, this setting is set to 50 (MB)
Max size of extracted text This specifies the maximum number of characters that are stored in the search index per item. By default, this setting is set to 1000000 characters.
Property prefix Make sure that ESC_  property prefix (or any other custom prefix) is entered. (This is the specified property name prefix for each metadata name in your content source system.)

Fields that are denoted with an asterisk (*) are required and a value must be included in the field. Appropriate naming conventions are used.