How to Set Up Your Content Source for Indexing

Use the instructions below to define a new content source to index.

  • When you define a new content source, you specify the repository that your Connector crawls to extract data.

  • You also make the content source available in the Elastic index.

Below, you use the Content Info page to do 2 things:

  • Link the content in a repository to the BA Insight Connector that you define
  • Provide basic indexing information

To setup your web service content source for indexing manually, use the following method:

  1. Open Connectivity Hub.
  2. Click Content Sources from the top horizontal menu.

  3. Click New > Advanced Web Service content.


  4. A new screen appears with the Content Info tab open. See the following graphic (contains sample values).



    All of the fields in this screen are required.

    *The Target Index is a READ-ONLY field that is automatically populated (from the "Title" field) after you save the Content Source.

    Appropriate naming conventions are used.

  5. Complete the following fields:
Field Description
Target

Where the content is pushed.

Connection and Title: See the connection to the content named (Title) source from which your Connector will pull content. Use the drop-down to change the connection.
Crawl start date: Specify the starting date for the crawled data. You must use US format mm/dd/yyyy .
Max paging size: Leave the default, or use the drop-down, to specify the number of items that can be queued at any time. BA Insight recommends 20000.
Content Localization: To specify, go to the Microsoft list and scroll down to the Language table.
Max file size Leave the default setting, 50 (MB), or enter the maximum file size to be processed in MB. Any files that are larger than the specified size are not indexed.
Property prefix Make sure that ESC_  property prefix (or any other custom prefix) is entered. (This is the specified property name prefix for each metadata name in your content source system.)