How to Create and Configure Your Jive Connector Content Source

Creating a Content Source

The content source Content Sources do the following: Receive data from the Source System via the Connection, Filter the data it receives, Provide the results to the Target, Define the specific search index that contains the content you wish to index (and later search). defines what content is retrieved by your connection Connection defines the how Connectivity Hub connects to your Source System (which contains your documents, graphics, etc.,). Your Connection includes identifying elements such as: URL of the BA Insight web service connector you are using, (File Share connector, SharePoint Online connector, etc.), Authentication mode, User Accounts and Credentials, Database information (for database connectors) and what information is to be made available to the SharePoint search index Contains data from your source system. Your search application instance (Elasticsearch, SharePoint Online, etc.) contains the search indexes. One search index per content source..  

  1. In the connector framework, navigate to the Content tab, click Add New > WebService Content. The New Content form appears.
  2. Select the connection name for your desired Jive connection in the Connection dropdown list. The page may refresh to show fields available for this connection. Complete the fields.
    1. Title: The title of the content source.
    2. Enable Indexing: Adds this content source to your SharePoint SSA.
    3. Crawl Schedules: Can be configured for either Incremental or Full Crawls. This can be modified at any time.
    4. Security Target A Target is a "pointer" to a specific instance of a search application, such as Elasticsearch. - (A Search application instance has one or more indexes): Set to “normal”, unless you are implementing custom security mappings.
    5. Crawl Start Point: A date in mm/dd/yyyy format that identified the start point of a full crawl. Records older than this will not be crawled.
    6. Paging size: Determines the number of items to be enumerated for each request from SharePoint for the crawl enumeration. Leave the default value (20000) unless a specific page size is required.
    7. Content localization: The LCID that identifies the language of the content.
    8. Max File Size: Determines the max size of a file in Mb that can be retrieved for this content source.
  3. Select Content Type: Used to specify certain types of content that should be included (or excluded) from the crawl.
  4. Configure the Item Definition tab.
    1. Item URL:
      1. The URL format to be used for each item.
      2. If left at the default, the URL is read from the item properties while crawling.
    2. Item Grouping:
      1. A crawled property that can be used to identify content from this content source. 
      2. You can configure it to be a static value for all content in this content source, or to use a particular property.
  5. Configure the Advanced Tab
    1. Cache Database:
      1. Identifies which connector cache database should be used for this content source.
      2. For Jive connector, the default is appropriate.
    2. Filter Script:
      1. Enables you to write a VB script to filter certain items out of the crawl.  
      2. This script is executed as each item is crawled.
        1. The return value “true” enables the item to be crawled
        2. The return value “false” removes it from the crawl. (the SharePoint crawl log will show “deleted” for these items.)
    3. ACL Script:
      1. Enables you to override the security settings for each item at crawl time.
      2. By default, the connector reads the security setting assigned to an item from the source system Your Source System is the repository where your data is stored (data to be indexed). This repository is managed by applications such as: - SharePoint O365 - SharePoint 2013/16/19 - Documentum - File Share - OpenText - Lotus Notes - etc. Your Source System repository can also be a database such as SQL or Oracle..
      3. This script can manipulate the security per item.
    4. Purge Mode Only:
      1. Used to remove items from the index.
      2. This is not commonly used and should not be selected in most cases.
    5. Script Library:
      1. Enables you to write a VB Script function that can be used in the various scripting fields used by this connection.
      2. It does not need to be configured for most use cases.
    6. Smart Pipeline Integration:
      1. Used for advanced configurations, and is not necessary for most use cases.