Add Metadata from External Content
Why Use Datasets?
Use datasets to add extra metadata from another table using BA Insight Connectors such as SQL and Salesforce.
- For example, you might use your SQL Connector to crawl a table of information about documents; metadata such as the
author
ortitle
of the document, where related information is in another table in the database. - To return the related information, you create a database connection to this table using your dataset connection and a
where
clause. - This operation enables you to add a dataset to the content source so that when data such as "author" name is related to other relevant information (such as "full name"), it can be seen in search results.
How to Add Your Connection
The following steps use the SQL Connector content and specify the Title, Type, and Connection ID from the content sources.
We add the dataset that looks at the connection; the title and type of the connection for the specified content source is also returned as extra information from the item linked to your item.
- Click Connections > New > SQL connection.
- The Connection Info tab appears.
- Click the General Settings tab.
Basic Mode
- Database Server Address (required): Enter the IP address for your database.
- Database Name (required): Enter the name of this database.
- Connection TImeout: Time in seconds between timeouts.
- Authentication mode:
- Use Service Account: Uses your local machine Service Account. Set to "Use Service Account" by default.
- Specify User Account: Enter your SQL login and password in the fields that appear.
Advanced Mode
Setting the Connection Mode to "Advanced" reveals the Connection String field.
All other fields are the same as "Basic" mode.
- Connection String (required): Enter your SQL database connection details using SQL string syntax.
Security: Directory Tab
- Click the Security Directory tab.
- Group expansion threshold: Specify the Group expansion threshold.
- Create security groups: Checking this box creates missing security groups in the target directory when you run the Security Sync job.
- Click Save to see your new SQL Connection listed in the Connections page.
Security: Scripts Tab
-
Users load query: SQL statement that returns all the users in the database.
-
Required columns: SPW_USERID, SPW_USERNAME
-
Optional columns: SPW_ADID, SPW_ACTIVE
-
-
Groups load query: SQL statement that returns all the groups and roles in the database.
-
Required columns: SPW_GROUPID, SPW_GROUPNAME
-
Optional columns: SPW_ADID, SPW_ACTIVE
-
-
Group users load query: SQL statement that returns a list of unique user IDs within a specified group ID.
- Required column: SPW_USERID
- Required parameter: [SPW_GROUPID]
-
Group hierarchy load query: SQL statement that returns a list of unique group IDs within a specified group ID.
-
Required column: SPW_GROUPID
-
Required parameter: [SPW_GROUPID]
-
-
User load script: VB.Net script that is executed by Security Sync task, once per user.
-
Can be used to return the mapped user from target directory.
-
Script must return a user received from any of the "TargetDirectory.GetUserBy-" methods, which are documented for each supported target directory type.
-
Parameters needed for the mapping are coming from the connector and can be retrieved by "Host.Get-" methods.
-
If the user is not found in the target directory, it is invalid and mapping will be retried in the next Security Sync task again.
-
- Group load script: VB.Net script that is executed by Security Sync task, once per group.
- Can be used to return the mapped group from target directory.
- Script must return a group received from any of the "TargetDirectory.GetUserBy-" methods, which are documented for each supported target directory type.
- Parameters needed for the mapping are coming from the connector and can be retrieved by "Host.Get-" methods.
- Groups found in target directory will be static, while other ones will be native.
How to Add the Dataset Connection
- Click Datasets > Dataset connections > New > SQL Dataset Connection.
- The SQL Dataset Connection page appears.
- In the Connection Mode field, select Basic or Advanced from the drop-down list.
- If you selected a Basic connection mode, Enter the following information:
- Title (required): Enter a unique, descriptive name.
- SQL Server (required): Enter the IP address of your database server.
- Database Name (required): Enter a name.
- Authentication Mode (required): Set to "Use Service Account" by default.
- If you selected an Advanced connection mode, Enter the following information:
- Title (required): Enter a unique, descriptive name.
- Connection String(required): Enter your SQL database connection details using SQL string syntax.
- Authentication Mode (required): Set to "Use Service Account" by default.
Due to a vulnerability fix on the SQL Client, the default encryption flag for SQL Connections has been changed to true. As a result, you must specify Encrypt=false in your connection string if you do not support encryption in your database. For example,Server=myServerAddress;Database=myDataBase;User Id=myUsername;Password=myPassword;Encrypt=false;
- Click Save.
Note: SQL Database connections are supported only for SQL Server data providers.
How to Add Your Dataset Contents
General Settings
- Click Datasets > Dataset Contents > New > SQL Dataset.
- The Edit SQL Dataset page appears.
- Title:
- Required
- Enter a unique, descriptive title.
- Connection:
- Required
- Enter the name that you specified in the SQL Dataset Connection page above.
- SQL Query
- Required
- Using the information associated with this field, enter a valid SQL query statement that specifies which fields should be located in the second table.
- This statement interrogates the table for the connection information.
- SQL Filter Clause
- Required
- Using the information associated with this field, enter a valid SQL filtering condition. For example:
- Bracketed
[]
names for replacement purposes. For example, specify[ID],'P2'.
- Typed parameters to improve performance, such as
@S_PARAMNAME
for strings,@I_PARAMNAME
for integers wherePARAMNAME
can be any name.
- Bracketed
Parameter names must be written in uppercase. Parameter names that are written in lowercase, such as@i_paramname
, will not be recognized. - Property Prefix (required): Property name prefix. Set to "ESC_" by default.
- Title:
Metadata Settings
- Select the Metadata Settings tab.
- Active
- Enabled by default.
- When enabled, any metadata added to this content source will be crawled and indexed.
- All other settings on this page are described in the interface.
- Active
- Click Save to save your changes.
- Click Datasets> <name of your newly-added SQL database> > > Metadata: See the metadata fields.
Advanced Settings
- Select the Advanced Settings tab.
- Enable caching: When enabled, SQL Dataset content is cached for increased performance. Disabled by default.
- Cache Timeout: Time out in seconds inbetween cache refreshes.
- Cache Limit: The number of items that are stored in the cache (5-5000).
- Script Library: VBScript functions to be used in all other scripts.
How to Create a Content Source
Use the Content Sources page to create a new SQL or web service connector. The following steps detail how to create a new SQL connector.
- Navigate to Content Sources.
- Click New... > SQL Content from the drop-down menu.
- The Edit Database Content page appears.
- In the Edit Database Content page, enter the following information:
- Connection: Specify your connection.
- Title: Click and select the title.
- Click Item Enumeration. The Item Enumeration page appears. Enter the following information:
- Enumeration Query: You can copy-paste the example below this field and edit this code.
- Test/Load metadata: Click to confirm that you code is correct.
- Item extension: Enter an extension such as [SPW_EXTENSION].
- Click the Item Definition tab.
- In the Item Query field, copy-paste the example shown in the UI below this field and edit the code as appropriate to your environment and needs.
- Scroll down to see more fields.
- Enter your information as appropriate in the following fields:
- Test/Load metadata: Click to confirm that you code is correct.
- Item Url: Enter a URL such as www.google.com.
- Item Title: Go to the "Pick column" drop-down menu on the far right and click an item such as Title.
- Click the Dataset Mappings tab.
Click Add dataset mapping: Select your mapping from this drop-down menu.
- Under the Actions column, click > Edit. See the Edit Dataset Mapping pop-up.
Enter the following information:- Parameter: You must link this parameter to one of the items in the content source.
- Pick column: Select the column to map to.
- Save: Click Save.
How to View the Added Metadata
- Navigate to the Content Sources page.
- Click <your content source> > > Edit > Metadata.
- Click Generate.