How to Improve Search Results Accuracy
Use datasets to improve search results accuracy by adding the metadata from one content source to another content source.
-
Datasets are a type of content source that have limited functionality when used alone.
-
However, you can use datasets to enrich indexed content with metadata from another, associated content source.
-
-
Each dataset is a secondary content source.
-
A single content source can have multiple datasets where each dataset has its own set of metadata.
-
You can specify metadata (custom or from the dataset columns) for your datasets using the Dataset Metadata page that is similar to the Content Metadata page.
Improve Datasets to Improve Search Results
For example, you might index employee profile content.
-
To perform this indexing operation, you link the employee profile master record to the second dataset using a common
employeeID
field. In this example:-
The second source contains the employee status which might be
active
,on leave
, and so on. -
By joining this employee status record (as a dataset) with the master record dataset, your indexed record is enriched with its properties.
-
-
You can also associate one or more dataset members with a content source column.
-
You might specify
DS_
as the prefix for your dataset metadata in order to track this metadata with the content metadata (ESC_
) in the index. -
The dataset metadata, like the content metadata, is appended to the item that is indexed.
All the metadata, regardless of whether the metadata came from the content source or from the dataset, is treated equivalently.
Map the Data in Your Content Source to a Database Column to Index Related Metadata
- Content Sources > <your Web Service Connector> > > Edit > Dataset Mappings. The Dataset Mappings page displays.
- Click Add dataset mapping.
-
Parameter: Enter, or leave the default [ID].
- Value:
- template: These settings are the default settings and are shown below
- script: Enter a script using Scripts page.
- Pick column: Click and select a database column. See the following example of possible selections.
- Default value: Enter a value, such as
0
, to be used if no value is found.
Note: Unless you specify a prefix that is different from the prefix ESC_
used for content sources, there is no way to track the source of your metadata.
Dataset changes are not propagated during a recrawl operation unless the document in the content source changes.