Elastic Adapter
The Elastic Adapter integrates AutoClassifier with Elasticsearch.
Elastic Adapter Capabilities and Limitations
- Capable of pushing additional metadata to Elasticsearch Index items.
- Capable of incremental crawls, but only when specific properties are available in the Elastic Index.
- Elasticsearch Version Supported: v8.5.3 and older
How to Configure the Elastic Adapter
Use the following steps to configure the Elastic Adapter:
- Select the Elastic Adapter Source to configure.
- Complete the fields shown in the graphic below.
- Enter the Server Address.
- This is the Elastic Server Address.
- This is the Elastic Server Address.
- Enter the User Id
- If the Elastic Index is secured, enter the user ID which will be used to access content in the Elastic Index.
- If the Elastic Index is secured, enter the user ID which will be used to access content in the Elastic Index.
- Enter the User Password
- If the Elastic Index is secured, enter the user ID which will be used to access content in the Elastic Index.
- If the Elastic Index is secured, enter the user ID which will be used to access content in the Elastic Index.
- Enter the Timeout (seconds)
- Enter web service timeout for accessing the Elastic Index.
- It should be a value between 5 and 120 seconds.
- Select the Index
- An adapter is capable of crawling a single Elastic Index at a time.
- An adapter is capable of crawling a single Elastic Index at a time.
- Enter the Elastic Query.
- Enter a valid Elastic query which will serve as the bases for what data from the index is crawled.
- For example:
"match_all":{}
- Enter the Elastic Filter
- Add additional Query filters which will be applied to the index.
- Multiple filters can be added separated by ";"
- Enter the Enumeration Scroll Size
- The enumeration process takes advantage of the Elastic Scroll capabilities for enumeration.
- The enumeration process takes advantage of the Elastic Scroll capabilities for enumeration.
- Enter the Crawl Date Field (if available)
- If the Elastic Index has a Date field which represents the modification date/time of each item within the index, then incremental crawls are possible.
- Without a date-time field, you can run only Full Crawls.
- Enter the Crawl Full Start Date (utc)
- If a Crawl Date field is available, then a filter will be applied when enumerating.
Available Elastic Queries
MatchAll
"match_all":{}
Term
"term": {"escbase_fileextension": "doc"}
Terms
"terms": { "escbase_fileextension": ["doc","pdf"] }
Range
"range": { "escbase_lastupdateutc": { "gt": "1990-01-01T10:00:00.000Z"}}
"range": { "escbase_lastupdateutc": { "gte": "1990-01-01T10:00:00.000Z", "lte": "2021-01-01T10:00:00.000Z" }}
Example Configurations
Crawl all indexed data
Crawl all indexed data where lastmodifieddate property is greater than the specified data
Crawl all indexed data where createddate property is between to dates
Crawl all indexed data where fileextension property contains "doc" or "pdf"
Crawl all indexed data where fileextension property contains "doc" and the language property equals "en"
Scaling
The Elastic Adapter can be scaled to increase performance
- Add more Crawl Components per Server
- The Adapter Crawler can also be installed on multiple Servers