Components (Features)
- What are Components?
- How to Automatically Annotate and Create Columns (Search Tags)
- How to Capture Metadata
- How to Classify Images
- How to Classify Videos
- How to Clean Up HTML Markup
- How to Configure Components
- How to Set Your Components within Pipelines
- How to Create Redacted Documents
- How to Extract/Add Custom Entities
- How to Extract Biomedical Text with ScispaCy
- How to Extract Data from Raw Binary Files
- How to Extract Document Sections Using Headers
- How to Extract Images From Documents
- How to Extract Information from Images
- How to Extract Regex Values From Properties
- How to Extract Section Information from Documents
- How to Extract Languages
- How to Extract Legal Metadata
- How to Extract Metadata from Email
- How to Extract People, Places, and More from Documents
- How to Extract PubMed Articles and Apply Mesh Terms
- How to Extract Text from Audio Files
- How to Filter Metadata
- How to Gather More (Related) Data
- How to Generate Document Previews
- How to Generate a Summary
- How to Generate SmartHub Best Bets at Crawl Time
- How to Generate SmartHub Question and Answers at Crawl Time
- How to Organize Documents into Thematic Categories
- How to Record Pipeline Data
- How to Remove Special Characters from Metadata Names
- How to Singularize Metadata
- How to Test Document Tags
- How to Test Your Components
- How to Test Your Logic
- How to Use Component Data Caching
- How to Use Custom Logic (Script)
- How to Use Natural Language Processing (NLP)
- How to Use Offline Processing
- How to Use Machine Learning to Detect Entities
- How to Extract Metadata from PACER Documents
- How to Configure the NLQ Metadata Capture Stage
- How to Use Content Enrichment Component
- How to Detect Duplicates