How to Extract Metadata from PACER Documents
- About the PACER Metadata Extractor Component
- How to Add the PACER Metadata Extractor to AutoClassifier
- How to Configure the PACER Metadata Extractor Component
About the PACER Metadata Extractor Component
The PACER Metadata Extractor pipeline stage extracts legal metadata, such as the case name, case ID, court name, and publish date, from PACER documents.
How to Add the PACER Metadata Extractor to AutoClassifier
Prerequisites: This component requires the Tika Extractor component.
Use the following steps to add the Tika Extractor component and the PACER Metadata Extractor component to an AutoClassifier pipeline.
- Navigate to the AutoClassifier Pipelines component page.
- Click New Component and select Tika Extractor from the component list.
- Name your new Tika Extractor component and click Add.
- Select PACER Metadata Extractor from the component list.
- Name your new PACER Metadata Extractor component and click Add.
- Click Apply to save your changes.
- Ensure your new Tika Extractor and PACER Metadata Extractor components are placed in the list of existing pipeline stages.
How to Configure the PACER Metadata Extractor Component
Prerequisites: The Tika Extractor Component must be configured.
- Open your Tika Extractor component.
- Check Extract Body and Extract Metadata.
- Click Apply then Cancel.
- Open your PACER Metadata Extractor component.
- Enable Court Listener Mappings: Controls whether the output contains the court ID and court name equivalents used by the CourtListener API. If this setting is enabled, PacerCourtId, PacerCourtName, CourtListenerCourtId, and CourtListenerCourtName are displayed instead of CourtName.
- Enable Extracting Judge Name: Controls whether the output contains the judge name. If this setting is enabled, the Judge property is displayed.
- Extract Judge Names Regex Pattern: The regular expression that matches the words that come before the judge name. Example: hon\.|honorable|district judge
- Click Apply then Cancel.
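To illustrate how a prefix pattern such as the example above might behave, here is a minimal Python sketch. It is not the component's implementation: the function name find_judge_name, the case-insensitive matching of the prefix, and the rule that the name is the run of capitalized words following the prefix are all assumptions made for illustration.

```python
import re

# Prefix pattern copied from the example setting value.
PREFIX_PATTERN = r"hon\.|honorable|district judge"

# Assumption: the prefix is matched case-insensitively, and the judge name is
# the run of capitalized words (initials allowed) that follows it.
NAME_AFTER_PREFIX = re.compile(
    rf"(?i:{PREFIX_PATTERN})\s+"
    r"([A-Z][A-Za-z.'-]*(?:\s+[A-Z][A-Za-z.'-]*)*)"
)


def find_judge_name(text):
    """Return the first name found after a prefix, or None (hypothetical helper)."""
    match = NAME_AFTER_PREFIX.search(text)
    return match.group(1) if match else None


print(find_judge_name("Before the Honorable Jane Q. Doe, presiding"))
print(find_judge_name("United States District Judge Alice Brown"))
```

Testing a candidate pattern this way against a few sample documents can help catch prefixes that are missing from the alternation before the setting is applied.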
Output Properties
Property                  Type
Type                      Text
PublishDate               Text
DocumentDisplayNumber     Text
CourtName                 Text
Cost                      Text
CaseName                  Text
CaseId                    Text
PacerCourtId*             Text
PacerCourtName*           Text
CourtListenerCourtId*     Text
CourtListenerCourtName*   Text
Judge**                   Text
*Note: The PacerCourtId, PacerCourtName, CourtListenerCourtId, and CourtListenerCourtName metadata properties are returned only if Enable Court Listener Mappings is set to True. In that case, CourtName is not returned.
**Note: The Judge metadata property is returned only if Enable Extracting Judge Name is set to True.
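The two notes above can be summarized as a small Python sketch that lists which property names to expect for a given pair of settings. The property names come from this page; the function expected_properties and its parameter names are hypothetical and are not part of AutoClassifier.

```python
# Properties listed on this page without a footnote, other than CourtName.
ALWAYS_RETURNED = [
    "Type", "PublishDate", "DocumentDisplayNumber",
    "Cost", "CaseName", "CaseId",
]

# Properties returned in place of CourtName when Court Listener mappings are on.
COURT_LISTENER_PROPERTIES = [
    "PacerCourtId", "PacerCourtName",
    "CourtListenerCourtId", "CourtListenerCourtName",
]


def expected_properties(court_listener_mappings, extract_judge_name):
    """Return the property names expected for the given settings (hypothetical helper)."""
    names = list(ALWAYS_RETURNED)
    if court_listener_mappings:
        names.extend(COURT_LISTENER_PROPERTIES)
    else:
        names.append("CourtName")
    if extract_judge_name:
        names.append("Judge")
    return names


print(expected_properties(court_listener_mappings=True, extract_judge_name=False))
```

A check like this can be useful when validating downstream mappings, since CourtName and the four CourtListener-related properties never appear together.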