PDF

The Pdf tag displays a PDF document in the labeling interface. Use this tag to perform document-level annotations, transcription, and summarization.

Use with the following data types: PDF.

Parameters

Param    Type      Description
value    string    Data field value containing the URL to the PDF

Supported control tags

The <Pdf> tag is intended for document-level classification tasks and does not support applying annotations to the actual content of the PDF.

For example, if you want to apply labels for OCR tasks, you will need to convert the PDF into images first. For more information, see Multi-Page Document Annotation.

You can use the following control tags:

Enterprise

You can also use the Pdf tag with Prompts to perform auto-labeling tasks such as PDF summarization, classification, information extraction, and document intelligence.

Example

Labeling configuration to label PDF documents:

<View>
  <Pdf name="pdf" value="$pdf" />
  <Choices name="choices" toName="pdf">
    <Choice value="Legal" />
    <Choice value="Financial" />
    <Choice value="Technical" />
  </Choices>
</View>
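
The configuration above is wired together by name: the control tag's toName must match the Pdf tag's name, and value="$pdf" refers to the pdf field of each task. As a rough sanity check before loading a configuration, that wiring can be verified with Python's standard library. This is a minimal sketch under stated assumptions, not part of the product: the helper name check_pdf_config is hypothetical, and it assumes a configuration with a single Pdf tag.

```python
import xml.etree.ElementTree as ET

# The labeling configuration from the example above.
CONFIG = """
<View>
  <Pdf name="pdf" value="$pdf" />
  <Choices name="choices" toName="pdf">
    <Choice value="Legal" />
    <Choice value="Financial" />
    <Choice value="Technical" />
  </Choices>
</View>
"""


def check_pdf_config(config_xml):
    """Return (data_key, problems) for a config with one Pdf tag.

    Hypothetical helper for illustration only; not a product API.
    """
    root = ET.fromstring(config_xml)
    pdf = root.find(".//Pdf")
    if pdf is None:
        return None, ["no <Pdf> tag found"]

    problems = []

    # value="$pdf" means: read the URL from the task's "pdf" field.
    value = pdf.get("value", "")
    if value.startswith("$"):
        data_key = value[1:]
    else:
        data_key = None
        problems.append("Pdf value should reference a data field, e.g. $pdf")

    # Every tag that declares toName should point at the Pdf tag's name.
    pdf_name = pdf.get("name")
    for el in root.iter():
        to_name = el.get("toName")
        if to_name is not None and to_name != pdf_name:
            problems.append(f"<{el.tag}> points at '{to_name}', not '{pdf_name}'")

    return data_key, problems


data_key, problems = check_pdf_config(CONFIG)
print(data_key, problems)
```

An empty problem list means the control tags and the Pdf tag agree on names; the returned key is the field each task is expected to provide.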

Example input data:

{
  "pdf": "https://app.humansignal.com/static/samples/sample.pdf"
}
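
When there are many documents, input objects of this shape can be generated rather than written by hand. The following is a minimal sketch, assuming each task is a JSON object whose pdf key matches the $pdf reference in the labeling configuration, and that your import path accepts a JSON list of such objects. The function name build_pdf_tasks and the example.com URL are hypothetical placeholders.

```python
import json


def build_pdf_tasks(urls, field="pdf"):
    """Wrap each PDF URL in a task object keyed by the config's data field.

    Hypothetical helper for illustration; `field` must match the name used
    after the `$` in the Pdf tag's value attribute.
    """
    return [{field: url} for url in urls]


urls = [
    "https://app.humansignal.com/static/samples/sample.pdf",
    "https://example.com/docs/contract.pdf",  # placeholder URL, not a real file
]

tasks = build_pdf_tasks(urls)

# Serialize as a JSON list, one object per PDF.
print(json.dumps(tasks, indent=2))
```

Keeping the field name as a parameter means the same helper still works if the configuration uses a different data field than pdf.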