Skip to main content

Parser Information

For file-based documents (PDFs, uploaded files, storage-based sources), the Document Statistics page displays information about how the document was parsed.

Parser Information Card

FieldDescription
Parser UsedThe parsing method used to extract text from the document
Quality ScoreA confidence score (0-100%) indicating extraction quality
Quality IssuesAny detected issues with the extraction (if applicable)

Parser Types

ParserDescriptionWhen Used
StandardFast, free text extraction for digital PDFsDigital PDFs with selectable text
AdvancedHigh-accuracy OCR for complex documents (Growth+ plans)Scanned PDFs, images, complex layouts

Quality Score Interpretation

Score RangeMeaningRecommendation
80-100%Excellent extraction qualityNo action needed
60-79%Good extraction with minor issuesReview if accuracy is critical
Below 60%Poor extraction qualityConsider re-syncing with Advanced parser (Growth+ plans)

When Parser Info is Not Available

Parser information may not be available for:

  • Documents synced before this feature was added
  • API-based sources (Confluence, Notion) that don't require file parsing
  • Website sources that use HTML extraction

If parser information is missing for a file-based document, you can click "Re-sync Document" to reprocess it and capture parser metadata.