Skip to main content

Document Change Extraction

3 requirements · 10 scenarios

Change Type Classification

JR-document-change-extraction-001
The system SHALL classify document changes into distinct types: partial insertion, partial deletion, move-from, move-to, format change, new paragraph, and deleted paragraph. A single paragraph MAY have multiple concurrent change types.
5 test scenarios
  • Format-only change detected JR-document-change-extraction-001.1
  • Partial content change detected JR-document-change-extraction-001.2
  • New paragraph detected JR-document-change-extraction-001.3
  • Deleted paragraph detected JR-document-change-extraction-001.4
  • Move operation detected JR-document-change-extraction-001.5

Structured Change Extraction

JR-document-change-extraction-002
The system SHALL extract a structured representation of each changed paragraph containing: a unique paragraph identifier, before and after plain text, the set of applicable change types, a pre-rendered revision snippet, and an optional page number.
3 test scenarios
  • Structured data extracted from document comparison JR-document-change-extraction-002.1
  • Revision snippet generated for each change JR-document-change-extraction-002.2
  • Multiple change types represented in single paragraph JR-document-change-extraction-002.3

Document Version Comparison

JR-document-change-extraction-003
The system SHALL support comparing two document versions to extract all changes between them. Before and after text SHALL be available for each changed paragraph.
2 test scenarios
  • Original and revised documents compared successfully JR-document-change-extraction-003.1
  • Before and after text extracted for each change JR-document-change-extraction-003.2