Document Change Extraction
3 requirements
·
10 scenarios
Change Type Classification
JR-document-change-extraction-001
The system SHALL classify document changes into distinct types: partial insertion, partial deletion, move-from, move-to, format change, new paragraph, and deleted paragraph. A single paragraph MAY have multiple concurrent change types.
5 test scenarios
- Format-only change detected JR-document-change-extraction-001.1
- Partial content change detected JR-document-change-extraction-001.2
- New paragraph detected JR-document-change-extraction-001.3
- Deleted paragraph detected JR-document-change-extraction-001.4
- Move operation detected JR-document-change-extraction-001.5
Structured Change Extraction
JR-document-change-extraction-002
The system SHALL extract a structured representation of each changed paragraph containing: a unique paragraph identifier, before and after plain text, the set of applicable change types, a pre-rendered revision snippet, and an optional page number.
3 test scenarios
- Structured data extracted from document comparison JR-document-change-extraction-002.1
- Revision snippet generated for each change JR-document-change-extraction-002.2
- Multiple change types represented in single paragraph JR-document-change-extraction-002.3
Document Version Comparison
JR-document-change-extraction-003
The system SHALL support comparing two document versions to extract all changes between them. Before and after text SHALL be available for each changed paragraph.
2 test scenarios
- Original and revised documents compared successfully JR-document-change-extraction-003.1
- Before and after text extracted for each change JR-document-change-extraction-003.2