Difference between revisions of "User:Francis Tyers/MT"
Line 4: | Line 4: | ||
authors report that this is an improvement over the more traditional strategies. The two traditional |
authors report that this is an improvement over the more traditional strategies. The two traditional |
||
strategies tested against were longest common subsequence (LCS), as used by GNU diff, and shortest |
strategies tested against were longest common subsequence (LCS), as used by GNU diff, and shortest |
||
edit distance, as used by XMLDiff and similar programs. |
edit distance, as used by XMLDiff and similar programs. The authors state that the deficiency in these |
||
methods lies in the way that they do not represent changes as made by authors. Rather they try to make |
|||
the ``smallest'' possible edit script or diff. |
|||
The authors present their method of ``structure preserving difference'', which instead of trying to find |
|||
the smallest possible edit script, attempts to maximise the structures maintained in changing one |
|||
document to another. |
Revision as of 22:19, 1 April 2008
- Structure-Preserving Difference Search for XML Documents
The paper presents a strategy for measuring the difference between a pair of documents in XML. The authors report that this is an improvement over the more traditional strategies. The two traditional strategies tested against were longest common subsequence (LCS), as used by GNU diff, and shortest edit distance, as used by XMLDiff and similar programs. The authors state that the deficiency in these methods lies in the way that they do not represent changes as made by authors. Rather they try to make the ``smallest possible edit script or diff.
The authors present their method of ``structure preserving difference, which instead of trying to find the smallest possible edit script, attempts to maximise the structures maintained in changing one document to another.