This page contains a step by step sample to show how to use the Diff class for comparing the characters of 2 strings.
For more information about Diff class see the main page.
This page can be used with 2 URL parameters (a and b) to test this implementation:
Current parameters:
Text a = "Default Text For Line A." Text b = "Default textline for line B."
1. Parameter Preparation
Before the algorithm can be used the 2 input string must be converted into the datatype that is used by the algorithm: a int Array.
Because we compare on a character basis this task is very easy to complete by using the character code of each char. This is done by the DiffCharCodes:
private static int[] DiffCharCodes(string aText, bool ignoreCase) {
int[] Codes;
if (ignoreCase)
aText = aText.ToUpperInvariant();
Codes = new int[aText.Length];
for (int n = 0; n < aText.Length; n++)
Codes[n] = (int)aText[n];
return (Codes);
} // DiffCharCodes
The codes for the 2 textlines are:
a_codes = 44 65 66 61 75 6c 74 20 54 65 78 74 20 46 6f 72 20 4c 69 6e 65 20 41 2e b_codes = 44 65 66 61 75 6c 74 20 74 65 78 74 6c 69 6e 65 20 66 6f 72 20 6c 69 6e 65 20 42 2e
2. Calling the Diff Algorithm
The main entry point for the Algorithm is the LCS function that can take 2 int[] parameters and will return an array Diff.Item structures that are describing the difference details as identical, inserted or deleted subarrays.
Diff.Item[] diffs = Diff.DiffInt(a_codes, b_codes);
Here is a dump of the actual content of this structure:
The diff result has 5items.
StartA=8, StartB=8, deletedA=1, insertedB=1
StartA=12, StartB=12, deletedA=0, insertedB=4
StartA=13, StartB=17, deletedA=1, insertedB=1
StartA=17, StartB=21, deletedA=1, insertedB=1
StartA=22, StartB=26, deletedA=1, insertedB=1
3. Formatting the result
Now we can use the original data together with the result items and generate a intuitive readable form of the result:
int pos = 0;
for (int n = 0; n < diffs.Length; n++) {
Diff.Item it = diffs[n];
// write unchanged chars
while ((pos < it.StartB) && (pos < b_line.Length)) {
this.Response.Write(b_line[pos]);
pos++;
} // while
// write deleted chars
if (it.deletedA > 0) {
this.Response.Write("<span class='cd'>");
for (int m = 0; m < it.deletedA; m++) {
this.Response.Write(a_line[it.StartA + m]);
} // for
this.Response.Write("</span>");
}
// write inserted chars
if (pos < it.StartB + it.insertedB) {
this.Response.Write("<span class='ci'>");
while (pos < it.StartB + it.insertedB) {
this.Response.Write(b_line[pos]);
pos++;
} // while
this.Response.Write("</span>");
} // if
} // while
// write rest of unchanged chars
while (pos < b_line.Length) {
this.Response.Write(b_line[pos]);
pos++;
} // while
And here is the formatted result:
Default Ttextline Ffor Lline AB.
Here are some alternative test-cases:
a=This is a big thing.&b=This is a small thing.
a=identical&b=different