So I have to write a duplicate checker to compare two XMLs and see


Asked: May 20, 2026


So I have to write a “duplicate checker” to compare two XMLs and see if they are the same (contain the same data). Because they come from the same class and are generated from an XSD, the structure and the order of the elements inside will most likely be the same.

The best way I can think of doing the duplicate check is to set up two dictionaries (dictLeft, dictRight), using xpath#value as the key and the number of times it occurs as the value. Something like this:

Left:

{ 'my/path/to/name#greg': 1, 'my/path/to/name#john': 2, 'my/path/to/car#toyota': 1}

Right:

{ 'my/path/to/name#greg': 1, 'my/path/to/name#bill': 1, 'my/path/to/car#toyota': 1}

Comparing these two dictionaries will give me a fairly accurate indication of whether these two XMLs are the same (there is the odd chance that I may get a false result, but it is very remote).
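For illustration, here is a minimal sketch of that dictionary idea using ElementTree and collections.Counter. The function names path_value_counts and same_data are made up for this example, and it assumes that only leaf elements carry data and that surrounding whitespace in text does not matter.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def path_value_counts(xml_text):
    """Count every 'path#value' pair for leaf elements, ignoring attributes."""
    counts = Counter()

    def walk(element, path):
        children = list(element)
        if not children:
            # Leaf element: record its path together with its trimmed text.
            value = (element.text or "").strip()
            counts[f"{path}#{value}"] += 1
        for child in children:
            walk(child, f"{path}/{child.tag}")

    root = ET.fromstring(xml_text)
    walk(root, root.tag)
    return counts


def same_data(left_xml, right_xml):
    # Counter comparison ignores insertion order, so sibling order is irrelevant.
    return path_value_counts(left_xml) == path_value_counts(right_xml)
```

The "odd chance" of a false result shows up when values move between repeated parents: two person elements that swap surnames produce identical counts, because the key records only the tag path and not which parent instance the value belonged to.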

Does anyone else have a better idea? Maybe a function in ElementTree that I do not know about?

EDIT: To better explain:

<root><person><name>Bob</name><surname>marley</surname></person></root>

and

<root><person><surname>marley</surname><name>Bob</name></person></root>

would be considered the same. I am ignoring attributes. The idea is to keep the code as simple as possible while not hampering performance too much.


1 Answer

Editorial Team · Answer 1 · 2026-05-20T17:47:23+00:00

OK, so I had to make a decision and went with this:

foreach path in xpathlist
  find entries for path for both xml1 and xml2
  foreach entry in xmlentries1
    dict1[path#entry.value]++
  foreach entry in xmlentries2
    dict2[path#entry.value]++

  if dict1 and dict2 are not equal
    return false
return true

I hope this makes sense. This allows me to test for specific/all xpaths. If someone has a better algorithm, I’m all ears 🙂
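In Python, the pseudocode above might look roughly like the sketch below. The names values_at and is_duplicate are hypothetical, and the paths are assumed to use ElementTree's limited XPath syntax, relative to the root element. It keeps one counter per path instead of one shared dictionary keyed by path#value, which should amount to the same comparison.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def values_at(root, path):
    """Count the trimmed text values of all elements matching path."""
    return Counter((el.text or "").strip() for el in root.findall(path))


def is_duplicate(xml1, xml2, xpath_list):
    """Return True if both documents hold the same values at every path."""
    root1 = ET.fromstring(xml1)
    root2 = ET.fromstring(xml2)
    for path in xpath_list:
        # Stop at the first path whose value counts differ.
        if values_at(root1, path) != values_at(root2, path):
            return False
    return True
```

Called with a path list such as ["person/name", "person/surname"], this treats two documents that differ only in the order of name and surname as duplicates, and it ignores attributes because only element text is read.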
