High Impact Factor : 4.396 icon | Submit Manuscript Online icon |

Xml Multidup: A Probabilistic Duplicate Detection Method for Hierarchical Multimedia Data


K.Priyangaa , Trinity College for Women, Namakkal.; K.Valarmathi, Trinity College for Women, Namakkal.


XMLDup, Bayesian Network, Duplicate


Even though present is a long line of work on identify duplicate in relational data, only a few solutions focus on duplicate detection in more composite hierarchical structures, like XML data. In this article, we there a novel method for XML replacement detection, called XML Dup. XML Dup uses a Bayesian network to decide the probability of two XML elements being duplicate, allowing for not only the information surrounded by the elements, but also the way that information is structured. In addition, to get better the good organization of the network evaluation, a novel pruning policy, capable of major gains over the un optimized version of the algorithm, is presented. Through experiment, we show that our algorithm is able to accomplish high exactitude and remember scores in quite a lot of datasets. XMLDup is also able to do better than another state of the art replacement detection solution, both in terms of competence and of usefulness.

Other Details

Paper ID: IJSRDV4I80012
Published in: Volume : 4, Issue : 8
Publication Date: 01/11/2016
Page(s): 157-160

Article Preview

Download Article