I have a large code base and there is lots of repeated, or nearly repeated code all over the place, it’s about as unDRY as code can get, but tracking the ‘duplicates’ is hard, so I was wondering if there are any tools for finding potential DRYable code, something like a diff tool or a Hamming distance analizer, don’t need language specific knowledge or anything like that.
So any clues as too a tool like this?
Duplo (open source) works in C, C++, Java, C# and VB.Net. I tried it once, and it found enough duplicated code to keep me employed for a long time.
I’ve heard of Simian (commercial) but have not tried it.