This is to be done in C++ or C….
I know we can read the MP3s’ meta data, but that information can be changed by anyone, can’t it?
So is there a way to analyze a file’s contents and compare it against another file and determine if it is in fact the same song?
edit
Lots of interesting things coming out that I hadn’t thought of. Not at all a good idea to attempt this.
It’s possible, but very hard.
Even the same original recording may well be encoded differently by different MP3 encoders or the same encoder with different settings… leading to different results when the MP3 is then decoded. You’d need to work out an aural model to “understand” how big the differences are, and make a judgement.
Then there’s the matter of different recordings. If I sing “Once in Royal David’s City” and Aled Jones sings it, are those the same song? What if there are two different versions of a song where one has slightly modified lyrics? The key could be different, it could be in a different vocal range – all kinds of things.
How different can two songs be but still count as “the same song”? Once you’ve decided that, then there’s the small matter of implementing it 😉