The following is more of a design and implementation problem rather than one that asks you to pump out code.

Question:

How would you go about designing a Windiff of mp3 files?

Would you still rely on binary comparison if I added the goal is to match on the same tunes (i.e. The Fray - How to Save a Life) as opposed to identical files.

Is there a quick and dirty way to do this?

What about a more involved approach? Describe and list out the pros/cons.