INTERACT FORUM

More => Old Versions => Media Center 11 (Development Ended) => Topic started by: alias on April 08, 2004, 04:30:52 am

Title: Duplicates Question
Post by: alias on April 08, 2004, 04:30:52 am
Is it posible to find partial duplicates?

I have a lot of singles with various mixes of the same song, on top of this I have many compilations with posible duplicates.

Track Names are stored as 'Song (Mix)', so is it posible do partial matches but not have to write it in such a way that you have to create a list for each song, i.e. [Song"?

Thanks in advance.
alias
Title: Re:Duplicates Question
Post by: jleerigby on April 08, 2004, 07:45:40 am
Kingsparta's plug in called MD5 master can do this.  It can create a 'texthash' based on some rules you give it e.g. you could tell it to look at just the first 5 characters of the artist and track name and ignore certain characters.  I tell it to ignore stuff like:

& And Live Feat. FT. Featuring etc

It can also be told to ignore stuff in brackets.

e.g. Status Quo - Caroline (Album version) becomes
'StatuCarol'. Status Quo - Caroline Live also becomes 'StatuCarol'.

This way 2 files that match these rules will get the same 'texthash' (just a load of meaningless numbers and letters).  All you need to do then is manually review duplicates found by MC in the texthash field.

Check out www.spartasoft.com.
Title: Re:Duplicates Question
Post by: drosoph on April 08, 2004, 07:54:02 am
I agree, MD5Master is the best way to go .. I was going to suggest using the BinHash to sample the song ... there are three hashes to work with ...  You'll be sure to catch the dup one way or another!

Title: Re:Duplicates Question
Post by: alias on April 09, 2004, 09:02:15 pm
Thanks guys I shall give it a try.

Cheers