INTERACT FORUM


Author Topic: Smartlist > Code For Finding ByArtist Dups  (Read 1744 times)

mikebinz

  • Recent member
  • *
  • Posts: 6
Smartlist > Code For Finding ByArtist Dups
« on: August 03, 2012, 06:49:56 pm »


One of my Audio Databases is arranged By Artist: Singles are stored directly in the Artist's Folder, and Albums in a Sub-Folder of that same Artist's Folder.

i.e.
         
   Singles         
   ...\ [Artist] \ [Artist] - [Name].mp3      

   Albums         
    ...\ [Artist] \ [Album] \ [Track #]_[Name].mp3      
         
I have attached Screenshots of one such Artist Folder         
      
NOTE that Single and Album file names use different formats.


Could anyone tell me the Code to create a Smart List of Singles that are:

   Duplicated on any Album
 AND
   the Same File Size as the Album copy      
         
The List SHOULD NOT contain Duplicates from Different Albums, ONLY Singles that also exist on an Album         


Payment in Beer or Cash will be considered!

MrC

  • Citizen of the Universe
  • *****
  • Posts: 10462
  • Your life is short. Give me your money.
Re: Smartlist > Code For Finding ByArtist Dups
« Reply #1 on: August 03, 2012, 07:05:03 pm »

How would this work in your case?

I see a No More Mr. Nice Guy entry, and a School's Out... entry, each with different titles and different file sizes.

You'd need Fuzzy Matching and audio fingerprint comparisons.
The opinions I express represent my own folly.

mikebinz

  • Recent member
  • *
  • Posts: 6
Re: Smartlist > Code For Finding ByArtist Dups
« Reply #2 on: August 03, 2012, 07:12:42 pm »


I don't require that level of complexity; the two examples you pointed out WOULDN'T appear in the Smartlist because they don't meet the criteria.


MrC

  • Citizen of the Universe
  • *****
  • Posts: 10462
  • Your life is short. Give me your money.
Re: Smartlist > Code For Finding ByArtist Dups
« Reply #3 on: August 03, 2012, 07:14:06 pm »

I wasn't very clear.  Duplicate detection in smartlists is done via metadata matching.  If the metadata varies, it is not possible to detect a duplicate reliably.

mikebinz

  • Recent member
  • *
  • Posts: 6
Re: Smartlist > Code For Finding ByArtist Dups
« Reply #4 on: August 03, 2012, 11:24:23 pm »


Thanks for your replies MrC, and my apologies for not supplying a very good example; there are no Duplicates that meet the requirements in it

There is no problem with the Duplicate Matching itself;

~dup=[Name],[Artist]

works fine, but my Database is about 35,000 tracks and I don't want to list Duplicate ALBUM Tracks, just Duplicate SINGLES

Starting with the above code, I'm after something that will discriminate further, probably on the basis of the Duplicate's File Path
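
For anyone who wants to check the idea outside Media Center, the selection asked for in this thread can be sketched in Python. This is only an illustration under stated assumptions, not smartlist syntax: the `Track` record, the `is_single` path test and `duplicated_singles` are hypothetical names, and the path test assumes the folder layout from the first post (a single's parent folder is named after the artist).

```python
from collections import defaultdict
from pathlib import PureWindowsPath
from typing import NamedTuple


class Track(NamedTuple):
    """Hypothetical record holding the fields the thread talks about."""
    artist: str
    name: str
    size: int   # file size in bytes
    path: str   # full Windows path to the file


def is_single(track: Track) -> bool:
    # Assumption: singles sit directly in the artist's folder, so the parent
    # folder name equals the artist. A self-titled album folder would be
    # misread as a single by this test.
    parent = PureWindowsPath(track.path).parent.name
    return parent.casefold() == track.artist.casefold()


def duplicated_singles(tracks):
    # Group by (artist, name), the same key as ~dup=[Name],[Artist].
    groups = defaultdict(list)
    for t in tracks:
        groups[(t.artist.casefold(), t.name.casefold())].append(t)

    result = []
    for group in groups.values():
        # Sizes of the album copies in this duplicate group.
        album_sizes = {t.size for t in group if not is_single(t)}
        # Keep only singles whose size matches some album copy.
        result.extend(t for t in group if is_single(t) and t.size in album_sizes)
    return result
```

Duplicates that exist only across different albums never reach the result, because nothing in such a group passes `is_single`.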


rick.ca

  • Citizen of the Universe
  • *****
  • Posts: 3729
Re: Smartlist > Code For Finding ByArtist Dups
« Reply #5 on: August 04, 2012, 02:50:55 am »

Assuming [Artist] and [Name] are consistent and reliable, you'll have no difficulty identifying duplicates. If that data is not reliable, you're unlikely to improve the situation by comparing [File Size] (which varies with duration, bitrate and compression). [Duration] would be better, but it's still difficult to compare things in a smartlist that aren't exactly equal. That could be done, but you'll still not be sure if one file is an original studio recording and the other an otherwise identical live recording or re-recording.
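
To make the "not exactly equal" point concrete, here is a minimal Python sketch of a tolerance comparison on duration. It is not a Media Center expression; the function name and the 2-second default tolerance are assumptions chosen purely for illustration.

```python
def durations_match(a_seconds: float, b_seconds: float, tolerance: float = 2.0) -> bool:
    # Treat two durations as equal when they differ by no more than
    # `tolerance` seconds (the 2.0 default is an arbitrary assumption).
    return abs(a_seconds - b_seconds) <= tolerance
```

Even with a match here, the caveat above still applies: a live take and a studio take can have nearly the same length.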

I suggest you start with a simple smartlist that just selects all duplicates (i.e., all files for which [Artist]-[Name] is not unique). You can review a file list of that grouped by Artist and sorted in a way that helps show the situation. Part of that will be a clear indication of whether the track is a single or a member of an album. You haven't provided enough information to suggest the best way to do that. If for singles [Album] is empty or 'Singles', then it's easy. If that's not a reliable indicator, maybe the [Filename (name)] is. If that contains an '_' and (maybe) not a ' - ', it's from an album. Or test [Filename (path)] for the existence of an album folder.
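
The file name test can be sketched in Python as follows. This is a rough illustration of the heuristic just described, not a Media Center expression; `looks_like_album_track` is a hypothetical helper, and it assumes the naming scheme from the first post ([Track #]_[Name] for album tracks, [Artist] - [Name] for singles).

```python
from pathlib import PureWindowsPath


def looks_like_album_track(path: str) -> bool:
    # File name without folder or extension, e.g. "01_Song A".
    stem = PureWindowsPath(path).stem
    # Heuristic from the post: an underscore and no " - " separator
    # suggests an album track; anything else is treated as a single.
    return "_" in stem and " - " not in stem
```

A title that itself contains an underscore or ' - ' would fool this test, so it is worth spot-checking the results before relying on it.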

For the sake of clarity, let's assume you've created a boolean expression field that by some reliable means is set to '1' for album tracks and is '0' for singles. If you're still not confident enough to delete all singles that are apparently duplicates, you can still use this in sorting them (e.g., the single appears first, and if there happens to be more than one album duplicate, they're sorted by duration). This will make them easy to assess visually for the decision to delete. If there are a lot of them and it seems clear they're all 'valid' duplicates, just select all the singles and delete them.
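
The review ordering described above can be sketched in Python like this. The `Row` record and `review_order` function are hypothetical; `is_album` stands in for the assumed boolean field (0 for singles, 1 for album tracks).

```python
from typing import NamedTuple


class Row(NamedTuple):
    """Hypothetical row in the duplicates list."""
    artist: str
    name: str
    is_album: int    # 0 = single, 1 = album track (the assumed flag field)
    duration: float  # seconds


def review_order(rows):
    # Group duplicates together by artist and name, put the single (0)
    # ahead of album copies (1), then order album copies by duration.
    return sorted(
        rows,
        key=lambda r: (r.artist.casefold(), r.name.casefold(), r.is_album, r.duration),
    )
```

With that order, each single sits directly above its album copies, which makes a visual check before deleting straightforward.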

mikebinz

  • Recent member
  • *
  • Posts: 6
Re: Smartlist > Code For Finding ByArtist Dups
« Reply #6 on: August 04, 2012, 07:03:27 pm »

All good - thanks for your help