INTERACT FORUM
More => Old Versions => JRiver Media Center 20 for Windows => Topic started by: JIMV on December 08, 2014, 02:22:43 pm
-
Backstory....My ancient iMac died the 'white screen' of death. I bought a new thinkpad and used backups from my backup hard drive to put the music files on the new device. When done I found hundreds (if not thousands) of files duplicated, as in 4 copies of track one, 2 etc...
Is there a way not requiring a degree in programming to scan and identify duplicate files and then batch delete them???
I really do not desire to go file by file and delete that way...
-
I think emptying the library and doing a fresh import would be easiest.
-
How would re-importing a corrupt library solve the issue? I want to scan what is on the PC, get a dump list of duplicate files and then hit a button to make all the dupes go away...How do i do that?
-
You mean you have duplicates at the file system level? If that's the case, figure out what folder is the real folder and erase the other folders.
-
alas, I am still not being clear...I have a separate drive that has 65GB of music on it. This music includes a vast number of duplicate files. That music is now on the computer.
I want to discover how to review all my existing music files on the computer, identify automatically the files with the same name, consolidate them in one pace and then remove them leaving a single file for each specific piece of music.
-
I don't think MC is the tool for the job. I'm not sure what is.
-
Are these duplicate files the same name but in a different directory or are they named differently? Or maybe they are of a different file type (mp4 vs m4a)?
If you can find some commonality between the dupes and if its always like that, you can use an expression to find them and delete them.
Make sure you have a backup.
-
I use a tool called clonespy when trying to perform disk cleanup.
-
so there is no way to scan a library in JRiver and delete dupe files? Anyway, thanks for the responses. I recall one of my earlier bits of music software being able to do that though I cannot recall the software....perhaps media Monkey or Pure Music on my MAC...
Would be a good feature in MC20.
-
Maybe you missed my reply?
If we know a little bit more about how your library is organized at a file level (ie, Artist dir\Album dir\01 - tracknames.mp3) and your duplicates for an album are in that same directory, then they must be named differently because files cannot exist with exactly the same name in the same directory.
If they are scattered but have the same name, it means your directory structure and the correct files per album are correct, but somewhere there is an incorrect directory where the dupes reside.
Obviously there can be plenty of other situations but either way, knowing more about how its gone wrong is essential to solving it.
It can be done with MC, it just can't be done automatically. I don't see how that would work reliably either, there must be some level of interaction where you choose which files can be safely deleted. This is no different in MC.
If you can't figure this out by yourself than possibly you could backup your library (MC Library, as in the database) and share it. I'm willing to have a look at it but I'm not an expression wizard so I can't promise you anything.
-
So wouldn't the built-in "Task -- possible duplicates" (under "Auto Smartlists (music)") be a good place to start? Just double-click on that smartlist and see if your files show up. If they do, then from there InflatableMouse or someone more savvy than me with script expressions (and maybe stacks) could probably guide you on how to create a nice Find/Replace expression to delete the duplicates.
-
Trouble is it's actually a bit difficult determining what is or isn't a duplicate. Sure, direct bit-for-bit scanning will tell you, but that's very resource-intensive (both CPU and disk). There are various PC file duplicate options out there (all suck, in different ways).
Music files are often "collected" from various sources, not always ones the RIAA would considering legitimate. Along with that there's a ton of screwy problems.
Comparing based on metadata makes assumptions on the metadata being present and/or accurate and that's not always the case. So making blanket decisions based on bitrate or other metadata is problematic. How does it decide? Should it keep 192kbps VBR MP3 files in favor of WMVs? Lossless vs lossy (due to disk space arguments).
So if you follow this then you'll understand why there's not a simple "remove duplicates" function.
-
I set up a file type view with file path as one of the columns
you can import this code using the import/export button in customizing the view if you want to copy/paste to try it in JRiver
[Media Type]=[Audio],[TV],[Video] ~dup=[Name],[Album],[Duration] ~sort=[Filename],[Album Artist],[Album],[Track #],[Name],[Disc #],[Artist],[Bitrate]
this will only work for identical duplicates of files actually imported but avoids removing different file types . like same song in mp3 and flac won't be called a duplicate. You can remove duration if you are not getting any dupes
Otherwise externally you can download the freeware of Glary Utilities http://www.glarysoft.com/ (http://www.glarysoft.com/) they have a good duplicate search and its multi-drive if you need that. Probably better in your case.
-
Trouble is it's actually a bit difficult determining what is or isn't a duplicate. Sure, direct bit-for-bit scanning will tell you, but that's very resource-intensive (both CPU and disk). There are various PC file duplicate options out there (all suck, in different ways).
Music files are often "collected" from various sources, not always ones the RIAA would considering legitimate. Along with that there's a ton of screwy problems.
Comparing based on metadata makes assumptions on the metadata being present and/or accurate and that's not always the case. So making blanket decisions based on bitrate or other metadata is problematic. How does it decide? Should it keep 192kbps VBR MP3 files in favor of WMVs? Lossless vs lossy (due to disk space arguments).
So if you follow this then you'll understand why there's not a simple "remove duplicates" function.
I understand perfectly, probably better than you, actually, so there's no need to talk down to me. But JIMV's situation is unique where everything you said is a non-issue. His duplicates literally are true duplicates due to his having to restore from a backup hard drive image which itself was a combination of many backups, hence the duplication.
So the idea is that he should be able to simply stack/group all the files with just the same filename and (maybe) file size, and then just prune the stacks. It's that last part I'm unsure of because I left the A/V world shortly before JRiver introduced stacks, but I'm sure there is probably a way to manipulate them via an expression.
If not, then, well, there are plenty of file duplicate finders out there on the net, and then he can just go find them, delete them, and then follow Matt's original suggestion to just recreate his library.
-
JIMV: If you want to tackle this problem, JRiver can probably be used as at least part of the solution. To help you do it, you'll need to provide what some member have asked for: more information on your file system setup. If they know the structure of where the files are and where duplicates can show up etc, they can probably help you locate them with a smartlist etc that you can then delete the files...
If there is no structure, that makes it more difficult then. However if they are all in the library and you can "see" the duplicates, you can probably create a smartlist from the library that may do the job as well. Again some more information on what you "consider" a duplicate would be helpful.
-
I understand perfectly, probably better than you, actually, so there's no need to talk down to me. But JIMV's situation is unique where everything you said is a non-issue. His duplicates literally are true duplicates due to his having to restore from a backup hard drive image which itself was a combination of many backups, hence the duplication.
You're making assumptions not in evidence. One being that anyone's "talking down" to anyone else here. So just jump off your high horse from that nonsense. Likewise with the "better than you" jab. Really?
That and just what structure of junk he's got in front of him isn't explained. Sure, there's a ton of ways restoring from backups might make a mess. Potentially made even worse by how programs like iTunes might have bastardized his libraries. Since he mentioned an iMac being involved, and I've had to deal with the mess that all that "pretend" friendliness can inflict on unsuspecting users. But that's an assumption I wasn't willing to make, hence I didn't wander down that road.
So when you read someone posting a question wanting a simple answer to a complex problem, but without enough details, don't just leap to conclusions when someone else tries to contribute.
A combination of external windows-based file duplicate checking and MC will go a long way toward getting a grip on the mess he's got on his hands. But it won't, as I posted, come without more work than just a "simple" duplicate remover function could ever hope to effectively provide.
-
It's been a few weeks since JIMV asked for help and hasn't reported back.
If the person with the problem is not engaging, then it doesn't make sense to ponder about the what ifs or debate who has a better understanding of the issues.
-
Man I'm so close!
I'm merging some libraries and wondering if MC could implement a kind of post-filter.
I've located EXACT dups, about 20K, of them and when I display only these dups I thought of two ways to eliminate them.
1. Rename/Move all my files with DUPS REMOVED filter. This is cumbersome and I'm not sure if my tages would facilitate the move e.g. using Genre to create a folder structure or even Album Artist (auto).
2. Move out all the DUPS into one location. I like to use a [file type] tag but in this case it won't help since they are exact dups, same bit rate and file type. Also I noted that if I simply move/rename the DUPS if the album is the same, all the files are dumped into one [Album] folder.
Finally, if MC had a kind of post-filter, I could just delete the displayed DUPS. Currently, you get a list of all files that are DUPS. If another filter was available, to only apply after previous and not affect the parent filter, I could click delete. Most of mine of in different file locations so I am looking at how that might help, but it's seems MC is so close to making this easy....
-
I suggest that you buy the PerfectTunes DeDup application ( http://www.dbpoweramp.com/perfecttunes.htm )
-
Thanks dude! Will check it out.
-
Not really cooking....on a NAS that does 60mb/s w/r
- (http://i.imgur.com/Gb1mMJrl.jpg) (http://i.imgur.com/Gb1mMJr.jpg)
-
Yes. Instead of just checking the meta data, it actually "listens" to each track to see if it "sounds" like any other tracks. This listening is faster than real time, but it does indeed take a while, and it certainly does help if you have a fast machine. If your music is on a detachable HD then plug it into the fastest PC in the house for that purpose. I have 12k tracks and I let it run overnight and it was ready by morning..
PS is your PC connecting to the NAS by a UNC share name (//Server/Folder/..) or via a mapped drive letter (F:) since I seem to recall that it may be faster with the latter..