Audio Dedupe is an innovative tool that can recognize duplicate audio files even if they are stored in different file formats and are not marked with ID3 tags.
Audio Dedupe finds all similar or exact duplicate audio files in a folder and its subfolders. Unlike other duplicate MP3 finders, it actually "listens" to the music and can recognize a song even if it is saved in a different file format. The default audio similarity of 70% works best, but the value can be lowered to broaden the results or raised to narrow them.
There are also fast hash and file size algorithms that find exact duplicate MP3 files. When the scan finishes, the program shows all duplicate audio files in groups and can automatically mark the lower-quality files for deletion: those with a lower bitrate, shorter length and/or smaller file size.
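The idea behind a file size plus hash comparison can be sketched as follows. This is only an illustration of the general technique, not Audio Dedupe's actual implementation: the function names and the choice of SHA-256 are assumptions made for the example.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def file_digest(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_exact_duplicates(root: Path) -> list[list[Path]]:
    """Group byte-identical files found under root and its subfolders."""
    # Step 1: bucket files by size, which is cheap to read.
    by_size: dict[int, list[Path]] = defaultdict(list)
    for path in root.rglob("*"):
        if path.is_file():
            by_size[path.stat().st_size].append(path)

    # Step 2: hash only files that share a size with at least one other file.
    groups: list[list[Path]] = []
    for same_size in by_size.values():
        if len(same_size) < 2:
            continue  # a unique size cannot have an exact duplicate
        by_hash: dict[str, list[Path]] = defaultdict(list)
        for path in same_size:
            by_hash[file_digest(path)].append(path)
        groups.extend(sorted(group) for group in by_hash.values() if len(group) > 1)
    return groups
```

Comparing sizes first avoids hashing most files. Note that this kind of check only catches byte-identical copies; the same song saved in two formats needs the audio similarity scan described above.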
Just a couple of mouse clicks will remove all duplicate audio files and save valuable disk space. The duplicates can also be backed up instead: the tool can move the files to a specified folder while maintaining the directory structure.
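To make "maintaining the directory structure" concrete, here is a minimal sketch of moving a file into a backup folder while keeping its path relative to the scanned folder. It is not the tool's own code; `move_to_backup`, `scan_root` and `backup_root` are hypothetical names chosen for the example.

```python
import shutil
from pathlib import Path


def backup_path(file_path: Path, scan_root: Path, backup_root: Path) -> Path:
    """Map a file under scan_root to the same relative location under backup_root."""
    return backup_root / file_path.relative_to(scan_root)


def move_to_backup(file_path: Path, scan_root: Path, backup_root: Path) -> Path:
    """Move file_path into backup_root, recreating its subfolders as needed."""
    destination = backup_path(file_path, scan_root, backup_root)
    destination.parent.mkdir(parents=True, exist_ok=True)
    shutil.move(str(file_path), str(destination))
    return destination
```

With this layout, a file at `music/album/track.mp3` ends up at `backup/album/track.mp3`, so it can be restored to its original place later.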
Supports MP3, MP2, MP1, MPA, WAV, OGG, AIFF, AAC, MP4, FLAC, AC3, WavPack (WV), Musepack (MPC) and Windows Media Audio (WMA) file formats.
Note: Audio files should be at least 90 seconds long in order to provide sufficient data for audio analysis. Files shorter than 90 seconds will not be processed.
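If you want to know in advance which files fall below the 90-second limit, a pre-check along these lines may help. The sketch handles WAV files only, because Python's standard `wave` module can read their length; other formats would need a third-party library. The function names are hypothetical and the check is independent of how Audio Dedupe itself measures length.

```python
import wave
from pathlib import Path

MIN_SECONDS = 90.0  # minimum length stated in the note above


def wav_duration_seconds(path: Path) -> float:
    """Return the playing time of a WAV file in seconds."""
    with wave.open(str(path), "rb") as wav_file:
        return wav_file.getnframes() / wav_file.getframerate()


def long_enough(path: Path, minimum: float = MIN_SECONDS) -> bool:
    """Return True if the WAV file is at least `minimum` seconds long."""
    return wav_duration_seconds(path) >= minimum
```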
Command-Line Parameters Usage: (Available only in Corporate Version)
AudioDedupe.exe -src_list <srclistfilename> -target_list <targetfilename> -log <logfilename> -s <similarity> -db <dbfilename> [-cachenames]
-src_list <srclistfilename> - specifies a plain text file which contains file names that you want to compare against <targetfilename> files
-target_list <targetfilename> - specifies a plain text file which contains file names that you want to compare against <srclistfilename> files
Note: If you omit the -target_list parameter, only the -src_list files will be scanned
-log <logfilename> - specifies the full path of the file where the comparison log will be saved, e.g. c:\log.txt
-s <similarity> - specifies the similarity in percent. Example: -s 70
-db <dbfilename> - specifies the cache file name to be used or created.
-cachenames - if this option is specified, the program will store file names in the cache instead of file checksums
Note: Cache files store the audio file information gathered during the preprocessing pass (1st pass) of the scanning process. So if you often scan the same files, using a cache file considerably speeds up the preprocessing pass.
You don't need to create the cache file yourself. Just specify it as a parameter and the program will create it, or, if a cache file from a previous scan exists, the program will use it. The cache file extension is ADCF (Audio Dedupe Cache File).
Example: AudioDedupe.exe -src_list "c:\src.txt" -target_list "c:\target.txt" -log "c:\log.txt" -s 70 -db "c:\db.adcf"
This will find all audio files with similarity >= 70% and save their file names and similarity in the file c:\log.txt.
Note: If some of the parameters contain spaces, you must enclose them in double quotes. Example: -src_list "c:\program files\files.txt"
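When the scan is started from a script, the quoting can be left to the scripting language. The Python sketch below builds the argument list from the parameters documented above; `build_command` and `run_scan` are hypothetical helper names. Treating -target_list and -cachenames as the only optional parameters is an assumption based on the usage line and the note on -target_list.

```python
import subprocess
from typing import Optional


def build_command(
    exe: str,
    src_list: str,
    log: str,
    similarity: int,
    db: str,
    target_list: Optional[str] = None,
    cache_names: bool = False,
) -> list[str]:
    """Assemble the AudioDedupe.exe arguments in the order shown in the usage line."""
    command = [exe, "-src_list", src_list]
    if target_list is not None:
        command += ["-target_list", target_list]
    command += ["-log", log, "-s", str(similarity), "-db", db]
    if cache_names:
        command.append("-cachenames")
    return command


def run_scan(command: list[str]) -> subprocess.CompletedProcess:
    """Start the scan and wait for it to finish.

    Passing a list (not one string) lets subprocess quote any argument
    that contains spaces. The tool's exit codes are not described here,
    so the result is returned without interpreting it.
    """
    return subprocess.run(command, check=False)
```

For example, `build_command("AudioDedupe.exe", r"c:\src.txt", r"c:\log.txt", 70, r"c:\db.adcf", target_list=r"c:\target.txt")` produces the same arguments as the example command above.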
Sample src.txt and target.txt file contents:
C:\My Documents\MP3\Track (3).mp3
C:\My Documents\MP3\Track (3).mpc
C:\My Documents\MP3\Track (4).mp3
C:\My Documents\MP3\Track (4).ogg
C:\My Documents\MP3\Track (5).mp3
C:\My Documents\MP3\Track (6).mp3
C:\My Documents\MP3\Track (7).mp3
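A list file like the one above can be generated rather than typed by hand. The following Python sketch collects audio files from a folder and writes one full path per line, as in the sample. The extension set is derived from the supported formats listed earlier, and the helper names and the UTF-8 encoding are assumptions; the text encoding expected by the program is not documented here.

```python
from pathlib import Path
from typing import Iterable

# Extensions assumed from the supported formats list (hypothetical mapping).
AUDIO_EXTENSIONS = {
    ".mp3", ".mp2", ".mp1", ".mpa", ".wav", ".ogg", ".aiff",
    ".aac", ".mp4", ".flac", ".ac3", ".wv", ".mpc", ".wma",
}


def collect_audio_files(root: Path) -> list[Path]:
    """Return audio files under root and its subfolders, sorted by path."""
    return sorted(
        path
        for path in root.rglob("*")
        if path.is_file() and path.suffix.lower() in AUDIO_EXTENSIONS
    )


def write_list_file(list_path: Path, audio_paths: Iterable[Path]) -> int:
    """Write one file name per line and return the number of lines written."""
    lines = [str(path) for path in audio_paths]
    list_path.write_text("".join(line + "\n" for line in lines), encoding="utf-8")
    return len(lines)
```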
Sample log.txt file contents:
* DUPLICATES OF * : C:\My Documents\MP3\Track (3).mp3