Different Source-Folder Option is Inefficient
Posted: 14 Mar 2012, 04:20
Hi,
The option Detect only duplicates from different source folders can be useful, but it is implemented inefficiently. Currently what it does is to wait until all files have been scanned as normal, then filter out the results. This means that a lot of unnecessary comparisons are done. Instead, the option should be used to reduce the number of comparisons.
For example, say that all the files below are identical:
Currently, AllDup checks:
Then it removes 1.txt - 2.txt and a.txt - b.txt from the results (which causes the program to look like it is frozen for a while if there are a lot of results). It should not check those in the first place; it should only check:
That’s 1/3 few comparisons and the results are displayed faster too.
Files from the same folder are being compared even though the separate sources option is selected:
The option Detect only duplicates from different source folders can be useful, but it is implemented inefficiently. Currently what it does is to wait until all files have been scanned as normal, then filter out the results. This means that a lot of unnecessary comparisons are done. Instead, the option should be used to reduce the number of comparisons.
For example, say that all the files below are identical:
Code: Select all
C:\
A\
1.txt
2.txt
B\
a.txt
b.txt
Code: Select all
1.txt - 2.txt
1.txt - a.txt
1.txt - b.txt
2.txt - a.txt
2.txt - b.txt
a.txt - b.txt
Code: Select all
1.txt - a.txt
1.txt - b.txt
2.txt - a.txt
2.txt - b.txt
Files from the same folder are being compared even though the separate sources option is selected: