X-Git-Url: https://www.fleuret.org/cgi-bin/gitweb/gitweb.cgi?a=blobdiff_plain;f=finddup.1;h=4ba180726f60a73954b8e4d101b854b6138c77a5;hb=163e504155f281678809fd2c924802e264786cb2;hp=540b0d4812a82c3db93c9719a2d3361b871f84a6;hpb=ef337e5e5c15540fe4f4f367a9bd4599b4559634;p=finddup.git diff --git a/finddup.1 b/finddup.1 index 540b0d4..4ba1807 100644 --- a/finddup.1 +++ b/finddup.1 @@ -19,6 +19,14 @@ files found in it. With two directories, it prints either the files common to both DIR1 and DIR2, or with the `not:' prefix, the ones present in DIR1 and not in DIR2. +It compares files by first comparing their sizes, hence goes +reasonably fast. + +When looking for identical files, +.B finddup +by default associates a group ID to every content, and prints it along +the file names. + .SH "OPTIONS" .TP \fB-h\fR @@ -31,43 +39,51 @@ ignore files and directories starting with a dot do not show which files from DIR2 corresponds to files from DIR1 .TP \fB-g\fR -do not show the file group IDs (one group for each content) +do not show the file group IDs .TP \fB-p\fR show progress information in stderr .TP \fB-r\fR -shows the real path of the files +show the real path of the files .SH "BUGS" None known, probably many. Valgrind does not complain though. -While not a bug per se, the for of the output should definitely be -improved. Not clear how. +.SH "WISH LIST" + +The format of the output should definitely be improved. Not clear how. + +The comparison algorithm could definitely use some MD5 kind of +signature. I doubt it would really speed up a lot. + +Their should be some fancy option to run two instances of the command +on different machines so that comparison could be done without disk +access where the disk are physically. .SH "EXAMPLES" -.nf + .B finddup -cg blah something .fi List files found in .B ./blah/ -which have a matching file with exact same content in +which have a matching file with same content in .B ./something/ without the group IDs .P -.B finddup ./sources not:./backup +.B finddup sources not:/mnt/backup .fi List all files found in .B ./sources/ which do not have content-matching equivalent in -.B ./backup.sources +.B /mnt/backup .P -.B finddup ./tralala ./cuicui | sort -n +.B finddup tralala cuicui .fi List groups of files with same content which exist both in