이름이 다른 중복 파일 검색 및 삭제

Question 1

이런 프로그램이 있는데 이름은 다음과 같습니다 rdfind.

SYNOPSIS
   rdfind [ options ] directory1 | file1 [ directory2 | file2 ] ...

DESCRIPTION
   rdfind  finds duplicate files across and/or within several directories.
   It calculates checksum only if necessary.  rdfind  runs  in  O(Nlog(N))
   time with N being the number of files.

   If  two  (or  more) equal files are found, the program decides which of
   them is the original and the rest are considered  duplicates.  This  is
   done  by  ranking  the  files  to each other and deciding which has the
   highest rank. See section RANKING for details.

중복된 항목을 제거하거나 기호 또는 하드 링크로 바꿀 수 있습니다.

Answer

이런 프로그램이 있는데 이름은 다음과 같습니다 rdfind.

SYNOPSIS
   rdfind [ options ] directory1 | file1 [ directory2 | file2 ] ...

DESCRIPTION
   rdfind  finds duplicate files across and/or within several directories.
   It calculates checksum only if necessary.  rdfind  runs  in  O(Nlog(N))
   time with N being the number of files.

   If  two  (or  more) equal files are found, the program decides which of
   them is the original and the rest are considered  duplicates.  This  is
   done  by  ranking  the  files  to each other and deciding which has the
   highest rank. See section RANKING for details.

중복된 항목을 제거하거나 기호 또는 하드 링크로 바꿀 수 있습니다.

Question 2

흡입. 나는 이것과 중복되는 문제를 해결하기 위해 모든 중복 항목을 나열하는 한 줄을 개발했습니다. 얼마나 메타. 글쎄, 낭비한 것이 부끄럽기 때문에 rdfind더 나은 해결책처럼 들리더라도 게시하겠습니다 .

이것은 적어도 "진짜" 유닉스 방식이라는 장점이 있습니다. ;)

find -name '*.mp3' -print0 | xargs -0 md5sum | sort | uniq -Dw 32

파이프를 부수세요:

find -name '*.mp3' -print0현재 디렉터리부터 시작하여 하위 트리에 있는 모든 mp3 파일을 찾아 이름을 인쇄합니다(NUL로 구분).

xargs -0 md5sumNUL로 구분된 목록을 읽고 각 파일의 체크섬을 계산합니다.

당신은 sort그것이 무엇을하는지 알고 있습니다.

uniq -Dw 32정렬된 줄의 처음 32자를 비교하여 동일한 해시 값을 가진 문자만 인쇄합니다.

따라서 모든 중복 목록이 생성됩니다. 그런 다음 제거하려는 항목으로 수동으로 줄이고 해시를 제거한 다음 목록을 rm.

Answer