Hi,
Some time ago I migrated from Linux to FreeBSD, and now I am re-organizing my backups. I have one mirror (1-on-1 copy) of my hard disk, these considerations are on my incremental backup and archive, where I keep everything I ever made on my computers. Not all that stuff is necessarily on my day-to-day box.
Watching the verbosity, I noticed that there are a lot of duplicate files in and between backups. This mostly comes from reorganizing my hard drives -- moving directories to a better location (in or out of a parent directory) and renaming them for easier accessibility (eg. using all lower case instead of first character upper case names).
I don't have exact numbers, but suspect a lot of duplicates. Some however are useful to me, eg. having copies of all versions of my old websites, where the picts have many duplicates.
So before I rsync my old backups with the one-to-stay backup, I did some manual de-duplicating. textproc/meld seems to slow for the bazillion of files I have, manually using the output of sysutils/fdupes seems to be a better way. Since I still haven't figured out what the best structure is for my backup, I don't use the
I don't want to mess around too much with my backup, but some de-duplication seems worth the extra disk space. So:
* before making a backup, de-duplicate files and directories;
* empty the trash, unless you also want to backup that;
* use directory names that you want to keep,
*
My question is if any of you de-duplicates and reorganizes your backup volumes or not, and what strategy you use...
TIA,
Some time ago I migrated from Linux to FreeBSD, and now I am re-organizing my backups. I have one mirror (1-on-1 copy) of my hard disk, these considerations are on my incremental backup and archive, where I keep everything I ever made on my computers. Not all that stuff is necessarily on my day-to-day box.
Watching the verbosity, I noticed that there are a lot of duplicate files in and between backups. This mostly comes from reorganizing my hard drives -- moving directories to a better location (in or out of a parent directory) and renaming them for easier accessibility (eg. using all lower case instead of first character upper case names).
I don't have exact numbers, but suspect a lot of duplicates. Some however are useful to me, eg. having copies of all versions of my old websites, where the picts have many duplicates.
So before I rsync my old backups with the one-to-stay backup, I did some manual de-duplicating. textproc/meld seems to slow for the bazillion of files I have, manually using the output of sysutils/fdupes seems to be a better way. Since I still haven't figured out what the best structure is for my backup, I don't use the
fdupes -r -d
options (yet).I don't want to mess around too much with my backup, but some de-duplication seems worth the extra disk space. So:
* before making a backup, de-duplicate files and directories;
* empty the trash, unless you also want to backup that;
* use directory names that you want to keep,
rsync old_name new_name
;*
rsync
with your backup volume;My question is if any of you de-duplicates and reorganizes your backup volumes or not, and what strategy you use...
TIA,