Hi there,
I'm using rdfind [1] to convert duplicate files into hardlinks. Since I also use rsnapshot as a backup solution (usually one drive for data, backup1 and backup2 are setup as targets for two separate rsnapshot jobs) quite some space can be saved this way.
Especially when I've build a new OMV and users start using it and rename and move a lot of files.
However OMV with a Pentium (1155 socket) seems to slow down quite horribly and sends ressource limit warnings, while at the same time the CPU-load doesn't seem to be a problem at all (rdfind itself uses at most 30%, mostly around 10-15%).
I thought it might be actually the hardrives causing the slow response, since rdfind of course needs to scan the whole filesystem. However the problem also occurs when I run rdfind on a backupdrive, leaving the data drive alone, which shouldn't slow down then anymore.
Questions:
- any other experiences with solutions to the duplicates problem apart from rdfind? (on ext4, not ZFS depub etc.)
- any idea what the reason for the slowing down of the system might be? Or which way to investigate? I'm kind of out of ideas.
- more general: if i'd want to limit the cpu-load that a specific scheduled job might use what is the proper way to achieve that?
Hardware:
- CPU Pentium G640
- RAM 4GB
- OS 320 GB HDD, 2,5"
- 3x 4TB HDD
- services: daily rsync, rsnapshot, rdfind; smb, openVPN
- clients for openVPN simultaneously: 10 max, usually more like 3-5
[1] https://rdfind.pauldreik.se/
Thanks so far
kwon