Posts by molnart

    One of my drives started to show some SMART errors, so I started to move its content off to my mergerFS pool via rsync. However, the process always seems to fail. I don't know if that is due to the drive's state or something else.


    The command I am using is the following:
    sudo rsync -avxPH --info=progress2 --info=name0 --sparse /srv/dev-disk-by-uuid-412bd813-d998-4075-99dd-864e9ae378ec/Backups/ /srv/dev-disk-by-uuid-4ca1dab4-dabe-459f-9ced-bdb6f0d4fcb2/Backups/


    the result of rsync is this:

    Code
     57,421,479,937   1%    1.97MB/s    7:42:13 (xfr#682983, ir-chk=1120/39497182)
    rsync: connection unexpectedly closed (535108771 bytes received so far) [sender]
    rsync error: error in rsync protocol data stream (code 12) at io.c(231) [sender=3.2.7]


    I have already run this command about 5 times, that's the reason for the low amount of transferred data. the disk contains mostly rsnapshot backups, so a huge number of small files. i am not sure how many files there are on the disk, but rsync always ends at a vastly different file count.


    what other ways do I have to get the data off the disk? i cannot use dd because i am writing the data to a drive that already holds data.
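
    Since rsync skips files that already made it across, one low-tech option is to simply rerun it until it exits cleanly. A minimal sketch, assuming the failures are transient; retry_until_ok is a made-up helper name, and the attempt count and delay are arbitrary:

```shell
# Sketch: rerun a command until it succeeds or the attempt limit is hit.
# retry_until_ok is a hypothetical helper, not part of rsync.
# Usage: retry_until_ok MAX_ATTEMPTS DELAY_SECONDS command [args...]
retry_until_ok() {
    _max=$1
    _delay=$2
    shift 2
    _n=1
    until "$@"; do
        if [ "$_n" -ge "$_max" ]; then
            echo "giving up after $_n attempts" >&2
            return 1
        fi
        _n=$((_n + 1))
        sleep "$_delay"
    done
}

# Example (not executed here), reusing the options from the command above:
# retry_until_ok 20 30 sudo rsync -avxPH --sparse \
#     /srv/dev-disk-by-uuid-412bd813-d998-4075-99dd-864e9ae378ec/Backups/ \
#     /srv/dev-disk-by-uuid-4ca1dab4-dabe-459f-9ced-bdb6f0d4fcb2/Backups/
```

    This only helps if each run makes some progress; if the drive drops out at the same spot every time, retrying will not get past it.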

    turns out the key to rebuilding the modules is the following:


    Code
    sudo dpkg-reconfigure nvidia-kernel-dkms
    sudo nvidia-ctk runtime configure --runtime=docker
    sudo systemctl restart docker

    how do i do that? i tried dkms autoinstall, update-initramfs -u, update-grub, even apt install --reinstall nvidia-driver but nothing seems to work. if i need to purge the drivers or, even worse, restart OMV, i'd rather throw that card out of the running system directly into the trash; there is no way I am restarting my system before it reaches 500 days uptime.

    it seems after a kernel update my nvidia gpu is just "gone". all my containers using nvidia refuse to start, and nvtop and nvidia-smi show there is no gpu in the system, although lspci shows it.
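
    A quick way to tell whether this is a "module not built for the current kernel" problem is to compare the dkms status output with the running kernel. A rough sketch; nvidia_module_built_for is a made-up helper name, and it assumes the DKMS module name starts with "nvidia" and that dkms status prints the kernel version and the word "installed" on the same line:

```shell
# Sketch: reads `dkms status` output on stdin and succeeds if a module whose
# name starts with "nvidia" is listed as installed for kernel version $1.
# nvidia_module_built_for is a hypothetical helper name.
nvidia_module_built_for() {
    grep -i '^nvidia' | grep -F -- "$1" | grep -q 'installed'
}

# Example (not executed here):
# dkms status | nvidia_module_built_for "$(uname -r)" \
#     || echo "no nvidia module built for the running kernel"
```

    If the check fails, the module has to be rebuilt for the running kernel before nvidia-smi can see the card.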

    thanks a lot, it was pretty easy. after dumping the rrd file to XML and finding the correct section, each row contains a human-readable timestamp and the occupied space in bytes.


    it looks like i have free capacity for 2 years and a few months.
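
    For anyone who wants to repeat this, here is a rough sketch of the estimate. It assumes the input is the row lines of a single RRA section from rrdtool dump (one timestamp comment and one value per row) and that usage grows roughly linearly; estimate_days_to_full is a made-up name, and the file name in the example is a placeholder:

```shell
# Sketch: estimate the days until a disk is full from `rrdtool dump` rows.
# Assumptions: stdin holds the rows of ONE RRA section, each looking like
#   <!-- 2023-11-14 22:13:20 UTC / 1700000000 --> <row><v>1.0e+12</v></row>
# and growth is linear between the first and the last valid row.
# $1 = total capacity in bytes. estimate_days_to_full is a hypothetical name.
estimate_days_to_full() {
    awk -v cap="$1" '
        /<row>/ {
            # epoch timestamp sits between "/ " and " -->"
            if (!match($0, /\/ [0-9]+ -->/)) next
            ts = substr($0, RSTART + 2, RLENGTH - 6) + 0
            # first <v>...</v> is the used space in bytes
            if (!match($0, /<v>[^<]+<\/v>/)) next
            val = substr($0, RSTART + 3, RLENGTH - 7)
            if (val ~ /[Nn][Aa][Nn]/) next   # skip empty samples
            val += 0
            if (n == 0) { t0 = ts; v0 = val }
            t1 = ts; v1 = val; n++
        }
        END {
            if (n < 2 || v1 <= v0) { print "n/a"; exit }
            printf "%.0f\n", (cap - v1) * (t1 - t0) / (v1 - v0) / 86400
        }'
}

# Example (not executed here; section.xml is a placeholder for the rows
# copied out of the dump, the capacity is the disk size in bytes):
# estimate_days_to_full 8000000000000 < section.xml
```

    It only looks at the first and last sample, so a one-off jump in usage skews the result; a proper fit over all rows would be more robust.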

    OMV keeps pretty nice track of disk utilization under Performance statistics, and i assume that data is fairly granular as well. There should be a way to use that information to estimate how long it will take to fill up my storage, so i can plan an expansion sufficiently in advance. Can somebody point me to the tools to use for that?


    Unfortunately it seems that the folder /var/lib/openmediavault/rrd is just a bunch of png images and i cannot get any data out of those

    You can not use the same GPU in multiple containers.

    wtf, you serious? that's a serious bummer.


    after further testing it looks like the container that was spun up last always has access to the GPU, and the others don't. so far I can occasionally get HW acceleration for Plex and Ollama, but it's pretty unpredictable which one can actually use it. and additionally i wanted to play around with fooocus image generation...


    so i guess my only option is to move my GPU apps to separate LXC containers, as there seems to be a way to share the GPU between multiple LXCs
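
    To see which container actually has the GPU at a given moment, something like the following can be used. It is only a sketch: gpu_visibility is a made-up name, the container names are placeholders, and it assumes nvidia-smi is available inside the containers (which depends on how the NVIDIA runtime is set up for them):

```shell
# Sketch: report for each named container whether nvidia-smi can list a GPU.
# gpu_visibility is a hypothetical helper; it relies on `docker exec` and on
# nvidia-smi being present inside the container.
gpu_visibility() {
    for _c in "$@"; do
        if docker exec "$_c" nvidia-smi -L >/dev/null 2>&1; then
            echo "$_c: GPU visible"
        else
            echo "$_c: GPU not visible"
        fi
    done
}

# Example (not executed here; container names are placeholders):
# gpu_visibility plex ollama
```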

    ok, i spoke too soon. drivers are installed, and the GPU should be visible inside docker containers, as confirmed by

    Code
    docker run --rm --runtime=nvidia --gpus all nvidia/cuda:11.6.2-base-ubuntu20.04 nvidia-smi

    i have changed the plex and immich settings to use HW acceleration, but it does not work: neither plex nor immich uses it... what else could i have missed?

    finally i got this running. for me the steps were the following:


    1) install proxmox kernel

    2) disable apparmor

    3) install the drivers according to the instructions, but ignore the xconfig part


    now everything seems ok, but i could not try it, as in the meantime i moved all my containers that could utilize the GPU to a different VM, only to realize i can't pass the GPU through to it because it's tied to my HBA in a single IOMMU group
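
    For reference, the IOMMU grouping can be checked on the host before moving things around. A minimal sketch that walks the sysfs tree; list_iommu_groups is a made-up name, and the optional argument exists only so the function can be pointed at a different directory:

```shell
# Sketch: print every PCI device together with its IOMMU group number.
# Reads /sys/kernel/iommu_groups by default; list_iommu_groups is a
# hypothetical helper name.
list_iommu_groups() {
    _root=${1:-/sys/kernel/iommu_groups}
    for _dev in "$_root"/*/devices/*; do
        [ -e "$_dev" ] || continue        # no groups: IOMMU likely disabled
        _group=${_dev%/devices/*}         # strip "/devices/<address>"
        _group=${_group##*/}              # keep only the group number
        echo "group $_group: ${_dev##*/}"
    done
}

# Example (not executed here):
# list_iommu_groups | sort -V
```

    Devices that show up under the same group number can only be passed through together.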

    switching to the pve kernel allows me to install the nvidia drivers without error, but i run into issues with sudo nvidia-xconfig, which does not find the GPU:


    although in lspci i can clearly see the device


    02:00.0 3D controller: NVIDIA Corporation GP104GL [Tesla P4] (rev a1)

    i moved to virtualized OMV in 2020 when i wanted to run a router on my machine as well. first i passed through individual disks, then i moved to a different rig with an HBA. both setups worked well.


    EDIT: as some highlighted below, in disk passthrough mode OMV has no low level access to the disks, e.g. SMART data cannot be retrieved from within OMV (but they can from the underlying host). Some file systems such as ZFS rely on this data to function reliably,

    i am using the stock OMV7 kernel

    Linux omv6 6.1.0-18-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.76-1 (2024-02-01) x86_64 GNU/Linux


    i ran apt purge *nvidia* several times, but it did not help.


    omv6 is the hostname of my install, but i am on 7 actually

    i am trying to install the nvidia drivers on OMV 7 according to your guide but am running into this error:



    tried installing on a plain debian VM and it went without errors

    Wanted to add some new NFS shares and ran into this error when applying settings:


    Looks like it's something related to fstab. I checked the output of lsblk and the content of /etc/fstab in case something had changed in the disk setup (I did not touch it), but found no obvious reason for the error

    i am running around 60 containers, so simply too much to list, but generally they can be put into the following categories:


    Media & downloads management - Plex, *arr stack, Transmission, jDownloader, Jackett, etc.

    Home automation - Home Assistant, ESPHome

    Photo & media management - Immich, Paperless-ngx

    Monitoring & Infrastructure - Grafana, InfluxDB, LibreNMS, UniFi Controller, PiAlert, etc.

    Some utilities & tools - Vaultwarden, Tandoor, Firefly, StirlingPDF, etc.