Posts by prtigger1

    Hi folks


    Does anybody know a way to get rid of the automatically applied mount option discard=async on my data Btrfs RAID1 with three SSDs?


    I want to trim the filesystem manually, because there may be a problem with the Samsung 870 QVOs.


    Code
    /dev/sdb on /srv/dev-disk-by-uuid-a8a06053-0cc4-491d-adf2-01bb8a02fb44 type btrfs (rw,relatime,ssd,discard=async,space_cache=v2,subvolid=5,subvol=/)

    I tried setting this and rebooting, with no effect:

    Code
    omv-env set OMV_FSTAB_MNTOPS_BTRFS "nodiscard"
    monit restart omv-engined
    omv-salt stage run prepare
    omv-salt stage run deploy

    Any suggestions?
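A minimal sketch for checking, after a reboot, which discard mode is really in effect on a mount point. It only reads the live mount options with findmnt and changes nothing. The function names has_mount_option and report_discard are made up for this example. As far as I know, Btrfs on kernels from 6.2 on enables discard=async by itself on devices that support it, which would fit the behaviour described above, but treat that as an assumption for this setup.

```shell
#!/bin/sh
# has_mount_option OPTIONS WANTED
# Returns 0 if the comma-separated OPTIONS string contains WANTED exactly.
has_mount_option() {
    case ",$1," in
        *",$2,"*) return 0 ;;
        *)        return 1 ;;
    esac
}

# report_discard MOUNTPOINT   (hypothetical helper, not part of OMV)
# Reads the live options with findmnt and reports the discard mode.
report_discard() {
    opts=$(findmnt -n -o OPTIONS --target "$1") || return 1
    if has_mount_option "$opts" "discard=async"; then
        echo "$1: discard=async is active"
    elif has_mount_option "$opts" "nodiscard"; then
        echo "$1: nodiscard is active"
    else
        echo "$1: no explicit discard option in: $opts"
    fi
}
```

Example call: report_discard /srv/dev-disk-by-uuid-a8a06053-0cc4-491d-adf2-01bb8a02fb44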


    Best regards

    prtigger

    Krisbee

    crashtest


    Hi all

    Here are my findings so far (without another shutdown hang up to now):


    - I/O wait while writing 50 GB of data to the Btrfs RAID1 with three SSDs is between 2 and 5. That's good!

    (Edited: with the write cache disabled on all disks and writing 14 GB of data, I/O wait is between 0.1 and 7.9.)


    - The 'Synchronizing SCSI Cache' part of 'Synchronizing SCSI Cache/Stopping disk' is not the problem. It always takes only a very short time during shutdown.

    There is no noticeable difference when the write cache is disabled.


    - 'Stopping disk' is the part that can take anywhere from very short (<0.5 seconds) to 6 seconds per disk with my setup!

    That coincides with the drive activity LED being lit. There is no relation to SSD write cache usage.


    I guess stopping the SSDs is not necessary, because they are not rotating disks.

    There is no relation to the Btrfs driver, because no filesystem is active anymore at this point.

    Only the SATA layer is still active (the AHCI driver in my setup).

    The varying stop times could be related to the SSD firmware or to the Linux kernel driver that manages stopping the drives.

    It's the same with the 6.12.x Debian kernel and the 6.14.x Proxmox kernel.


    When I think back to the last shutdown hang:

    '[FAILED] Failed unmounting srv-dev\x2ddisk\x2dby\x2duuid\x2...srv/dev-disk-by-uuid-<my-server-volume-uuid>'

    may happen at this stage of the shutdown, but I can't confirm that yet.


    I guess there is nothing left to unmount at this stage of the shutdown, because all filesystems are already down!


    Possible workaround:

    Don't use shutdown! Use reboot (no disk stopping), go to GRUB or the BIOS, and power off with the power button.


    That's it for now...


    Best regards

    prtigger

    Krisbee

    crashtest


    Hi

    I've got a question: I want to try disabling the write cache of the three Samsung SSDs (for testing). The cache is enabled by default.

    I made no disk-specific settings in the OMV GUI! Is there an easy way to disable it?


    Do you know any way to trigger a 'Synchronizing SCSI Cache' (syncing at the AHCI driver layer) before shutting down the server, while the filesystem is still mounted?
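One possible way to toggle the drive write cache from the shell is hdparm -W (0 = off, 1 = on). The sketch below only prints the commands unless DRY_RUN is set to 0, so nothing changes by accident. The helper name set_write_cache, the DRY_RUN switch and the device names in the example call are assumptions for illustration. On many drives the setting does not survive a power cycle, so it may need to be reapplied after boot.

```shell
#!/bin/sh
# set_write_cache MODE DEVICE...   (hypothetical helper)
# MODE is "on" or "off". With DRY_RUN unset or 1, the hdparm commands
# are only printed; with DRY_RUN=0 they are executed (needs root).
set_write_cache() {
    case "$1" in
        on)  flag=1 ;;
        off) flag=0 ;;
        *)   echo "usage: set_write_cache on|off DEVICE..." >&2; return 2 ;;
    esac
    shift
    for dev in "$@"; do
        if [ "${DRY_RUN:-1}" = "1" ]; then
            echo "hdparm -W $flag $dev"
        else
            hdparm -W "$flag" "$dev" || return 1
        fi
    done
}
```

Example call (device names assumed): set_write_cache off /dev/sdb /dev/sdc /dev/sdd. For the second question, running sync before the shutdown asks the kernel to write out dirty data; whether that covers what the 'Synchronizing SCSI Cache' message does is something I can't confirm.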


    Best regards

    prtigger

    Krisbee

    Hi, here is the result of the I/O wait test:


    I did this:


    Running 'top' with a 5-second delay:



    I/O wait (wa) is between 1.4 and 2.3 most of the time!

    The maximum I saw during the scrub was 3.3.


    The maximum load average (1 min.) was 10.08.
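For logging instead of watching top, the wa figure can also be approximated from two samples of the aggregate cpu line in /proc/stat. This is a rough sketch, not how top itself is implemented: the function names iowait_pct and sample_iowait are made up, and the calculation simply divides the iowait delta by the delta of the user through steal counters.

```shell
#!/bin/sh
# iowait_pct "CPU_LINE_BEFORE" "CPU_LINE_AFTER"
# Each argument is the aggregate "cpu ..." line from /proc/stat.
# Prints the share of iowait time between the two samples, in percent.
iowait_pct() {
    printf '%s\n%s\n' "$1" "$2" | LC_ALL=C awk '
        {
            total = 0
            for (i = 2; i <= 9; i++) total += $i   # user .. steal
            t[NR] = total
            w[NR] = $6                             # field 6 = iowait
        }
        END {
            dt = t[2] - t[1]
            if (dt <= 0) { print "0.0"; exit }
            printf "%.1f\n", 100 * (w[2] - w[1]) / dt
        }'
}

# sample_iowait [SECONDS]: take two live samples (default 5 s apart,
# like the top delay used above) and print the percentage.
sample_iowait() {
    before=$(grep '^cpu ' /proc/stat)
    sleep "${1:-5}"
    after=$(grep '^cpu ' /proc/stat)
    iowait_pct "$before" "$after"
}
```

Calling sample_iowait 5 in a loop during a scrub and appending the output to a file would give a simple record to compare between runs.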


    Best regards

    prtigger

    ryecoaaron


    I guess it's an issue with the folder2ram script. But I don't care anymore, because I fixed it manually...

    Thanks for your information!


    Krisbee

    Attached is a picture of the end of a normal shutdown, just before my system powers off!

    Sometimes it's quick, and sometimes 'Synchronizing SCSI Cache/Stopping disk' can take a long time!

    In this case it took about 3-4 seconds for each disk of the Btrfs RAID1 (the three Samsung SSDs).

    I don't know what's happening at this point, because I can't find this part in the journal!

    I guess the journal service is already down by then...

    During this period of syncing and stopping the SSDs, the drive access LED is permanently on!


    My conclusion at this time:


    The 'Failed unmounting srv-dev\x2ddisk\.......' from the picture in my post #12 is happening here!

    The disk access LED stays permanently on...

    And this causes the server to hang!


    Waiting more than 10 minutes doesn't help!





    Best regards

    prtigger

    ryecoaaron


    No, the folder2ram package was uninstalled correctly when I uninstalled the flashmemory plugin in the GUI!

    I checked this after the uninstall from the GUI (apt list).

    But I lost the connection during the uninstall process (maybe a #502, I can't remember exactly)...


    It doesn't matter, disabling the service fixes it.

    Another solution may be to install the folder2ram package manually and purge it afterwards...


    Best regards

    prtigger

    ryecoaaron

    Thanks for your comments.


    After uninstalling the flashmemory plugin again from the GUI, the temporary folder2ram_cleaner.service was still present and running... Disabling this service removed the link, and the unit file is still present!

    I don't know why not before...


    This cleaner service is no longer necessary and can be removed, because folder2ram is not installed anymore...


    Best regards

    prtigger

    ryecoaaron


    Hi Aaron

    I found a 'folder2ram_cleaner.service' error during shutdown:



    Your flashmemory plugin was not installed!


    I disabled this service to try to clean this up:

    Code
    root@pr-srv-01:~# systemctl disable folder2ram_cleaner.service
    Removed "/etc/systemd/system/multi-user.target.wants/folder2ram_cleaner.service".
    root@pr-srv-01:~#

    Disabling it removed the unit file!


    I reinstalled your flashmemory plugin...


    No 'folder2ram_cleaner.service' unit file was installed!! No such service is running!


    My question:

    Is the 'folder2ram_cleaner.service' outdated and no longer necessary?


    If not, could you please post the file, its directory and its file permissions here, so I can restore it?


    Best regards

    prtigger

    Krisbee

    Thanks for your comment!

    Yes, I'm on the latest 6.12 Debian bpo kernel.

    Normally, I have no problem shutting down the server!

    The failed unmount of the data drives comes up randomly, and sometimes it brings the system to a hang!


    That's the problem...


    I have now permanently removed the quiet boot option, to keep an eye on the shutdown...


    The astonishing thing about the last shutdown was how long stopping each data drive took!

    I don't know what was going on during that time.


    That may be an indication of what the unmounting problem is related to...


    On the next boot I will save the journal.

    After that, I will try the manual scrub...

    while keeping an eye on the I/O waits...


    Best regards

    prtigger

    Krisbee


    Hi Krisbee

    Would it be sensible to deactivate 'fstrim.timer' because of the mount option 'discard=async', to avoid trimming twice?


    The long shutdown time for the three data drives was the sum of long stop times for each single SSD of the Btrfs RAID1...


    Edit: I just found out that the discard=async option is not possible with ext4... There is still my boot SSD with ext4...
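To see at a glance which mounted filesystems already discard continuously and which would rely on 'fstrim.timer' or a manual fstrim, something like the sketch below could help. The function name classify_trim and the two labels it prints are made up for this example; it only inspects mount options and does not trim anything.

```shell
#!/bin/sh
# classify_trim   (hypothetical helper)
# Reads lines of "TARGET FSTYPE OPTIONS" on stdin and labels each mount:
#   continuous-discard : mounted with discard or discard=async
#   timer-or-manual    : no discard option, so trimming depends on
#                        fstrim.timer or a manual fstrim run
classify_trim() {
    while read -r target fstype opts; do
        case ",$opts," in
            *,discard=async,*|*,discard,*)
                echo "$target $fstype continuous-discard" ;;
            *)
                echo "$target $fstype timer-or-manual" ;;
        esac
    done
}

# Live usage: raw findmnt output keeps one mount per line
# findmnt --real -rn -o TARGET,FSTYPE,OPTIONS | classify_trim
```

If the ext4 boot SSD shows up as timer-or-manual, disabling 'fstrim.timer' for the sake of the Btrfs volume would presumably also stop the periodic trim of the boot SSD.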


    Best regards

    prtigger

    Krisbee


    By the way:

    After collecting the information for the post above, I shut down the server at the console.

    I found that shutting down (stop, sync) the data drives (Btrfs RAID1) took a long time!

    That's the heavy traffic shown by the server's drive LED!


    In this case, there was no client access to the server.


    Funny thing: sometimes the shutdown is faster, sometimes it takes longer...


    Best regards

    prtigger

    Krisbee


    Here it is:



    1.

    Code
    root@pr-srv-01:~# findmnt --real -t btrfs
    TARGET                                                     SOURCE                FSTYPE OPTIONS
    /srv/dev-disk-by-uuid-a8a06053-0cc4-491d-adf2-01bb8a02fb44 /dev/sdc              btrfs  rw,relatime,ssd,discard=async,space_cache=v2
    /export/Dream                                              /dev/sdc[/Data/Dream] btrfs  rw,relatime,ssd,discard=async,space_cache=v2
    root@pr-srv-01:~#

    2.

    3.

    Code
    root@pr-srv-01:~# btrfs sub li /srv/dev-disk-by-uuid-a8a06053-0cc4-491d-adf2-01bb8a02fb44
    ID 256 gen 3600 top level 5 path Data
    ID 257 gen 3599 top level 256 path Data/Medien
    ID 258 gen 3312 top level 256 path Data/Dream
    root@pr-srv-01:~#


    prtigger1 Could you add the output of systemctl list-timers  --no-pager to the previous list.

    4.


    Best regards

    prtigger

    crashtest


    Thank you for this interesting information! I'm only using the services you can see in my signature!

    For media I installed the MiniDLNA plugin... Nothing else.


    With the md RAID 5 and Btrfs single installed, I had a maximum storage capacity of around 7 TB...


    My friend is just trying ZFS on identical server hardware.

    I'm still on Btrfs RAID1 across the three SSDs, with a storage capacity of around 5 TB.


    I'm still trying to fix the last shutdown problem... It hasn't come up again so far... It's very rare and random!


    As the next step, I will follow Krisbee's advice!


    Best regards

    prtigger