Profile Details

Recent updates
  • Taryck BENSIALI
    Taryck BENSIALI's reply was accepted as an answer
  • WARNING: PV /dev/xxx is marked in use but no VG was found using it. PV /dev/xxx might need repairing.

    Hi all,

    After moving segments off a failing drive, I get these messages:
    WARNING: PV /dev/sdk is marked in use but no VG was found using it.
    WARNING: PV /dev/sdk might need repairing.
    WARNING: PV /dev/sdl is marked in use but no VG was found using it.
    WARNING: PV /dev/sdl might need repairing.
    WARNING: PV /dev/sdj is marked in use but no VG was found using it.
    WARNING: PV /dev/sdj might need repairing.

    I'm sure I ran a pvmove to /dev/sdj.
    Why are /dev/sdk and /dev/sdl reported as in use? I'm fairly sure they aren't.

    Here are the segments from the VG point of view:
    [root@home store]# lvs -a -o +devices WD_Group
    LV    VG       Attr       LSize   Pool Origin Data%  Meta%  Move Log Cpy%Sync Convert Devices
    Store WD_Group -wi-ao---- <11.83t                                                     /dev/sdb(476932)
    Store WD_Group -wi-ao---- <11.83t                                                     /dev/sdb(0)
    Store WD_Group -wi-ao---- <11.83t                                                     /dev/sdg(0)
    Store WD_Group -wi-ao---- <11.83t                                                     /dev/sdb(953843)
    Store WD_Group -wi-ao---- <11.83t                                                     /dev/sdb(476911)
    Store WD_Group -wi-ao---- <11.83t                                                     /dev/sde(953843)
    Store WD_Group -wi-ao---- <11.83t                                                     /dev/sdg(476911)
    Store WD_Group -wi-ao---- <11.83t                                                     /dev/sdj(0)
    Store WD_Group -wi-ao---- <11.83t                                                     /dev/sdb(953864)
    Store WD_Group -wi-ao---- <11.83t                                                     /dev/sde(1430754)

    Here are the segments from the PV point of view:
    [root@home store]# pvs --segment /dev/sdj
    PV VG Fmt Attr PSize PFree Start SSize
    /dev/sdj WD_Group lvm2 a-- <1.82t 4.00m 0 476931
    /dev/sdj WD_Group lvm2 a-- <1.82t 4.00m 476931 1
    [root@home store]# pvs --segment /dev/sdk
    PV VG Fmt Attr PSize PFree Start SSize
    /dev/sdk WD_Group lvm2 a-- <1.82t <1.82t 0 476932
    [root@home store]# pvs --segment /dev/sdl
    PV VG Fmt Attr PSize PFree Start SSize
    /dev/sdl WD_Group lvm2 a-- <1.82t <1.82t 0 476932

    So sdk and sdl are probably unused disks, but I'm sure sdj is used by the Store LV.
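
The sizes in the pvs output above can be cross-checked with a little arithmetic. This is only a minimal sketch, assuming a physical extent size of 4 MiB. That value is an inference from the single free extent on /dev/sdj being listed as 4.00m of PFree; the output does not state it directly. The function name `extents_to_tib` is hypothetical.

```python
# Assumption: 4 MiB per physical extent, inferred from "PFree 4.00m" next to
# the 1-extent segment on /dev/sdj. It is not stated in the output itself.
EXTENT_MIB = 4


def extents_to_tib(extents, extent_mib=EXTENT_MIB):
    """Convert a count of physical extents to TiB."""
    return extents * extent_mib / (1024 * 1024)


# Segment sizes (SSize column) copied from the pvs output for each PV.
pv_segments = {
    "/dev/sdj": [476931, 1],
    "/dev/sdk": [476932],
    "/dev/sdl": [476932],
}

for pv, sizes in pv_segments.items():
    total = sum(sizes)
    print(f"{pv}: {total} extents = {extents_to_tib(total):.3f} TiB")
```

Under that assumption all three PVs come out at 476932 extents, about 1.819 TiB, which is consistent with the <1.82t shown in the PSize column.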

    Any recommendations on how to troubleshoot this without losing data that I can't reliably back up?

    How can I find the files located in the suspect segments?

  • For the flip-flop, maybe this error message on my ISP box is part of the reason:
    Les périphériques 0c:c4:7a:33:07:8b et 0c:c4:7a:33:07:8a utilisent la même adresse IP : 10.0.0.142.
    (Devices 0c:c4:7a:33:07:8b and 0c:c4:7a:33:07:8a are using the same IP address: 10.0.0.142.)

  • Hi,

    "/scripts/watch-cpu-temp.sh" is a script I wrote to watch the CPU temperature and shut down the system if it rises above a specified value.

    For the flip-flop, I understand.

    I need the 2 NICs/IPs because some services are delivered only on the local address 10.0.0.142 and others only on the remote address 10.0.0.137.
    So my internet connection grants access to 10.0.0.137 but not to 10.0.0.142.

    I've had gateway errors in the past; that's why I need to define one NIC as External.

    But I understand that using distinct subnets could look like this:
    10.0.0.137 255.255.255.0
    10.0.1.142 255.255.255.0

    And DHCP : 10.0.1.0 => 10.0.1.100 mask 255.255.0.0
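
Whether two addresses really end up in distinct subnets depends on the netmask, and this can be checked quickly. The following is a minimal sketch using Python's standard `ipaddress` module; the helper name `same_subnet` is hypothetical, and the addresses and masks are the ones quoted in this thread.

```python
import ipaddress


def same_subnet(ip_a, ip_b, netmask):
    """Return True if both addresses fall in the same network for this mask."""
    net_a = ipaddress.ip_interface(f"{ip_a}/{netmask}").network
    net_b = ipaddress.ip_interface(f"{ip_b}/{netmask}").network
    return net_a == net_b


# Current setup: both NICs on 10.0.0.x with a /24 mask.
print(same_subnet("10.0.0.137", "10.0.0.142", "255.255.255.0"))
# Proposed setup: 10.0.0.137 and 10.0.1.142 with a /24 mask.
print(same_subnet("10.0.0.137", "10.0.1.142", "255.255.255.0"))
# Same proposed addresses, but with the 255.255.0.0 mask from the DHCP line.
print(same_subnet("10.0.0.137", "10.0.1.142", "255.255.0.0"))
```

With 255.255.255.0 the proposed addresses land in different networks, while with 255.255.0.0 they fall back into the same one, so the mask on the DHCP line probably deserves a second look.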

    For the water cooling, too much additive clogs the heat exchanger and too little leads to bacterial growth...
    I've given up. Thanks anyway.

  • Hi,

    Network config:

    Settings
      Network Mode: Standalone - No Firewall
      Hostname: home.domaine.xxx
      Internet Hostname: home.domaine.xxx
      Default Domain: home.domaine.xxx
    DNS
      DNS Server #1: 8.8.8.8
      DNS Server #2: 8.8.4.4
    Network Interfaces
      Interface  Role      Type    IP Address
      enp3s0     LAN       Static  10.0.0.142
      enp4s0     External  Static  10.0.0.137

    Power Supply : https://www.evga.com/products/product.aspx?pn=120-g2-1300-xr

    The water cooling has been removed for the test (it was not operational because of bacterial growth).
    It is now a classic heatsink with a fan.
    The bridge does not have a fan. See https://c1.neweggimages.com/NeweggImage/ProductImage/13-182-761-01.jpg
    It is the grey heatsink below the PCI slots.

    IPMI reported 512 memory errors. I've asked Supermicro for guidance, to find out whether this could halt the system...

  • I think I've unblocked the mail by changing the relay in the mail server.
    I've removed 1736 mails.

    However, I've received mail I do not understand:


    I also receive mail for every cron action, such as the CPU temperature watch:

    How can I disable this, at least for this script?

  • Hi,

    Bought 3 years ago
    Supermicro H8SGL
    AMD G34 16 core
    DDR3 ECC 128 GB
    2 x IBM m1015 converted to LSI MegaRAID SAS 9240-8i
    12 HDD
    2 NIC

    No hardware raid
    Only LVM

    No UPS used

    I also suspect the power supply, as monitoring shows only 11.86 V on the 12 V rail, dropping to 11.76 V.
    I stressed the system for a few minutes with no crash. I ran stress with these options: -c 8 -i 8 -m 2 -d 1

    I've bought a new power supply (with high hopes for it), which reads 12.254 V on the 12 V rail.
    I restarted yesterday at 20:30, and this morning at 10:43 the system crashed again...

    I really did try Ctrl+Alt+Del, but I always end up using the reset button, I guess because the system is not responding.
    The lights on the LSI cards are still blinking...

    I've detected a fairly high system temperature in my IPMI monitoring, which I guess is related to the north and south bridges. It is 61°C at idle and rises to 105°C under heavy load, but the system still responds...

    I'll ask Supermicro for advice.

    I've quickly read: https://www.sraellis.tk/master.php?topic=crash
    However, at this stage I can't say that I've looked at every possible log, as I do not know where to look.
    With /var/log/messages I only get a clue about the time of the crash, and it changes every time...

    The only thing I've added is a Seagate 12 TB hard disk that did not work on ClearOS at the first try but works fine on my PC. I've removed a lot of disks in order to lower the load on the power supply.
    Now I can't remove it, as I do not have enough disks to move the data off it.
    I can't return it, as it works on my PC.

  • Hi Tony,

    My issue is that the system hangs, not that the filesystem is full.
    I need guidance to troubleshoot the cause.
    I suspect a hardware issue, but which one...

    /etc/rsyslog.conf