freeipmi-users
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Freeipmi-users] ipmi-sensors missing data on HP Gen7/8 servers


From: Albert Chu
Subject: Re: [Freeipmi-users] ipmi-sensors missing data on HP Gen7/8 servers
Date: Fri, 12 Apr 2013 10:25:26 -0700

Hey Stephen,

That's exactly what I need.  I'll get it into a FreeIPMI branch so you
can try it out before I release.

Al

On Fri, 2013-04-12 at 09:57 -0700, Stephen Abbene wrote:
> Al
> 
> HP finally got back to me with the information about the error lights.  I 
> have included the email below.  Let me know if there is any other information 
> that you need.
> 
> -----Original Message-----
> 
> Stephen,
> 
> You are certainly welcome.
> 
> I just now received the data you were asking for. As I previously wrote, the 
> UID and HEALTH LED information in the SDR is not currently getting updated. 
> The fix requires changes to iLO 4 firmware and SDR tables (ROM update). These 
> changes will be included in the next iLO4 release 1.30 (ETA Sept 2013).
> 
> The "UID Light" has a sensor type OEM LED (0xC0) and EventReadingTypeCode = 
> UID: (0x70) 0x0001 = On. 0x0002 = Off. 0x0004 = Blinking.
> 
> The "Sys. Health LED" has a sensor type OEM LED (0xC0) and 
> EventReadingTypeCode = HealthLED: (0x71) 0x0001 = Green. 0x0002 = Amber. 
> 0x0004 = Red.
> 
> I hope that meets your needs and answers your questions.
> 
> 
> -----Original Message-----
> From: Albert Chu [mailto:address@hidden 
> Sent: Friday, March 15, 2013 10:16 AM
> To: Stephen Abbene
> Cc: address@hidden
> Subject: RE: [Freeipmi-users] ipmi-sensors missing data on HP Gen7/8 servers
> 
> Cool.  It is definitely reverse engineer-able, but usually requires some 
> vendor provided software to figure out what they think the magic is and doing 
> some tricks to try and get the sensor to do what you want it to do.  But 
> there are gotchas along the way.  The best bet is to just get the real info 
> from the source.
> 
> Al
> 
> On Fri, 2013-03-15 at 10:01 -0700, Stephen Abbene wrote:
> > Thanks Albert,
> > 
> > I will get a hold of my contacts at HP and see if they can get me that 
> > information.  I also have a few HP Gen7/Gen8 servers set aside for testing 
> > and I could try to recreate the different error light states if you think 
> > that would be helpful.
> > 
> > -----Original Message-----
> > From: Albert Chu [mailto:address@hidden
> > Sent: Friday, March 15, 2013 9:57 AM
> > To: Stephen Abbene
> > Cc: address@hidden
> > Subject: RE: [Freeipmi-users] ipmi-sensors missing data on HP Gen7/8 
> > servers
> > 
> > Hey Stephen,
> > 
> > Those are OEM specific sensors.  While I support a number of OEM 
> > sensors in FreeIPMI, I don't yet support these from HP.  I've asked HP 
> > several times for the "magic" to interpret those sensors, but they 
> > have been unable or unwilling to provide me the magic.
> > 
> > I see that you work at Nvidia.  If HP is a partner of yours, perhaps 
> > you might have some leverage to get the "magic" out of them?  
> > Basically, I need the sensor event table so that I know 
> > (hypothetically) 00h = "ok", 01h = "led on", 02h = "led off", 04h = 
> > "blinking", or whatever it may be.
> > 
> > Al
> > 
> > On Thu, 2013-03-14 at 18:21 -0700, Stephen Abbene wrote:
> > > Thanks for replying so Quickly Albert.  `ipmi-sensors -W 
> > > discretereading` seems to have done the trick thank you.  Any tips 
> > > on getting the "System Chassis 1 UID Light" and "System Chassis 2 
> > > Health LED" fields to populate with meaningful data?
> > >
> > > I have included the output of `ipmi-sensors -W discretereading --debug` 
> > > below:
> > >
> > >
> > > -----Original Message-----
> > > From: Albert Chu [mailto:address@hidden
> > > Sent: Thursday, March 14, 2013 5:35 PM
> > > To: Stephen Abbene
> > > Cc: address@hidden
> > > Subject: Re: [Freeipmi-users] ipmi-sensors missing data on HP Gen7/8
> > servers
> > >
> > > Hi Stephen,
> > >
> > > Could you please try the "discretereading" workaround (i.e. -W
> > discretereading).  There's a description of the issue in the manpage 
> > about the issue and it's only been seen on HP systems.  I believe HP 
> > has acknowledged the issue, but due to legacy reasons I don't believe 
> > they want to change it.
> > >
> > > If that doesn't work, please send the --debug output.
> > >
> > > Al
> > >
> > > On Thu, 2013-03-14 at 16:55 -0700, Stephen Abbene wrote:
> > > > Hello,
> > > >
> > > >
> > > >
> > > > Today I installed  freeipmi 1.2.5 on several HP gen7 and Gen8
> > servers and have noticed that all the systems are missing the RPM from 
> > the fans and the power meter information.  Is there a way I can obtain 
> > this information?  I can provide the output from --debug or any other 
> > information you may need.
> > > >
> > > >
> > > >
> > > > Here are some of the specs of one of the systems I am testing, I
> > can provide more information if it is required.
> > > >
> > > > . HP Proliant DL160 G8
> > > >
> > > > . CentOS 5.7
> > > >
> > > > . 2x Intel(R) Xeon(R) CPU E5-2680 0 @ 2.70GHz
> > > >
> > > > . 1x Power Supply (detected)
> > > >
> > > > . 8x fans (detected)
> > > >
> > > >
> > > >
> > > > I have included the output of ` ipmi-sensors
> > --entity-sensor-names`  and `bmc-info` from a HP dl160 Gen8 below.
> > > >
> > > >
> > > >
> > > > # ipmi-sensors --entity-sensor-names
> > > >
> > > > ID | Name                                            | Type
> > | Reading    | Units | Event
> > > >
> > > > 0  | System Chassis 1 UID Light       | OEM Reserved | N/A
> > | N/A   | 'OEM Event = 0000h'
> > > >
> > > > 1  | System Chassis 2 Health LED    | OEM Reserved | N/A        |
> > N/A   | 'OEM Event = 0000h'
> > > >
> > > > 2  | Power Supply 1 Power Supply 1         | Power Supply | N/A
> > | N/A   | 'Presence detected'
> > > >
> > > > 3  | System Board 1 Fan 1                            | Fan
> > | N/A        | N/A   | N/A
> > > >
> > > > 4  | System Board 2 Fan 2                            | Fan
> > | N/A        | N/A   | 'transition to Running'
> > > >
> > > > 5  | System Board 3 Fan 3                            | Fan
> > | N/A        | N/A   | 'transition to Running'
> > > >
> > > > 6  | System Board 4 Fan 4                            | Fan
> > | N/A        | N/A   | 'transition to Running'
> > > >
> > > > 7  | System Board 5 Fan 5                            | Fan
> > | N/A        | N/A   | 'transition to Running'
> > > >
> > > > 8  | System Board 6 Fan 6                            | Fan
> > | N/A        | N/A   | 'transition to Running'
> > > >
> > > > 9  | System Board 7 Fan 7                            | Fan
> > | N/A        | N/A   | 'transition to Running'
> > > >
> > > > 10 | System Board 8 Fan 8                            | Fan
> > | N/A        | N/A   | 'transition to Running'
> > > >
> > > > 11 | System Board 9 Fans                             | Fan
> > | N/A        | N/A   | 'Fully Redundant'
> > > >
> > > > 13 | Air Inlet 01-Inlet Ambient                      | Temperature
> > | 25.00      | C     | 'OK'
> > > >
> > > > 14 | Processor 1 02-CPU 1                            | Temperature
> > | 40.00      | C     | 'OK'
> > > >
> > > > 15 | Processor 2 03-CPU 2                            | Temperature
> > | 40.00      | C     | 'OK'
> > > >
> > > > 16 | Memory Device 1 04-P1 DIMM 1-6                  | Temperature
> > | 38.00      | C     | 'OK'
> > > >
> > > > 17 | Memory Device 2 05-P1 DIMM 7-12                 | Temperature
> > | 39.00      | C     | 'OK'
> > > >
> > > > 18 | Memory Device 3 06-P2 DIMM 1-6                  | Temperature
> > | 34.00      | C     | 'OK'
> > > >
> > > > 19 | Memory Device 4 07-P2 DIMM 7-12                 | Temperature
> > | 29.00      | C     | 'OK'
> > > >
> > > > 20 | Memory Device 5 08-P1 Mem Zone                  | Temperature
> > | 38.00      | C     | 'OK'
> > > >
> > > > 21 | Memory Device 6 09-P1 Mem Zone                  | Temperature
> > | 38.00      | C     | 'OK'
> > > >
> > > > 22 | Memory Device 7 10-P1 Mem Zone                  | Temperature
> > | 39.00      | C     | 'OK'
> > > >
> > > > 23 | Memory Device 8 11-P1 Mem Zone                  | Temperature
> > | 42.00      | C     | 'OK'
> > > >
> > > > 24 | Memory Device 9 12-P1 Mem Zone                  | Temperature
> > | 40.00      | C     | 'OK'
> > > >
> > > > 25 | Memory Device 10 13-P1 Mem Zone                 | Temperature
> > | 39.00      | C     | 'OK'
> > > >
> > > > 26 | Memory Device 11 14-P2 Mem Zone                 | Temperature
> > | 36.00      | C     | 'OK'
> > > >
> > > > 27 | Memory Device 12 15-P2 Mem Zone                 | Temperature
> > | 35.00      | C     | 'OK'
> > > >
> > > > 28 | Memory Device 13 16-P2 Mem Zone                 | Temperature
> > | 34.00      | C     | 'OK'
> > > >
> > > > 29 | Memory Device 14 17-P2 Mem Zone                 | Temperature
> > | 33.00      | C     | 'OK'
> > > >
> > > > 30 | Memory Device 15 18-P2 Mem Zone                 | Temperature
> > | 31.00      | C     | 'OK'
> > > >
> > > > 31 | Memory Device 16 19-P2 Mem Zone                 | Temperature
> > | 30.00      | C     | 'OK'
> > > >
> > > > 32 | Disk 20-HD Max
> > | Temperature  | N/A        | C     | N/A
> > > >
> > > > 33 | System Board 1 21-Chipset                       | Temperature
> > | 46.00      | C     | 'OK'
> > > >
> > > > 34 | Power Supply 2 22-P/S                            |
> > Temperature  | 32.00      | C     | 'OK'
> > > >
> > > > 35 | Power Unit 1 23-VR P1                           | Temperature
> > | 43.00      | C     | 'OK'
> > > >
> > > > 36 | Power Unit 2 24-VR P2                           | Temperature
> > | 28.00      | C     | 'OK'
> > > >
> > > > 37 | Power Unit 3 25-VR P1 Zone                      | Temperature
> > | 45.00      | C     | 'OK'
> > > >
> > > > 38 | Power Unit 4 26-VR P1 Mem                       | Temperature
> > | 40.00      | C     | 'OK'
> > > >
> > > > 39 | Power Unit 5 27-VR P1 Mem                       | Temperature
> > | 38.00      | C     | 'OK'
> > > >
> > > > 40 | Power Unit 6 28-VR P2 Mem
> > | Temperature  | 30.00      | C     | 'OK'
> > > >
> > > > 41 | Power Unit 7 29-VR P2 Mem
> > | Temperature  | 28.00      | C     | 'OK'
> > > >
> > > > 42 | Battery 30-Supercap Max
> > | Temperature  | N/A        | C     | N/A
> > > >
> > > > 43 | System Management Module 31-iLO Zone            | Temperature
> > | 35.00      | C     | 'OK'
> > > >
> > > > 44 | System Board 2 32-LOM
> > | Temperature  | N/A        | C     | N/A
> > > >
> > > > 45 | Add-in Card 1 33-PCI 1                          | Temperature
> > | N/A        | C     | N/A
> > > >
> > > > 46 | Add-in Card 2 34-PCI 2                          | Temperature
> > | N/A        | C     | N/A
> > > >
> > > > 47 | System Internal Expansion Board 1 35-PCI 1 Zone | Temperature
> > | 33.00      | C     | 'OK'
> > > >
> > > > 48 | System Internal Expansion Board 2 36-PCI 2 Zone | Temperature
> > | 33.00      | C     | 'OK'
> > > >
> > > > 49 | Add-in Card 3 37-LOM Card                       | Temperature
> > | N/A        | C     | N/A
> > > >
> > > > 50 | System Board 3 38-System Board                  | Temperature
> > | 26.00      | C     | 'OK'
> > > >
> > > > 51 | Back Panel Board 39-Sys Exhaust                 | Temperature
> > | 40.00      | C     | 'OK'
> > > >
> > > > 52 | System Board 10 Power Meter                     | Current
> > | N/A        | N/A   | 'Device Enabled'
> > > >
> > > > 53 | System Board 11 Memory                          | Memory
> > | N/A        | N/A   | 'Presence detected'
> > > >
> > > >
> > > > # bmc-info
> > > >
> > > > Device ID             : 19
> > > >
> > > > Device Revision       : 1
> > > >
> > > > Device SDRs           : unsupported
> > > >
> > > > Firmware Revision     : 1.10
> > > >
> > > > Device Available      : yes (normal operation)
> > > >
> > > > IPMI Version          : 2.0
> > > >
> > > > Sensor Device         : supported
> > > >
> > > > SDR Repository Device : supported
> > > >
> > > > SEL Device            : supported
> > > >
> > > > FRU Inventory Device  : supported
> > > >
> > > > IPMB Event Receiver   : unsupported
> > > >
> > > > IPMB Event Generator  : unsupported
> > > >
> > > > Bridge                : unsupported
> > > >
> > > > Chassis Device        : supported
> > > >
> > > > Manufacturer ID       : Hewlett-Packard (11)
> > > >
> > > > Product ID            : 8192
> > > >
> > > >
> > > >
> > > > Channel Information
> > > >
> > > >
> > > >
> > > > Channel Number       : 0
> > > >
> > > > Medium Type          : IPMB (I2C)
> > > >
> > > > Protocol Type        : IPMB-1.0
> > > >
> > > > Active Session Count : 0
> > > >
> > > > Session Support      : session-less
> > > >
> > > > Vendor ID            : Intelligent Platform Management Interface
> > forum (7154)
> > > >
> > > >
> > > >
> > > > Channel Number       : 2
> > > >
> > > > Medium Type          : 802.3 LAN
> > > >
> > > > Protocol Type        : IPMB-1.0
> > > >
> > > > Active Session Count : 0
> > > >
> > > > Session Support      : multi-session
> > > >
> > > > Vendor ID            : Intelligent Platform Management Interface
> > forum (7154)
> > > >
> > > >
> > > >
> > > > Channel Number       : 7
> > > >
> > > > Medium Type          : OEM
> > > >
> > > > Protocol Type        : KCS
> > > >
> > > > Active Session Count : 0
> > > >
> > > > Session Support      : session-less
> > > >
> > > > Vendor ID            : Intelligent Platform Management Interface
> > forum (7154)
> > > >
> > > > Thanks,
> > > > --Stephen Abbene
> > > >
> > > > _______________________________________________
> > > > Freeipmi-users mailing list
> > > > address@hidden
> > > > https://lists.gnu.org/mailman/listinfo/freeipmi-users
> > > --
> > > Albert Chu
> > > address@hidden
> > > Computer Scientist
> > > High Performance Systems Division
> > > Lawrence Livermore National Laboratory
> > >
> > >
> > >
> > ----------------------------------------------------------------------
> > -------------
> > > This email message is for the sole use of the intended recipient(s)
> > and may contain
> > > confidential information.  Any unauthorized review, use, disclosure
> > or distribution
> > > is prohibited.  If you are not the intended recipient, please
> > contact the sender by
> > > reply email and destroy all copies of the original message.
> > >
> > ----------------------------------------------------------------------
> > -------------
> > --
> > Albert Chu
> > address@hidden
> > Computer Scientist
> > High Performance Systems Division
> > Lawrence Livermore National Laboratory
> > 
> > 
> --
> Albert Chu
> address@hidden
> Computer Scientist
> High Performance Systems Division
> Lawrence Livermore National Laboratory
> 
> 
-- 
Albert Chu
address@hidden
Computer Scientist
High Performance Systems Division
Lawrence Livermore National Laboratory





reply via email to

[Prev in Thread] Current Thread [Next in Thread]