Documentation ¶
Overview ¶
Package sxid tracks the NVIDIA GPU SXid errors scanning the dmesg. See fabric manager documentation https://docs.nvidia.com/datacenter/tesla/pdf/fabric-manager-user-guide.pdf.
Index ¶
Constants ¶
View Source
const ( StateNameErrorSXid = "error_sxid" StateKeyErrorSXidData = "data" StateKeyErrorSXidEncoding = "encoding" StateValueErrorSXidEncodingJSON = "json" )
View Source
const ( EventNameErroSXid = "error_sxid" EventKeyErroSXidUnixSeconds = "unix_seconds" EventKeyErroSXidData = "data" EventKeyErroSXidEncoding = "encoding" EventValueErroSXidEncodingJSON = "json" )
View Source
const Name = "accelerator-nvidia-error-sxid"
Variables ¶
This section is empty.
Functions ¶
func New ¶
func New() components.Component
Types ¶
type Output ¶
type Output struct { DmesgErrors []nvidia_query_sxid.DmesgError `json:"dmesg_errors,omitempty"` // Recommended course of actions for any of the GPUs with a known issue. // For individual GPU details, see each per-GPU states. // Used for states calls. SuggestedActions *common.SuggestedActions `json:"suggested_actions,omitempty"` // Used for events calls. SuggestedActionsPerLogLine map[string]*common.SuggestedActions `json:"suggested_actions_per_log_line,omitempty"` }
func ParseOutputJSON ¶
func ParseOutputYAML ¶
func ParseStatesToOutput ¶
func ParseStatesToOutput(states ...components.State) (*Output, error)
func (*Output) Events ¶
func (o *Output) Events() []components.Event
Click to show internal directories.
Click to hide internal directories.