fabricmanagerlog

package
v0.2.0 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Nov 21, 2024 License: Apache-2.0 Imports: 8 Imported by: 0

Documentation

Overview

Package fabricmanagerlog implements the fabric manager log poller.

Index

Constants

View Source
const (
	// e.g.,
	// [Jul 23 2024 07:53:55] [ERROR] [tid 841] detected NVSwitch fatal error 20034 on fid 0 on NVSwitch pci bus id 00000000:86:00.0 physical id 3 port 33
	RegexNVSwitchFatalSXidFromLog = `.+detected NVSwitch fatal error (\d+)`

	// e.g.,
	// [Jul 09 2024 18:14:07] [ERROR] [tid 12727] detected NVSwitch non-fatal error 12028 on fid 0 on NVSwitch pci bus id 00000000:86:00.0 physical id 3 port 61
	RegexNVSwitchNonFatalSXidFromLog = `.+detected NVSwitch non-fatal error (\d+)`

	// e.g.,
	// [Sep 17 2024 06:01:46] [ERROR] [tid 1230079] failed to find the GPU handle 5410063385821516767 in the multicast team request setup 6130285411925746235.
	RegexNVSwitchNVLinkFailureFromLog = `.+failed to find the GPU handle \d+ in the multicast team .*`
)

Variables

This section is empty.

Functions

func CreateDefaultPoller

func CreateDefaultPoller(ctx context.Context, cfg query_log_config.Config) error

func ExtractTimeFromLogLine

func ExtractTimeFromLogLine(line []byte) (time.Time, []byte, error)

does not return error for now example log line: "[May 02 2024 18:41:23] [INFO] [tid 404868] Abort CUDA jobs when FM exits = 1" TODO: once stable return error

func GetDefaultPoller

func GetDefaultPoller() query_log.Poller

Types

This section is empty.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL