trackml

v0.1.1 (latest) | Published: May 16, 2018 | License: BSD-3-Clause | Imports: 13 | Imported by: 0

Repository

github.com/sbinet/go-trackml

README ¶

go-trackml

trackml is a Go package to simplify working with the High Energy Physics Tracking Machine Learning challenge.

For more information about the details of what go-trackml does, please have a look at the Python version:

https://github.com/LAL/trackml-library

trackml is a Go reimplementation of the above Python library.

Installation

$> go get github.com/sbinet/go-trackml

Documentation

Served by GoDoc.

Example

$> go get github.com/sbinet/go-trackml/cmd/trkml-hough

$> trkml-hough -h
trkml-hough uses a Hough transform to make predictions.

Usage:

  $> trkml-hough [OPTIONS] <path-to-dataset> <evtid-prefix> [<path-to-test-dataset>]

Examples:

  $> trkml-hough ./example_standard/dataset event000000200
  $> trkml-hough -ncpus=+1 ./example_standard/dataset event000000200
  $> trkml-hough -ncpus=-1 ./example_standard/dataset event000000200
  $> trkml-hough -ncpus=-1 ./train_sample.zip event000001000

Options:

  -ncpus int
    	number of goroutines to use for the prediction (default 1)
  -prof-cpu
    	enable CPU profiling
  -prof-mem
    	enable MEM profiling
  -submit
    	create a submission file

$> ll example_standard/dataset/
total 56M
-rw-r--r-- 1 binet binet  13M Apr 25 18:36 event000000200-cells.csv
-rw-r--r-- 1 binet binet 4.2M Apr 25 18:36 event000000200-hits.csv
-rw-r--r-- 1 binet binet 915K Apr 25 18:36 event000000200-particles.csv
-rw-r--r-- 1 binet binet 9.5M Apr 25 18:36 event000000200-truth.csv
-rw-r--r-- 1 binet binet  14M Apr 25 18:36 event000000201-cells.csv
-rw-r--r-- 1 binet binet 4.5M Apr 25 18:36 event000000201-hits.csv
-rw-r--r-- 1 binet binet 967K Apr 25 18:36 event000000201-particles.csv
-rw-r--r-- 1 binet binet  10M Apr 25 18:36 event000000201-truth.csv

$> time trkml-hough ./example_standard/dataset event000000200
trkml-hough: loading [event000000200 from ./example_standard/dataset]...
trkml-hough: loading [event000000200 from ./example_standard/dataset]... [done]
trkml-hough: score for event 200: 0.1316012364071201
trkml-hough: loading the whole dataset "./example_standard/dataset"...
trkml-hough: score for event 200: 0.1316012364071201
trkml-hough: score for event 201: 0.1332602513710427
trkml-hough: loading the whole dataset "./example_standard/dataset"... [done]
trkml-hough: mean score: 0.13243074388908138

real  1m21.033s
user  1m22.541s
sys   0m0.569s

Compare to the Python version:

$> time python trkml-hough.py ./example_standard/dataset event000000200
   hit_id          x         y       z  volume_id  layer_id  module_id
0       1 -62.663200  -3.05090 -1502.5          7         2          1
1       2 -66.124702  -1.36730 -1502.5          7         2          1
2       3 -63.697701   1.73267 -1502.5          7         2          1
3       4 -82.501801 -14.09150 -1502.5          7         2          1
4       5 -74.343399   0.84469 -1502.5          7         2          1
Your score:  0.13153644878592863
Score for event 200: 0.132
Score for event 201: 0.133
Mean score: 0.132

real  7m12.351s
user  7m10.400s
sys   0m0.828s
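
To put a number on the comparison, the wall-clock ("real") timings of the two runs can be divided. The snippet below is only a convenience sketch: speedup is a hypothetical helper, not part of trackml, and the two durations are copied from the outputs above.

```go
package main

import (
	"fmt"
	"time"
)

// speedup returns how many times longer slow took than fast.
// Both arguments use the "1m21.033s" notation printed by the shell's time command.
func speedup(slow, fast string) (float64, error) {
	s, err := time.ParseDuration(slow)
	if err != nil {
		return 0, err
	}
	f, err := time.ParseDuration(fast)
	if err != nil {
		return 0, err
	}
	return s.Seconds() / f.Seconds(), nil
}

func main() {
	// "real" timings of the Python run and the single-goroutine Go run.
	x, err := speedup("7m12.351s", "1m21.033s")
	if err != nil {
		panic(err)
	}
	fmt.Printf("Go is about %.1fx faster than Python on this sample\n", x)
	// prints: Go is about 5.3x faster than Python on this sample
}
```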

Going parallel

Go has built-in support for concurrent programming, and the trkml-hough command uses it: the -ncpus flag sets the number of goroutines used for the prediction.

$> time trkml-hough -ncpus=-1 ./example_standard/dataset event000000200
trkml-hough: loading [event000000200 from ./example_standard/dataset]...
trkml-hough: loading [event000000200 from ./example_standard/dataset]... [done]
trkml-hough: score for event 200: 0.13160123640712013
trkml-hough: loading the whole dataset "./example_standard/dataset"...
trkml-hough: score for event 200: 0.1316012364071201
trkml-hough: score for event 201: 0.13326025137104267
trkml-hough: loading the whole dataset "./example_standard/dataset"... [done]
trkml-hough: mean score: 0.13243074388908138

real  0m30.081s
user  1m26.658s
sys   0m0.741s
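
How the work is split across goroutines is not shown on this page. The following is a minimal sketch of one common pattern, a fixed pool of goroutines fed by a channel. The names workers and processAll are hypothetical, and treating a value below 1 as "use every available CPU" is an assumption inferred from the -ncpus=-1 invocation above, not documented behaviour.

```go
package main

import (
	"fmt"
	"runtime"
	"sync"
)

// workers resolves an -ncpus style value into a goroutine count.
// Assumption: values below 1 mean "use every available CPU".
func workers(ncpus int) int {
	if ncpus < 1 {
		return runtime.NumCPU()
	}
	return ncpus
}

// processAll applies fn to every input using a pool of goroutines and
// returns the results in input order.
func processAll(inputs []float64, ncpus int, fn func(float64) float64) []float64 {
	out := make([]float64, len(inputs))
	jobs := make(chan int)
	var wg sync.WaitGroup

	n := workers(ncpus)
	for w := 0; w < n; w++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for i := range jobs {
				// Each index is handed to exactly one goroutine, so no lock is needed.
				out[i] = fn(inputs[i])
			}
		}()
	}
	for i := range inputs {
		jobs <- i
	}
	close(jobs)
	wg.Wait()
	return out
}

func main() {
	// Stand-in workload; the real command runs its prediction instead.
	squares := processAll([]float64{1, 2, 3, 4}, -1, func(x float64) float64 { return x * x })
	fmt.Println(squares) // [1 4 9 16]
}
```

In the timings above, user time stays close to the single-goroutine run while real time drops, which is what a pool like this would produce when the work items are independent.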

Submission

$> time trkml-hough -ncpus=-1 -submit ./train_sample.zip event000001000 ./test.zip 
trkml-hough: loading [event000001000 from ./train_sample.zip]...
trkml-hough: loading [event000001000 from ./train_sample.zip]... [done]
trkml-hough: score for event 1000: 0.14063594336939647
trkml-hough: loading the whole dataset "./train_sample.zip"...
trkml-hough: score for event 1000: 0.14063594336939647
trkml-hough: score for event 1001: 0.15109432405046952
trkml-hough: score for event 1002: 0.134897692773808
trkml-hough: score for event 1003: 0.1474692054660444
trkml-hough: score for event 1004: 0.13474093093042283
trkml-hough: loading the whole dataset "./train_sample.zip"... [done]
trkml-hough: mean score: 0.14176761931802823
trkml-hough: loading test dataset "./test.zip"...
trkml-hough: processing event 0...
trkml-hough: processing event 1...
trkml-hough: processing event 2...
[...]
trkml-hough: processing event 124...
trkml-hough: loading test dataset "./test.zip"... [done]

real	19m40.461s
user	57m14.608s
sys	0m10.350s

$> ll submission.csv.gz
-rw-r--r-- 1 binet binet 44M May 16 17:57 submission.csv.gz

Documentation ¶

Overview ¶

Package trackml exposes facilities to ease handling of TrackML datasets.

Index ¶

func Score(evt Event, trkIDs []int) float64
type Cell
type Dataset
- func NewDataset(name string, beg, end int, reader EventReader) (Dataset, error)
type Event
- func ReadEvent(path, evtid string) (Event, error)
- func ReadMcEvent(path, evtid string) (Event, error)
- func (evt *Event) Delete()
type EventReader
type Hit
type Particle
type Submission
- func NewSubmission() (*Submission, error)
- func (sub *Submission) Append(evt Event, trkIDs []int) error
- func (sub *Submission) Close() error
type Truth

Functions ¶

func Score ¶

func Score(evt Event, trkIDs []int) float64

Score computes the TrackML event score for a single event.
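
The scoring algorithm is not described on this page. As a rough illustration, the sketch below follows the double-majority rule from the public description of the challenge metric. It is not the package's implementation: truthHit and score are hypothetical names, and the sketch assumes that trkIDs[i] is the track assigned to the i-th truth entry.

```go
package main

import "fmt"

// truthHit is a trimmed, local stand-in for the documented Truth type.
type truthHit struct {
	PID    int     // particle that produced the hit
	Weight float64 // contribution of the hit to the score
}

// score applies a double-majority rule: a (track, particle) pair counts when it
// holds more than half of the track's hits and more than half of the particle's
// hits. A counted pair contributes the weights of its shared hits.
// Assumption: trkIDs[i] is the track assigned to truth[i].
func score(truth []truthHit, trkIDs []int) float64 {
	if len(truth) != len(trkIDs) {
		return 0
	}
	type key struct{ trk, pid int }
	var (
		perTrack    = map[int]int{}     // number of hits in each track
		perParticle = map[int]int{}     // number of hits of each particle
		shared      = map[key]int{}     // number of hits per (track, particle) pair
		weight      = map[key]float64{} // summed weight per (track, particle) pair
		total       float64
	)
	for i, h := range truth {
		k := key{trkIDs[i], h.PID}
		perTrack[k.trk]++
		perParticle[k.pid]++
		shared[k]++
		weight[k] += h.Weight
		total += h.Weight
	}
	if total == 0 {
		return 0
	}
	var sum float64
	for k, n := range shared {
		if 2*n > perTrack[k.trk] && 2*n > perParticle[k.pid] {
			sum += weight[k]
		}
	}
	return sum / total
}

func main() {
	truth := []truthHit{{1, 0.25}, {1, 0.25}, {1, 0.25}, {2, 0.125}, {2, 0.125}}
	// Particle 1 is fully recovered as track 10; particle 2 is split over two tracks.
	fmt.Println(score(truth, []int{10, 10, 10, 20, 30})) // 0.75
}
```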

Types ¶

type Cell ¶

type Cell struct {
	HitID int
	Ch0   int
	Ch1   int
	Value float64
}

type Dataset ¶

type Dataset struct {
	// contains filtered or unexported fields
}

Dataset is an Event container.

Dataset logically contains many Events; iterate through them with the Next method.

Example:

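ds, err := NewDataset("./example_standard/dataset", 0, -1, nil)
if err != nil {
    panic(err)
}
defer ds.Close()

for ds.Next() {
    evt := ds.Event() // valid until the next call to Next
    _ = evt           // process the event here
}
if err := ds.Err(); err != nil {
    panic(err)
}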

func NewDataset ¶

func NewDataset(name string, beg, end int, reader EventReader) (Dataset, error)

NewDataset returns a Dataset backed by name, a directory or zip file containing the data of many events.

beg and end control the number of events to iterate over.

The returned Dataset will use the reader function to load events from a path. If reader is nil, ReadMcEvent is used.

func (*Dataset) Close ¶

func (ds *Dataset) Close() error

func (*Dataset) Err ¶

func (ds *Dataset) Err() error

func (*Dataset) Event ¶

func (ds *Dataset) Event() Event

Event returns the current event from the dataset. The returned value is valid until a call to Next.

func (*Dataset) Names ¶

func (ds *Dataset) Names() []string

Names returns the list of event IDs this dataset contains.

func (*Dataset) Next ¶

func (ds *Dataset) Next() bool

type Event ¶

type Event struct {
	ID    int        // event id
	Hits  []Hit      // collection of hits for this event
	Cells []Cell     // collection of cells for this event
	Ps    []Particle // collection of reconstructed particles for this event
	Mcs   []Truth    // Monte-Carlo truth for this event
}

Event stores information about a complete HEP event.

func ReadEvent ¶

func ReadEvent(path, evtid string) (Event, error)

ReadEvent reads a complete Event value from the given path+prefix, but without the Monte-Carlo information.

func ReadMcEvent ¶

func ReadMcEvent(path, evtid string) (Event, error)

ReadMcEvent reads a complete Event value from the given path+prefix, including the Monte-Carlo information.

func (*Event) Delete ¶

func (evt *Event) Delete()

Delete zeroes all internal data of an Event and prepares that Event to be collected by the Garbage Collector.
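
The body of Delete is not shown here. One plausible way to get the described behaviour, sketched on trimmed local copies of the types (an assumption, not the package's code), is to overwrite the value with its zero value so that the slices no longer reference their backing arrays:

```go
package main

import "fmt"

// Trimmed local copies of the documented types, for illustration only.
type Hit struct{ HitID int }

type Event struct {
	ID   int
	Hits []Hit
}

// Delete resets the event to its zero value. The slice headers are dropped,
// so the backing arrays can be collected once nothing else references them.
func (evt *Event) Delete() {
	*evt = Event{}
}

func main() {
	evt := Event{ID: 200, Hits: make([]Hit, 3)}
	evt.Delete()
	fmt.Println(evt.ID, evt.Hits == nil) // 0 true
}
```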

type EventReader ¶

type EventReader func(path, evtid string) (Event, error)

EventReader is a function that reads the event identified by evtid from a path.
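
Because EventReader is a plain function type, readers can be wrapped. The sketch below uses trimmed local copies of the documented types so that it runs without data files. onlyVolume and fakeReader are hypothetical helpers and are not part of trackml.

```go
package main

import "fmt"

// Local copies of the documented types, trimmed to what the sketch needs.
type Hit struct {
	HitID    int
	X, Y, Z  float64
	VolumeID int
	LayerID  int
	ModuleID int
}

type Event struct {
	ID   int
	Hits []Hit
}

type EventReader func(path, evtid string) (Event, error)

// onlyVolume wraps a reader and keeps only the hits of one detector volume.
func onlyVolume(next EventReader, volume int) EventReader {
	return func(path, evtid string) (Event, error) {
		evt, err := next(path, evtid)
		if err != nil {
			return Event{}, err
		}
		kept := evt.Hits[:0] // filter in place, reusing the backing array
		for _, h := range evt.Hits {
			if h.VolumeID == volume {
				kept = append(kept, h)
			}
		}
		evt.Hits = kept
		return evt, nil
	}
}

// fakeReader stands in for a real reader such as ReadEvent.
func fakeReader(path, evtid string) (Event, error) {
	return Event{ID: 200, Hits: []Hit{{HitID: 1, VolumeID: 7}, {HitID: 2, VolumeID: 8}}}, nil
}

func main() {
	read := onlyVolume(fakeReader, 7)
	evt, err := read("./example_standard/dataset", "event000000200")
	if err != nil {
		panic(err)
	}
	fmt.Println(evt.ID, len(evt.Hits)) // 200 1
}
```

With the real package, a reader built this way around ReadEvent or ReadMcEvent could be passed as the last argument of NewDataset, which uses the reader function to load events.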

type Hit ¶

type Hit struct {
	HitID    int
	X, Y, Z  float64
	VolumeID int
	LayerID  int
	ModuleID int
}

type Particle ¶

type Particle struct {
	ID         int
	Vx, Vy, Vz float64
	Px, Py, Pz float64
	Q          int
	NHits      int
}

type Submission ¶

type Submission struct {
	// contains filtered or unexported fields
}

Submission creates a CSV file ready for submission to Kaggle.

func NewSubmission ¶

func NewSubmission() (*Submission, error)

func (*Submission) Append ¶

func (sub *Submission) Append(evt Event, trkIDs []int) error

func (*Submission) Close ¶

func (sub *Submission) Close() error
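
None of the Submission methods are documented beyond their signatures. The sketch below shows one way a gzip-compressed CSV like the submission.csv.gz from the README could be produced with the standard library. Everything in it is an assumption made for illustration: the submission type and its methods are hypothetical stand-ins, the event_id, hit_id, track_id column layout is assumed, and trkIDs[i] is taken to be the track assigned to hitIDs[i].

```go
package main

import (
	"bytes"
	"compress/gzip"
	"encoding/csv"
	"fmt"
	"io"
	"strconv"
)

// submission is a hypothetical stand-in for trackml.Submission.
type submission struct {
	gz *gzip.Writer
	cw *csv.Writer
}

func newSubmission(w io.Writer) (*submission, error) {
	gz := gzip.NewWriter(w)
	cw := csv.NewWriter(gz)
	// Assumed column layout.
	if err := cw.Write([]string{"event_id", "hit_id", "track_id"}); err != nil {
		return nil, err
	}
	return &submission{gz: gz, cw: cw}, nil
}

// appendRows writes one row per hit; trkIDs[i] is the track assigned to hitIDs[i].
func (s *submission) appendRows(evtID int, hitIDs, trkIDs []int) error {
	if len(hitIDs) != len(trkIDs) {
		return fmt.Errorf("got %d hits but %d track IDs", len(hitIDs), len(trkIDs))
	}
	for i, hit := range hitIDs {
		row := []string{strconv.Itoa(evtID), strconv.Itoa(hit), strconv.Itoa(trkIDs[i])}
		if err := s.cw.Write(row); err != nil {
			return err
		}
	}
	return nil
}

// close flushes the CSV layer before finishing the gzip stream.
func (s *submission) close() error {
	s.cw.Flush()
	if err := s.cw.Error(); err != nil {
		return err
	}
	return s.gz.Close()
}

// roundTrip writes one event to an in-memory submission and reads it back.
func roundTrip(evtID int, hitIDs, trkIDs []int) ([][]string, error) {
	var buf bytes.Buffer
	sub, err := newSubmission(&buf)
	if err != nil {
		return nil, err
	}
	if err := sub.appendRows(evtID, hitIDs, trkIDs); err != nil {
		return nil, err
	}
	if err := sub.close(); err != nil {
		return nil, err
	}
	gz, err := gzip.NewReader(&buf)
	if err != nil {
		return nil, err
	}
	defer gz.Close()
	return csv.NewReader(gz).ReadAll()
}

func main() {
	rows, err := roundTrip(0, []int{1, 2, 3}, []int{7, 7, 9})
	if err != nil {
		panic(err)
	}
	fmt.Println(rows) // [[event_id hit_id track_id] [0 1 7] [0 2 7] [0 3 9]]
}
```

The order in close matters: the CSV writer buffers rows, so it has to be flushed before the gzip stream is closed, otherwise the tail of the file is lost.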

type Truth ¶

type Truth struct {
	HitID      int
	PID        int
	Tx, Ty, Tz float64
	Px, Py, Pz float64
	Weight     float64
}

Directories ¶

Path	Synopsis
clustering
cmd/trkml-hough	trkml-hough is a simple TrackML example using a Hough transform to make predictions, similar to the Jupyter notebook from https://github.com/LAL/trackml-library.
hough
