Directories
Path | Synopsis |
---|---|
alu | Package alu contains the ALU implementation |
benchmark | Package benchmark contains the driver which links traces and the simulator |
gpc | Package gpc implements the simulation components for the GPC Level. |
gpu | Package gpu implements the simulation components for the GPU level. |
nvidia | Package nvidia includes basic constants, types, and structs for NVIDIA tracing |
sm | Package sm implements the simulation components for the SM level. |
smunit | Package smunit implements the simulation components for the SM Unit level. |
trace | Package trace deals with trace parsing |
| | Package benchmarks defines the Benchmark interface. |
amdappsdk/bitonicsort | Package bitonicsort implements the bitonicsort benchmark from AMDAPPSDK. |
amdappsdk/fastwalshtransform | Package fastwalshtransform implements the fastwalshtransform benchmark from AMDAPPSDK. |
amdappsdk/floydwarshall | Package floydwarshall implements the Floyd-Warshall benchmark from AMDAPPSDK. |
amdappsdk/matrixmultiplication | Package matrixmultiplication implements the matrix multiplication benchmark from AMDAPPSDK. |
amdappsdk/matrixtranspose | Package matrixtranspose implements the matrix transpose benchmark from AMDAPPSDK. |
amdappsdk/nbody | Package nbody includes the NBody sample benchmark, derived from the SDKSample base class. |
amdappsdk/simpleconvolution | Package simpleconvolution implements the Simple Convolution benchmark from AMDAPPSDK. |
dnn/dataset/cifar10 | Package cifar10 provides an interface to read the cifar-10 dataset. |
dnn/dataset/imagenet | Package imagenet provides an interface to read the ImageNet dataset. |
dnn/dataset/mnist | Package mnist provides an interface to read the MNIST dataset. |
dnn/gputensor | Package gputensor provides GPU tensor and tensor operation implementations. |
dnn/gputraining | Package gputraining defines the GPU training process. |
dnn/layer_benchmarks/conv2d | Package conv2d defines a benchmark for the Convolutional Layer. |
dnn/layer_benchmarks/im2col | Package im2col defines a benchmark for the im2col operation. |
dnn/layer_benchmarks/relu | Package relu implements the relu algorithm as a benchmark. |
dnn/layers | Package layers defines common DNN layers. |
dnn/tensor | Package tensor defines the tensor interface. |
dnn/training | Package training implements the gradient descent training method. |
dnn/training/optimization | Package optimization defines commonly used optimization algorithms. |
dnn/training_benchmarks/lenet | Package lenet implements lenet network training. |
dnn/training_benchmarks/minerva | Package minerva implements minerva network training. |
dnn/training_benchmarks/vgg16 | Package vgg16 implements VGG16 network training. |
dnn/training_benchmarks/xor | Package xor implements an extremely simple network that can perform the xor operation. |
heteromark/aes | Package aes implements the AES benchmark from Hetero-Mark. |
heteromark/fir | Package fir implements the FIR benchmark from Hetero-Mark. |
heteromark/kmeans | Package kmeans implements the Kmeans benchmark from Hetero-Mark. |
heteromark/pagerank | Package pagerank implements the PageRank benchmark from Hetero-Mark. |
matrix/csr | Package csr provides a csr matrix definition |
mccl | Package mccl provides a collective communication library implementation. |
polybench/atax | Package atax implements the ATAX benchmark from Polybench. |
polybench/bicg | Package bicg implements the bicg benchmark from Polybench. |
rodinia/nw | Package nw defines the Needleman–Wunsch benchmark |
shoc/bfs | Package bfs implements the bfs benchmark from the SHOC suite. |
shoc/fft | Package fft includes the Fourier transform benchmark |
shoc/spmv | Package spmv includes the sparse matrix-vector multiplication benchmark. |
shoc/stencil2d | Package stencil2d implements the stencil2d benchmark from the SHOC suite. |
| | Package bitops defines commonly used bit operations |
| | Package driver implements a GPU driver that interfaces the benchmarks with the simulator. |
internal | Package internal provides support for the driver implementation. |
| | Package emu emulates GCN3 instructions. |
| | Package insts provides the definition for GCN3 instructions. |
| | Package kernels provides basic data definitions related to GPU kernels. |
| | Package protocol defines the common messages used in MGPUSim |
samples | |
runner | Package runner defines how default benchmark samples are executed. |
| | Package server defines a server that can receive commands from external applications. |
tests | |
timing | |
cp | Package cp defines the Command Processor component of a GCN3 GPU |
cp/internal/dispatching | Package dispatching defines how work-groups and wavefronts are dispatched to compute units. |
cp/internal/resource | Package resource manages the Compute Unit resources |
cu | Package cu provides an implementation of detailed Compute Unit modeling. |
pagemigrationcontroller | Package pagemigrationcontroller provides an implementation of a PageMigrationController. |
rdma | Package rdma provides the implementation of an RDMA engine. |
rob | Package rob implements a reorder buffer for memory requests. |
wavefront | Package wavefront defines concepts related to a wavefront. |