# Saccade
This does predictive learning of saccade-related signals only, in contrast to the wwi3d model which predicts the shape of the object as well. Thus, this model should be able to achieve essentially perfect predictive accuracy.
## Network Layers
The primary layers in the model are:
- **V1f** = primary visual cortex, full-field (peripheral): represents the visual image (idealized blobs) on a 2D retinotopic map, with one or more blobs, one of which is the target.
- **S1e** = primary proprioceptive somatosensory eye position map (WangZhangCohenEtAl07): cells have gaussian direction coding and graded-slope eccentricity coding, roughly linear from the center out. Orientation is coded on the X axis, with vertical in the center, angles to the left on the left, etc. To maintain consistency with the FEF motor code, and to have a simpler overall activity level profile, eccentricity coding in the model is also gaussian, with preferred eccentricities progressing outward across the higher rows (see the popcode sketch after this list).
- **SCd** = deep superior colliculus, which is the motor output for eye movement. It has a topographically organized map in polar coordinates (orientation and eccentricity), organized in the model like S1e.
- **SCs** = superficial superior colliculus, which generates saccade plans based on sensory inputs. It has a topographically organized map in polar coordinates (orientation and eccentricity), organized in the model like S1e and SCd.
- **MDe** = medial dorsal thalamus, representing the eye motor signals. It receives a corollary discharge projection from SCd, reflecting the actual eye motor action taken, which drives the plus phase on MD. MD also receives a top-down projection from FEF, representing a kind of prediction relative to the SCd-driven plus phase -- this error signal is how FEF learns to send signals in the proper language of the motor system.
- **FEF** = primary motor frontal eye fields, sending activity to SCd to drive cortically-driven eye movements, and also a corollary signal to MDe representing the prediction.
- **SEF** = supplementary / second-order eye fields, which learns to predict FEF activity and provides more strategic top-down motor plans to FEF, e.g., for sequencing.
- **LIP** = receives from and predicts V1, S1e, and FEF, sending activity to FEF and SEF to drive saccades to the attentionally-selected V1 target.
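For concreteness, here is a minimal numpy sketch of the kind of gaussian polar popcode described above for S1e / SCs / SCd. The layer dimensions, normalization, and bump width are illustrative assumptions, not the model's actual values:

```python
import numpy as np

def polar_popcode(angle_deg, ecc, n_angle=13, n_ecc=7, sigma=0.2):
    """Gaussian population code over a 2D polar map.

    Columns (X axis) tile saccade direction, with vertical (0 deg) in the
    center and leftward angles to the left; rows tile eccentricity, with
    preferred eccentricity increasing across the higher rows.  All sizes
    and the normalization below are assumptions for illustration.
    """
    ang_prefs = np.linspace(-1.0, 1.0, n_angle)  # normalized angle preferences
    ecc_prefs = np.linspace(0.0, 1.0, n_ecc)     # 0 = fovea, 1 = far periphery

    ang = angle_deg / 90.0                       # assumed: +/-90 deg maps to +/-1
    ang_act = np.exp(-0.5 * ((ang_prefs - ang) / sigma) ** 2)
    ecc_act = np.exp(-0.5 * ((ecc_prefs - ecc) / sigma) ** 2)

    # outer product gives the full 2D map: rows = eccentricity, cols = angle
    return np.outer(ecc_act, ang_act)

# example: target 30 deg to the right of vertical, at half-maximal eccentricity
bump = polar_popcode(30.0, 0.5)
print(bump.shape)        # (7, 13)
```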
## Circuits
- **V1 -> LIP <-> FEF <-> MDe -> SCd** -- basic sensory-motor pathway, where the visual target represented in LIP drives motor output, with FEF transforming the retinotopic coordinates of LIP into the polar motor coordinates of MDe, which is constrained by SCd for its representational structure. In the model, the MDe plus phase driven by the SCd corollary discharge acts like a standard target layer. A reference sketch of this coordinate transform follows.
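The coordinate transform that FEF has to learn (via the MDe error signal) can be written down in closed form for reference. This is just the geometric mapping, assuming the angular convention described for S1e above (vertical = 0, angles signed left/right); it is not code from the model:

```python
import numpy as np

def retinotopic_to_polar(dx, dy):
    """Map a retinotopic target offset (dx, dy), in degrees relative to the
    fovea, to the polar (direction, eccentricity) code used on the motor
    side (FEF / MDe / SCd).  FEF must learn this mapping from the MDe
    plus-phase error signal; computing it directly here just shows what
    the target of that learning is.
    """
    ecc = np.hypot(dx, dy)                 # eccentricity = distance from fovea
    ang = np.degrees(np.arctan2(dx, dy))   # 0 = vertical, positive = rightward
    return ang, ecc

# a blob 5 deg right and 5 deg above fixation:
ang, ecc = retinotopic_to_polar(5.0, 5.0)
print(f"direction = {ang:.1f} deg, eccentricity = {ecc:.2f} deg")  # 45.0, 7.07
# the saccade that foveates the blob has exactly this polar motor vector
```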
## Paradigm
Multiple blobs in the input, fixate on an attentionally-selected one in the periphery, predict where the attentional blob (and others?) go.
Timeline:
- T0: visual input presented, saccade planned (FEF super) then executed in the plus phase (-> FEF deep). Can have a random initial eye position.
- T1: new visual input, based on the actual saccade; predicted vs. actual can be compared for everything, e.g., eye position (see the toy trial sketch below).
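A toy version of this two-tick trial structure, assuming a simple 2D world with additive eye / retina geometry (the real environment is implemented in sac_env and may differ in its details):

```python
import numpy as np

rng = np.random.default_rng(0)

def run_trial(n_blobs=3, fov=10.0):
    """One two-tick trial of the paradigm (illustrative only)."""
    eye = rng.uniform(-fov / 2, fov / 2, size=2)       # random initial eye position
    blobs = rng.uniform(-fov, fov, size=(n_blobs, 2))  # blob positions in world coords
    target = rng.integers(n_blobs)                     # attentionally-selected blob

    # T0: retinotopic input (world - eye) and the saccade that foveates the target
    retinal_t0 = blobs - eye
    saccade = retinal_t0[target]

    # T1: the saccade is executed; the new retinotopic input is fully
    # determined by the T0 state plus the motor action, so it is predictable.
    eye = eye + saccade
    retinal_t1 = blobs - eye
    return target, saccade, retinal_t0, retinal_t1

target, saccade, r0, r1 = run_trial()
print("saccade:", saccade.round(2))
print("target at T1 (should be ~0,0):", r1[target].round(2))
```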
## Goal-learning
The key idea for turning predictive learning into prospective action selection is to learn the sensory outcome state + motor action at the same time. However, in the saccade case, and more generally, the sensory pre-conditions for driving the saccade (i.e., a V1 / LIP blob somewhere off-center) are incompatible with the sensory conditions post-saccade (blob in the center), so it is just not clear how this is supposed to work.
Also, just letting FEF / MD be hidden layers and do their own thing on the 2nd tick is not working at all.
Alternative ideas:
- Really need the higher order layers to learn sensory + motor states -- is there a way to do the outer-loop, longer-time-scale prediction story here, in higher layers?
## Parameters
The `sac_env` specifies the width of the popcode bumps -- it is better to have these relatively wide: narrow bumps require more inhibition to restrict activity to the bumps, which makes the network unstable. Also, wider bumps support more units voting and, in principle, better accuracy overall.
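A small, self-contained sketch of the "voting" intuition, comparing decoding error for narrow vs. relatively wide gaussian bumps under activity noise (the unit counts, ranges, and noise level are made up for illustration, not taken from `sac_env`):

```python
import numpy as np

rng = np.random.default_rng(0)

# units tile a range slightly wider than the encoded values, so that wide
# bumps are not truncated at the edges (an assumption of this sketch)
prefs = np.linspace(-0.5, 1.5, 25)
noise_sd = 0.05

def encode(x, sigma):
    """Gaussian popcode bump of width sigma centered on value x."""
    return np.exp(-0.5 * ((prefs - x) / sigma) ** 2)

def decode(acts):
    """Activity-weighted average of preferred values (units 'voting')."""
    return np.sum(acts * prefs) / np.sum(acts)

for sigma in (0.05, 0.2):  # narrow vs. relatively wide bumps
    errs = []
    for _ in range(2000):
        x = rng.uniform(0.0, 1.0)
        acts = encode(x, sigma) + noise_sd * rng.standard_normal(prefs.size)
        acts = np.clip(acts, 0.0, None)   # activities cannot go negative
        errs.append(abs(decode(acts) - x))
    print(f"sigma = {sigma}: mean decoding error = {np.mean(errs):.3f}")
```

With the narrow bumps, only one or two units carry signal, so the noise on all the other units pulls the weighted average around; with the wide bumps, many units vote and the noise largely averages out, giving a clearly lower decoding error.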