README ¶
Kinase Equations
See: https://github.com/ccnlab/kinase/tree/main/sims/kinase for a parallel exploration based on a biophysically detailed model, building up from the Urakubo et al (2008) model.
This standalone simulation is used to explore the synapse-level updating of calcium-based signals that drive the more abstract forms of Kinase learning rules, based on recv and send spiking impulses, passing through a cascade of exponential integrations with different time constants.
The Leabra and initial Axon equations used these cascading updated variables on each neuron (recv and send) separately, with the final values multiplied for the CHL-like DWt function.
-
CaM
= first level integration:CaM += (g * Spike - CaM) / MTau
-- spike drives up toward a max per-spike amount with gain factor (g = 8 std). The g multiplier gives a faster effective rate constant for going up vs. decay back down -- this just sets the overall scale of the values. This represents an abstract version of CaM (calmodulin) which is activated by calcium influx from NMDA receptors. This isAvgSS
in Leabra. -
CaP
= LTP, plus phase faster integration of CaM, reflecting CaMKII biologically. Faster time constant makes this reflect the plus phase relative to theCaD
LTD, minus-phase variable. This isAvgS
in Leabra. -
CaD
= LTD, minus phase slower integration of CaP (cascaded one further step), reflecting DAPK1 biologically. This isAvgM
in Leabra.
The CHL product-based learning rule is a function of the difference between the plus-phase vs. minus-phase products of factors for S = Sender and R = Receiver:
PDWt = SCaP * RCaP - SCaD * RCaD
This is the NeurSpkCa
algorithm -- separate neuron-level Ca integration.
For the rate-code activations in Leabra, the product of these averages is likely to be similar to the average of the products at a synapse level, and computing neuron-level values is much faster computationally than integrating the products at the synapse level. Indeed, experiments (a long time ago) showed no advantages to doing the synapse-level integration in Leabra.
Synapse-level integration of spikes
For spiking, the relative timing of pre-post spiking has been an obsession since the discovery of STDP.
However, at a computational level, capturing these pre-post timing interactions clearly requires computationally-expensive synapse-level integration. Thus, a major technical issue we address is how to more efficiently integrate this synapse-level signal at the theoretically most efficient level which is when either the sender or receiver spikes, with the subsequent integration computed based on time passed instead of incrementally updating.
The first issue is how the pre-post spikes interact in computing the Ca and CaM first-level integrations of synaptic activity. Because each spike is itself a discrete event, there needs to be some kind of time window over which the pre-post spiking interact, if they are to have any interaction at all. At the most extreme end of the non-interaction spectrum is the simple "OR" rule:
SynOrSpk
"OR" rule:SynSpk = SSpk || RSpk
-- either spike counts, and there is no explicit interaction at the level of the Ca influx per spike, or anything distinctive about a pre vs. postsynaptic spike.
However, this does not actually work in practice, despite many attempts with the ra25
basic example. In effect, the sender and receiver are each broadcasting their spikes to all the other neurons, and the lack of interaction leads to a kind of sum across these spikes, producing insufficient levels of selectivity across different neurons.
SynCaMProd
: A simple next step is to use the product of the pre and postCaM
values, each separately integrated over a roughly 20 msec tau window, to kick off the synaptic integration. This works, but is not notably different in performance fromSynNMDACa
.
TODO: explore this version in this model.
Counterintuitively, under typical conditions in dendritic spines as explored in the biophysical kinase model, this is actually a reasonable first-order approximation of calcium influx from pre-post spikes. The postsynaptic Vm is often reasonably depolarized in active neurons in awake behaving conditions, so the Glu release from pre-spikes has a direct impact, while there is often enough residual channel open state from prior Glu release for backpropagating action potentials to drive an extra burst of Ca influx.
In this context, it is critical to consider more naturalistic conditions relative to the relatively sterile pairs of isolated spikes in the standard STDP paradigm, which is highly unrepresentative and thus can give rise to incorrect assumptions and intuitions about the dynamics in more naturalistic conditions.
Also, the SynSpk
value then drives the CaM, CaP, CaD cascade of time integrations, and this integration process is remarkably sensitive to the ordering of spikes as they come in: the integration dynamics produce strong interactions across the different timescales, as became more clear when implementing the computational optimization described below: a huge 3 factor lookup table is needed to capture the interactions between the 3 timescales. Thus, even if the spikes themselves don't interact per-se, the individual synapse will end up with a unique integrated signal, relative to what would have otherwise happened at each send, recv neuron independently.
These interactions in integration are evident in the above figure. The blue CaP
line shows how sequences of closely-spaced spikes ramp up quickly -- thus roughly synchronous firing between pre and post neurons hitting the synapse in rapid succession can produce significantly stronger learning signals relative to more spaced-out spiking. Due to the common starting point of the spike trains, this rough synchrony is present in the above trace.
Due to the temporal-difference nature of the learning mechanism, LTP will be driven when there is a progression toward increasing synchrony over time, such that the faster CaP
signal rises above the more slowly adapting CaD
one toward the end of the 200 msec theta cycle window.
It is also notable that the CaD
value in both of these cases reaches its asymptotic value right around the 200 msec point, suggesting that the theta cycle is the relevant timescale for learning. This is due to the rate constants, which are biologically constrained based on the dynamics of Ca influx and the increase rate of CaMII as explored in the biophysical model: kinase.
It is also evident that the precise timing of the
SynSpkCa DWt is much less variable than NeurSpkCa
One of the most salient results from this model is that integrating spiking at the synapse level ("average of products") produces a much less variable DWt signal than doing it at the neuron level ("product of averages"). The intuition is that for the latter (aka NeurSpkCa
), a great deal depends on the precise timing of the last few spikes at the end of the sequence, and the product operation itself introduces further variance (each term has multiplicative "leverage" over the other).
Here are results for different matched levels of firing, showing final DWt values after each of 10000 different 200 msec trials of learning, with the indicated level of firing on both send and recv, with no change over time. The expected DWt value should be 0 as there is no minus-plus phase difference. SynPDwt
is the "product" dwt (NeurSpkCa
), and SynCDWt
is SynSpkCa
continuously integrated.
100 Hz:
50 Hz:
20 Hz:
The differences are greatest when the firing rate is high, where NeurSpkCa
is highly variable, while SynSpkCa
is relatively consistent across different rates.
Optimized SynSpkCa integration only at Spikes
For the simple SynSpkCa
rule, it is possible to only update values at each spike event, because the intervening dynamics are fully determined therefrom. However, due to the cascading nature of the integration, it is highly non-linear and a closed-form equation is unlikely to exist.
In a first pass implementation attempt, a massive lookup table was created to accurately capture the dynamics, as a function of the exact CaM, CaP, and CaD levels at the moment a new spike comes in, implemented in the kinase
package, using the Funts
function tables. With a resolution of .01, the results were quite accurate. See results/fig_synspk_mpd_optimized_fit_*
figures for results.
However, in practice, this lookup table approach ended up being much slower than just computing the synaptic values continuously, because the lookup table is so big that random access of it creates major memory cache disruption, and significant slowing.
Thus, a new approach is to still only compute at spike intervals, but use a for loop to iterate the updating from the prior point of time! That is likely to be much more efficient!