A neighbourhood of pRAMs can generate complex penalty inputs to adjoining pRAMs
through the use of non-learning pRAMs, which allow the penalty input to be a
non-linear function of the neighbouring pRAM outputs.
A similar arrangement can be made for the reward inputs.
The non-learning pRAMs act as look-up tables.
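
The look-up-table role can be illustrated with a minimal sketch. It assumes, for illustration only, that the non-learning pRAM has two binary inputs taken from neighbouring pRAM outputs and that its stored values are fixed at 0 or 1, so its output is deterministic. The names `address`, `PENALTY_TABLE` and `penalty_input`, and the choice of XOR as the non-linear function, are hypothetical and not taken from the text.

```python
from typing import Sequence


def address(bits: Sequence[int]) -> int:
    """Pack binary neighbour outputs into a table address.

    The first element becomes the most significant address bit.
    """
    addr = 0
    for b in bits:
        addr = (addr << 1) | (b & 1)
    return addr


# Hypothetical fixed contents of a 2-input non-learning pRAM.
# The entries are never updated, so the device behaves as a look-up table.
# XOR is used because it is not a linear function of the two inputs.
PENALTY_TABLE = [0, 1, 1, 0]


def penalty_input(neighbour_outputs: Sequence[int]) -> int:
    """Return the penalty signal selected by the neighbouring pRAM outputs."""
    return PENALTY_TABLE[address(neighbour_outputs)]


if __name__ == "__main__":
    for a in (0, 1):
        for b in (0, 1):
            print(a, b, penalty_input([a, b]))
```

Under the same assumptions, a second fixed table with different contents could supply the reward input.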