The higher this is, the more the future (distant reward) becomes important in making
a decision -- but beware of infinite loops and explosions of value.
Affect PolicyExtractor to load value functions if the state does not reside in the current value function; if the state does not support SubdivisionIdentification the PolicyExtractor will communicate this.