Updating the actor-critic model