Gathering n-step experiences using the current policy
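
A minimal sketch of what this step can look like, under stated assumptions: `ChainEnv`, `gather_n_steps`, and `n_step_return` are hypothetical names introduced only for illustration, and the `reset()`/`step()` interface is a simplified stand-in rather than any particular library's API. The idea is to roll the current policy forward for n environment steps, record each transition, and carry the last state over to the next rollout.

```python
from typing import Callable, List, Tuple

# (state, action, reward, next_state, done)
Transition = Tuple[int, int, float, int, bool]


class ChainEnv:
    """Hypothetical toy environment: a chain of `length` states.

    Action 1 moves one state to the right, action 0 stays in place.
    Reaching the last state yields reward 1.0 and ends the episode.
    """

    def __init__(self, length: int = 5) -> None:
        self.length = length
        self.state = 0

    def reset(self) -> int:
        self.state = 0
        return self.state

    def step(self, action: int) -> Tuple[int, float, bool]:
        if action == 1:
            self.state = min(self.state + 1, self.length - 1)
        done = self.state == self.length - 1
        reward = 1.0 if done else 0.0
        return self.state, reward, done


def gather_n_steps(
    env: ChainEnv,
    policy: Callable[[int], int],
    state: int,
    n: int,
) -> Tuple[List[Transition], int]:
    """Run the current policy for n steps and return the transitions
    together with the state from which the next rollout should continue."""
    transitions: List[Transition] = []
    for _ in range(n):
        action = policy(state)
        next_state, reward, done = env.step(action)
        transitions.append((state, action, reward, next_state, done))
        # Start a fresh episode if this one ended inside the rollout.
        state = env.reset() if done else next_state
    return transitions, state


def n_step_return(
    transitions: List[Transition],
    gamma: float,
    bootstrap_value: float = 0.0,
) -> float:
    """Discounted return from the first transition of the rollout.

    `bootstrap_value` is an assumed value estimate for the state that
    follows the last transition; it is discarded across episode ends.
    """
    ret = bootstrap_value
    for _, _, reward, _, done in reversed(transitions):
        if done:
            ret = 0.0  # nothing after a terminal step counts
        ret = reward + gamma * ret
    return ret
```

For example, with `env = ChainEnv(length=3)` and a policy that always returns action 1, calling `gather_n_steps(env, policy, env.reset(), 3)` produces three transitions, the second of which ends an episode. Resetting inside the loop keeps every rollout exactly n steps long, which is one common design choice; stopping the rollout at the terminal step is an equally valid alternative.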