ba96e585d2ac8e1f940080d5b669976bdec8723b,agents/agent.py,Agent,update_log,#Agent#Any#,153
Before Change
logger.create_signal_value("Epsilon", self.exploration_policy.get_control_param())
if phase == RunPhase.TRAIN:
logger.create_signal_value("Training Reward", self.total_reward_in_current_episode)
elif phase == RunPhase.TEST:
logger.create_signal_value("Evaluation Reward", self.total_reward_in_current_episode)
logger.update_wall_clock_time(self.current_episode)
After Change
if phase == RunPhase.TRAIN else np.nan)
logger.create_signal_value("Evaluation Reward", self.total_reward_in_current_episode
if phase == RunPhase.TEST else np.nan)
logger.create_signal_value("Update Target Network", 0, overwrite=False)
logger.update_wall_clock_time(self.current_episode)
for signal in self.signals:
logger.create_signal_value("{}/Mean".format(signal.name), signal.get_mean())
In pattern: SUPERPATTERN
Frequency: 3
Non-data size: 2
Instances
Project Name: NervanaSystems/coach
Commit Name: ba96e585d2ac8e1f940080d5b669976bdec8723b
Time: 2018-02-12
Author: itai.caspi@intel.com
File Name: agents/agent.py
Class Name: Agent
Method Name: update_log
Project Name: NervanaSystems/coach
Commit Name: 582921ffe3b04ff502e1c3a05088ba2902e0f5bd
Time: 2019-05-02
Author: gal.leibovich@intel.com
File Name: rl_coach/agents/value_optimization_agent.py
Class Name: ValueOptimizationAgent
Method Name: run_off_policy_evaluation
Project Name: NervanaSystems/coach
Commit Name: e3c7e526c78e2cc039621ef58cf062cce3a65697
Time: 2019-03-19
Author: gal.leibovich@intel.com
File Name: rl_coach/logger.py
Class Name: BaseLogger
Method Name: update_wall_clock_time