a65ee92b69c9dfa05defbee17abd7af09103f88e,rllib/agents/marwil/marwil_torch_policy.py,,marwil_loss,#,44

Before Change


    adv = advantages - state_values
    // Update averaged advantage norm.
    policy.ma_adv_norm.add_(
        1e-6 * (torch.mean(torch.pow(adv, 2.0)) - policy.ma_adv_norm))
    // //xponentially weighted advantages.
    exp_advs = torch.exp(policy.config["beta"] *
                         (adv / (1e-8 + torch.pow(policy.ma_adv_norm, 0.5))))

After Change



    // Advantage estimation.
    adv = advantages - state_values
    adv_squared = torch.mean(torch.pow(adv, 2.0))

    // Value loss.
    policy.v_loss = 0.5 * adv_squared
Italian Trulli
In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 2

Instances


Project Name: ray-project/ray
Commit Name: a65ee92b69c9dfa05defbee17abd7af09103f88e
Time: 2021-01-19
Author: sven@anyscale.io
File Name: rllib/agents/marwil/marwil_torch_policy.py
Class Name:
Method Name: marwil_loss


Project Name: OpenNMT/OpenNMT-py
Commit Name: 0e7bfac76b97e7630102687f7e6309e00e190091
Time: 2018-11-13
Author: benzurdopeters@gmail.com
File Name: onmt/decoders/ensemble.py
Class Name: EnsembleGenerator
Method Name: forward


Project Name: ray-project/ray
Commit Name: b7dbbfbf4111698145bb9e0bf2e34e36fef0430c
Time: 2020-11-25
Author: sven@anyscale.io
File Name: rllib/agents/sac/tests/test_sac.py
Class Name: TestSAC
Method Name: _sac_loss_helper