e4f051b6cce414997a97b896276563c4e361d0b8,ch09/04_cartpole_pg.py,,,#,35
Before Change
new_logits_v = net(states_v)
new_prob_v = F.softmax(new_logits_v, dim=1)
kl_div_v = -((new_prob_v / prob_v).log() * prob_v).sum(dim=1).mean()
writer.add_scalar("kl", kl_div_v.data.cpu().numpy()[0], step_idx)
grad_max = 0.0
grad_means = 0.0
After Change
// calc KL-div
new_logits_v = net(states_v)
new_prob_v = F.softmax(new_logits_v, dim=1)
kl_div_v = -((new_prob_v / prob_v).log() * prob_v).sum(dim=1).mean()
writer.add_scalar("kl", kl_div_v.item(), step_idx)
grad_max = 0.0
grad_means = 0.0
In pattern: SUPERPATTERN
Frequency: 8
Non-data size: 9
Instances
Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: e4f051b6cce414997a97b896276563c4e361d0b8
Time: 2018-04-27
Author: max.lapan@gmail.com
File Name: ch09/04_cartpole_pg.py
Class Name:
Method Name:
Project Name: eriklindernoren/PyTorch-GAN
Commit Name: 1e35104169479eec418f5846b14904a8069c3b67
Time: 2018-04-25
Author: eriklindernoren@gmail.com
File Name: implementations/acgan/acgan.py
Class Name:
Method Name:
Project Name: eriklindernoren/PyTorch-GAN
Commit Name: 1e35104169479eec418f5846b14904a8069c3b67
Time: 2018-04-25
Author: eriklindernoren@gmail.com
File Name: implementations/ccgan/ccgan.py
Class Name:
Method Name:
Project Name: eriklindernoren/PyTorch-GAN
Commit Name: 1e35104169479eec418f5846b14904a8069c3b67
Time: 2018-04-25
Author: eriklindernoren@gmail.com
File Name: implementations/lsgan/lsgan.py
Class Name:
Method Name:
Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: e4f051b6cce414997a97b896276563c4e361d0b8
Time: 2018-04-27
Author: max.lapan@gmail.com
File Name: ch09/04_cartpole_pg.py
Class Name:
Method Name:
Project Name: eriklindernoren/PyTorch-GAN
Commit Name: 1e35104169479eec418f5846b14904a8069c3b67
Time: 2018-04-25
Author: eriklindernoren@gmail.com
File Name: implementations/sgan/sgan.py
Class Name:
Method Name:
Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: 7a6e3b93fb4b97af7b06244b768b1fee4b547c17
Time: 2018-04-29
Author: max.lapan@gmail.com
File Name: ch12/train_crossent.py
Class Name:
Method Name:
Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: 8165c35c15dbbe78c4cb5d3ccb8d7837db0f6f7f
Time: 2018-04-29
Author: max.lapan@gmail.com
File Name: ch18/train.py
Class Name:
Method Name:
Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: d5b0cd8e7960c247bb7c5b7c832358f8831780fb
Time: 2018-04-29
Author: max.lapan@gmail.com
File Name: ch15/03_train_trpo.py
Class Name:
Method Name: