Navigation
API > API/Plugins > API/Plugins/LearningAgentsTraining > API/Plugins/LearningAgentsTraining/ULearningAgentsPPOTrainer
Description
Call this function at the end of each step of your training loop. This takes the current observations/actions/ rewards and moves them into the episode experience buffer. All agents with full episode buffers or those which have been signaled complete will be reset. If enough experience is gathered, it will be sent to the training process and an iteration of training will be run and the updated policy will be synced back.
| Name | ProcessExperience |
| Type | function |
| Header File | /Engine/Plugins/Experimental/LearningAgents/Source/LearningAgentsTraining/Public/LearningAgentsPPOTrainer.h |
| Include Path | #include "LearningAgentsPPOTrainer.h" |
| Source | /Engine/Plugins/Experimental/LearningAgents/Source/LearningAgentsTraining/Private/LearningAgentsPPOTrainer.cpp |
UFUNCTION (BlueprintCallable, Category="LearningAgents")
void ProcessExperience
(
const bool bResetAgentsOnUpdate
)
Parameters
| Name | Remarks |
|---|---|
| bResetAgentsOnUpdate | If true, reset all agents whenever an updated policy is received. |