MultiAgentDecisionProcess
|
AgentFullyObservable represents an agent that receives the true state, the joint observation and also the reward signal. More...
#include <AgentFullyObservable.h>
Public Member Functions | |
virtual Index | Act (Index sI, Index joI, double reward)=0 |
Return an individual action based on state, last joint observation and reward. More... | |
AgentFullyObservable (const PlanningUnitDecPOMDPDiscrete *pu, Index id) | |
(default) Constructor More... | |
AgentFullyObservable (const AgentFullyObservable &a) | |
Copy constructor. More... | |
~AgentFullyObservable () | |
Destructor. More... | |
Public Member Functions inherited from AgentDecPOMDPDiscrete | |
AgentDecPOMDPDiscrete (const PlanningUnitDecPOMDPDiscrete *pu, Index id) | |
(default) Constructor More... | |
AgentDecPOMDPDiscrete (const AgentDecPOMDPDiscrete &a) | |
Copy constructor. More... | |
const PlanningUnitDecPOMDPDiscrete * | GetPU () const |
Public Member Functions inherited from SimulationAgent | |
virtual Index | GetIndex () const |
Retrieves the index of this agent. More... | |
virtual bool | GetVerbose () const |
If true, the agent will report more. More... | |
void | Print () const |
Print out some information about this agent. More... | |
virtual void | ResetEpisode ()=0 |
Will be called before an episode, to reinitialize the agent. More... | |
virtual void | SetIndex (Index id) |
Sets the index of this agent. More... | |
virtual void | SetVerbose (bool verbose) |
Set whether this agent should be verbose. More... | |
SimulationAgent (Index id, bool verbose=false) | |
(default) Constructor More... | |
virtual std::string | SoftPrint () const |
Return some information about this agent. More... | |
virtual | ~SimulationAgent () |
Destructor. More... | |
AgentFullyObservable represents an agent that receives the true state, the joint observation and also the reward signal.
|
inline |
(default) Constructor
a derived class should first call the AgentDecPOMDPDiscrete constructor. Therefore, we do not need to pass along any arguments anymore here.
|
inline |
Copy constructor.
|
inline |
Destructor.
Return an individual action based on state, last joint observation and reward.
Implemented in AgentQLearner, AgentMDP, AgentRandom, and AgentOnlinePlanningMDP.