Livoa
Discord
Pricing
English
Sign In
agents
goals
obstacles
x_i^{(j)} = [p_i^{j}, v_i^{j}, p_i^{goal,j},
entitytype
(j), cos \Delta \theta_{ij}, sin \Delta \theta_{ij}, d_{ij}]
Training
Execution
o^{(i)}
⊕
x_{agg}^{(i)} = GNN(x_i^{i}, x_i^{j} \forall j \in \mathcal{N}(i), g^{(i)})
Agent i's observation
Aggregation of information from agent's neighborhood using GNN
Actor Net
Predicted Action
↑
↓
←
→
●
X_{agg} = \frac{1}{n} \sum_{i=1}^n x_{agg}^{(i)}
Centralized Critic Net
State-action value
proj
by proj
Use this design
0
0 uses