Livoa LogoLivoa


Online Network

Target Network


Prioritized Experience Replay

Loss Function

Knowledge-Integration Module

argmax Q(s,a;θ)

S

s₁ a₁
s₂ a₂
sₙ₋₁ aₙ₋₁

sₙ aₙ

Agent 1
Agent 2
Agent n-1
Agent n

Cooperation Module

Environment

Simulator

Initial Solution

Shaking
Local Search
Final Solution

mnm

by hh

0
0 uses