Input Video Stream
Farm Surveillance Feed
YOLO-based Detection
Cattle Bounding Boxes
Per-Cattle Cropping
Individual ROIs
Multi-Object Tracking
(Identity Assignment)
Temporal Clip Formation
T Consecutive Frames
Temporal Transformer
Attention Over Time
YOLO Backbone Feature Extraction
Behaviour Classification
Eating | Drinking | Standing | Sitting
by Ishwar