Livoa LogoLivoa
Vision Transformer (ViT)
Architecture
Generate Image
Patches
Linear Projection + Position Embedding
Transformer Block
Transformer Block
...
Feed Forward Neural Network
Multi-Head Self-Attention
Patch Embeddings
Feed-Forward Network
Classification

1

by Nothing

0
0 uses