Vision Transformer (ViT)
Architecture
Generate Image
Patches
Linear Projection + Position Embedding
Transformer Block
Transformer Block
...
Feed-Forward Neural Network
Multi-Head Self-Attention
Patch Embeddings
Feed-Forward Network
Classification
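The first two stages named in the diagram (Patches, then Linear Projection + Position Embedding) can be illustrated with a minimal sketch. It assumes a single-channel image stored as a list of rows, non-overlapping square patches, and caller-supplied weights. The names `patchify` and `embed` are hypothetical helpers for illustration, not part of any library, and the class token that ViT implementations usually prepend is left out because the diagram does not show it.

```python
def patchify(image, patch):
    """Split an H x W image (list of rows) into flattened patch vectors.

    Patches are taken in row-major order and do not overlap.
    """
    h, w = len(image), len(image[0])
    if h % patch or w % patch:
        raise ValueError("image size must be divisible by patch size")
    patches = []
    for top in range(0, h, patch):
        for left in range(0, w, patch):
            patches.append([image[top + r][left + c]
                            for r in range(patch)
                            for c in range(patch)])
    return patches


def embed(patches, weight, pos):
    """Project each patch linearly and add a per-position embedding.

    weight: patch_dim x dim matrix (assumed learned elsewhere).
    pos:    num_patches x dim matrix of position embeddings.
    """
    dim = len(weight[0])
    tokens = []
    for i, p in enumerate(patches):
        tokens.append([
            sum(x * weight[k][j] for k, x in enumerate(p)) + pos[i][j]
            for j in range(dim)
        ])
    return tokens


# Toy input: a 4 x 4 image with pixel values 0..15, cut into 2 x 2 patches.
image = [[r * 4 + c for c in range(4)] for r in range(4)]
patches = patchify(image, 2)

# Illustrative weights only: all-ones projection, zero position embeddings.
weight = [[1, 1] for _ in range(4)]
pos = [[0, 0] for _ in range(len(patches))]
tokens = embed(patches, weight, pos)
```

The resulting `tokens` list holds one vector per patch, which is the sequence the Transformer Blocks in the diagram would consume.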
by Nothing