Livoa LogoLivoa
Input Layer


3-channel RGB image

224 x 224 pixels

Tensor Shape: [3 x 224 x 224]

Stem Block


Conv 4x4, stride 4

Layer Normalization

Output: [96 x 56 x 56]

Stage 1


3 x ConvNeXt Blocks

Output: [96 x 56 x 56]

Stage 2


Downsampling (2x2 conv, stride 2)

3 x ConvNeXt Blocks

Output: [192 x 28 x 28]

Stage 3


Downsampling (2x2 conv, stride 2)

9 x ConvNeXt Blocks

Output: [384 x 14 x 14]

Stage 4


Downsampling (2x2 conv, stride 2)

3 x ConvNeXt Blocks

Output: [768 x 7 x 7]

ConvNeXt Block Details


- 7x7 Depthwise Conv

- Layer Normalization

- 1x1 Pointwise Conv (expand channels x4)

- GELU Activation

- 1x1 Pointwise Conv (project channels back)

- Residual Connection

Classification Head


Global Average Pooling

Layer Normalization

Linear Layer (4 classes)

Final Output


4-class scores

Softmax probabilities

.,nm

by nbv

0
0 uses