COSIE.image_preprocessing.create_model
- create_model(local_dir)[source]
Create and load a pre-trained Vision Transformer (ViT-L/16) model from a HuggingFace-compatible checkpoint.
Parameters
- local_dirstr
Path to the folder containing the pre-trained model weights (e.g., pytorch_model.bin).
Returns
- modeltorch.nn.Module
A ViT-Large (patch16, img224) model from the timm library, without classification head or global pooling.