Transformer Model Optimization Linformers vs Convolutional Models