A test with 800 images from Wikimedia Commons. The results are not great, for several reasons.
The first is the position and angle of the photos, the second is the color scheme, and so on.
Main_Battle_Tank_v2.safetensors
834932_training_data.zip