Multi-modal fusion of satellite and street-view images for urban village classification based on a dual-branch deep neural network