Due to the rapid growth of image number, the content-based image retrieval becomes an indispensable tool for huge databases. In this study, our focus is retrieving a specific building at different viewing angles stored in a database. In addition, if the user can provide additional images as the secondi and' or the third queries, how do we combine the information provided by these multiple queries? Thus, we develop a multi-query fus ion method to achieve a higher accuracy. Although Deep Neural Net (DNN) can provide an End-To-End image retrieval system, we like to see if the traditional image feature can offer additional performance improvement. That is, we test two different types of features designed for image retrieval purpose. We adopt the Scale-Invariant Feature Trans form (SIFT) features as the low-level feature and the Convolutional Neural Network (CNN) features as the high-level feature in the retrieval process. The AlexNet is used as our CNN model and also, its extension to the Siamese-Triplet Network is in use to match the image retried purpose. Several data fus ion structures have been proposed Our best sys tem exceeds mos t of the state-of-The-Art retrieval methods for a single query. The multi-query retrieval can further increase the retrieval accuracy, which is rarely studied by the other researchers.