Sungwook Wi -

Improving the accuracy of streamflow predictions in ungauged basins (PUB) has long been a significant challenge in hydrological research. This study hypothesizes that deep learning-based PUB can be enhanced for historical streamflow reconstruction by integrating local climate data from ungauged or partially gauged basins (the target site) with streamflow measurements from nearby gauged basins (donor sites). The rationale is that streamflow records from donor sites offer valuable information for predicting streamflow at the target site. However, in some instances, local weather data may be more readily available, while available donors might be poorly correlated with the target. Therefore, prediction accuracy can be improved by weighting both sources effectively. To test this hypothesis, we conducted a case study using over 200 streamflow gauges in the Great Lakes region. We developed a multi-layer perceptron to estimate Spearman rank correlations of streamflow between basins, aiding in the selection of donor sites. These estimated correlations were fed into a Long Short-Term Memory (LSTM) network, along with streamflow data from donor sites and weather data from target sites. We compared this model against two other LSTMs, one trained only on climate data and the other solely on streamflow data from donor sites, as well as the average prediction from those two models. Our findings indicate that the integrated approach outperforms the alternatives, particularly for partially gauged sites and natural ungauged sites. Lastly, we demonstrate the value of the approach for improving lake-wide runoff estimates and underscore its implications for quantifying the Great Lakes water balance.