Acceleration of GPU-based Krylov solvers via Data Transfer Reduction