The theoretical Evaluation demonstrates that EDIS exhibits reduced suboptimality compared to solely making use of on the net info or immediately reusing offline facts. EDIS is usually a plug-in approach and will be combined with existing methods in offline-to-on line RL environment. By applying EDIS to off-the-shelf methods Cal-QL and IQL, we notic