ExplorationConsciousReinforcementLearningRevisitedLiorShani1YonathanEfroni1ShieMannor1AbstractRL,i.e,whenusingfunctionapproximation,remainsanopenproblem.Onthepracticalside,recentworkscom-TheExplora...