ImitationbyPredictingObservationsAndrewJaegle1YurySulsky1ArunAhuja1JakeBruce1RobFergus1GregWayne1Abstract2009;Huberetal.,2009).Whilemostalgorithmsforimita-tionlearningassumethatdemonstrationscontai...
Cross-domainImitationfromObservationsDriptaS.Raychaudhuri1SujoyPaul2†JeroenvanBaar3AmitK.Roy-Chowdhury1AbstractExpertdomainProxytaskInferencetaskImitationlearningseekstocircumventthediffi-Transfor...
Non-StationaryDelayedBanditswithIntermediateObservationsClaireVernade1Andra´sGyo¨rgy1TimothyA.Mann1AbstractDelayedfeedbackinonlinelearninghavebeenaddressedbothinthefullinformationsetting(see,e.g....
ProvablyefficientRLwithRichObservationsviaLatentStateDecodingSimonS.Du1AkshayKrishnamurthy2NanJiang3AlekhAgarwal4MiroslavDud´ık2JohnLangford2Abstract2010;Lattimore&Hutter,2012).Consequently,treat...
LearningRegisteredPointProcessesfromIdiosyncraticObservationsHongtengXu12LawrenceCarin1HongyuanZha3AbstractRegisteredIntensityfunctionTemporalAparametricpointprocessmodelisdeveloped,withmodelingbas...