Improved Regret Bounds of Bilinear Bandits using Action Space Analysis. Kyoungseok Jang¹, Kwang-Sung Jun², Se-Young Yun³, Wanmo Kang¹. Abstract: arrange couples based on their experiences to get better ratings and rewards. Balancing e...
Best Arm Identification in Graphical Bilinear Bandits. Geovani Rizk¹ ², Albert Thomas², Igor Colin², Rida Laraki¹ ³, Yann Chevaleyre¹. Abstract: agent (e.g., all the configuration parameters of the antennas), and receives an associated g...
Bilinear Classes: A Structural Framework for Provable Generalization in RL. Simon S. Du¹, Sham M. Kakade¹, Jason D. Lee², Shachar Lovett³, Gaurav Mahajan³, Wen Sun⁴, Ruosong Wang⁵. Abstract: there is also a realization that practical RL appro...
LowFER: Low-rank Bilinear Pooling for Link Prediction. Saadullah Amin¹, Stalin Varanasi¹ ², Katherine Ann Dunfield¹ ², Günter Neumann¹ ². Abstract: tions, in the form of fact triples <sub, rel, obj>. The usefulness of knowledge graphs...
Bilinear Bandits with Low-rank Structure. Kwang-Sung Jun¹, Rebecca Willett², Stephen Wright³, Robert Nowak³. Abstract: system may want to choose a pair of items (top, bottom) for a customer, whose appeal depends in part on whether they Wei...
Scalable Bilinear π Learning Using State and Action Features. Yichen Chen¹, Lihong Li², Mengdi Wang³. Abstract: e.g., Azar et al. (2013)). In other words, there is an oracle that takes (s, a) as input and outputs a random s with prob- Approxi...