XOR-CD:LinearlyConvergentConstrainedStructureGenerationFanDing1JianzhuMa2JinboXu3YexiangXue1AbstractOriginLRPSEWeproposeXOR-ContrastiveDivergencelearn-!2!1Match(M)SMI2ing(XOR-CD),aprovableapproachf...
ProvablyConvergentTwo-TimescaleOff-PolicyActor-CriticwithFunctionApproximationShangtongZhang1BoLiu2HengshuaiYao3ShimonWhiteson1Abstractatwo-timescaleConvergentanalysisunderfunctionapproxi-mation(Ko...
AdaptiveSketchingforFastandConvergentCanonicalPolyadicDecompositionKareemS.Aggour1AlexGittens2BülentYener2Abstractcommunitieshasarisenthatattemptstoincreasethescala-bilityoftensordecompositionalgo...
SBEED:ConvergentReinforcementLearningwithNonlinearFunctionApproximationBoDai1AlbertShaw1LihongLi2LinXiao3NiaoHe4ZhenLiu1JianshuChen5LeSong1AbstractarereferredtothetextbookofPuterman(2014)fordetails...
ConvergentTREEBACKUPandRETRACEwithFunctionApproximationAhmedTouati12Pierre-LucBacon3DoinaPrecup34PascalVincent124Abstractdifferenttargetswhichmaytaketheformofvaluefunctionscorrespondingtodifferentp...