TeachMyAgent:aBenchmarkforAutomaticCurriculumLearninginDeepRLCle´mentRomac1Re´myPortelas1KatjaHofmann2Pierre-YvesOudeyer1AbstractagentisunlikelytobeoptimalforcomplexLearningprob-lems.Buildingupon...
SUNRISE:ASimpleUnifiedFrameworkforEnsembleLearninginDeepReinforcementLearningKiminLee1MichaelLaskin1AravindSrinivas1PieterAbbeel1Abstractefficientmodel-freeRLalgorithmsthroughimprovementsinoff-poli...
StructuredWorldBeliefforReinforcementLearninginPOMDPGautamSingh1SkandPeri1JunghyunKim1HyunseokKim2SungjinAhn13Abstractgeneralizationtonovelscenes(Chenetal.,2020).Object-centricworldmodelsprovidestr...
StraighttotheGradient:LearningtoUseNovelTokensforNeuralTextGenerationXiangLin1SimengHan1ShafiqJoty12Abstractrization(Seeetal.,2017),imagecaptioning(Melas-Kyriazietal.,2018;Wang&Chan,2019)andmachine...
SpectralNormalisationforDeepReinforcementLearning:AnOptimisationPerspectiveFlorinGogianu12TudorBerariu3MihaelaRosca45ClaudiaClopath34LucianBusoniu2RazvanPascanu4AbstractFigure1:Optimisationrivalsal...
SparseFeatureSelectionMakesBatchReinforcementLearningMoreSampleEfficientBotaoHao1YaqiDuan2TorLattimore1CsabaSzepesva´ri13MengdiWang21Abstract1.IntroductionThispaperprovidesastatisticalanalysisofhi...
SparseBayesianLearningviaStepwiseRegressionSebastianAment1CarlaGomes1Abstractvariouslyreferredtoasafeatureoranatom.Thisproblemhasbeenstudiedextensively,resultinginmyriadexistingSparseBayesianLearni...
SketchEmbedNet:LearningNovelConceptsbyImitatingDrawingsAlexanderWang12MengyeRen12RichardS.Zemel123Abstractofsketchgenerationmodels,rangingfromgenerativeadver-sarialnetworks(GANs)(Isolaetal.,2017;Li...
SimultaneousSimilarity-basedSelf-DistillationforDeepMetricLearningKarstenRoth12TimoMilbich2Bjo¨rnOmmer2JosephPaulCohen3MarzyehGhassemi41Abstractthatembeddingspacedimensionalitycanbeadriverforgener...
Shortest-PathConstrainedReinforcementLearningforSparseRewardTasksSungryullSohn12SungtaeLee3JongwookChoi1HarmvanSeijen4MehdiFatemi4HonglakLee21AbstractMoreover,thesuccessoftheRLalgorithmheavilyhinge...
SharingLessisMore:LifelongLearninginDeepNetworkswithSelectiveLayerTransferSeungwonLee1SimaBehpour2EricEaton1Abstract&Hospedales,2017;Leeetal.,2019;Liuetal.,2019b;Bulatetal.,2020),suchassharingthelo...
Self-TuningforData-EfficientDeepLearningXimeiWang1JinghanGao1MingshengLong1JianminWang1Abstractmustbedonebyanexpertsuchasadoctorinmedicalapplications.Tomitigatetherequirementforlabeleddata,Deeplear...
Self-supervisedGraph-levelRepresentationLearningwithLocalandGlobalStructureMinghaoXu1HangWang1BingbingNi1HongyuGuo2JianTang345Abstract2020).Thesemethodsareusuallytrainedinasupervisedfashion,whichre...
Self-PacedContextEvaluationforContextualReinforcementLearningTheresaEimer1Andre´Biedenkapp2FrankHutter23MariusLindauer1AbstractFigure1:ExampleinstancesofthecontextualPointMassenvironment.Theagent...
Self-DamagingContrastiveLearningZiyuJiang1TianlongChen2BobakMortazavi1ZhangyangWang2Abstractpowerfulvisualrepresentationsfromunlabeleddata.Thestate-of-the-artcontrastiveLearningframeworksconsistent...
SCC:anEfficientDeepReinforcementLearningAgentMasteringtheGameofStarCraftIIXiangjunWang1JunxiaoSong1PenghuiQi1PengPeng1ZhenkunTang1WeiZhang1WeiminLi1XiongjunPi1JujieHe1ChaoGao1HaitaoLong1QuanYuan1Ab...
ScalingUpVisualandVision-LanguageRepresentationLearningWithNoisyTextSupervisionChaoJia1YinfeiYang1YeXia1Yi-TingChen1ZaranaParekh1HieuPham1QuocV.Le1YunhsuanSung1ZhenLi1TomDuerig1Abstract1.Introducti...
ScalingMulti-AgentReinforcementLearningwithSelectiveParameterSharingFilipposChristianos1GeorgiosPapoudakis1ArrasyRahman1StefanoV.Albrecht1Abstract(e.g.(Guptaetal.,2017))wherebyagentssharesomeorallp...
ScalableEvaluationofMulti-AgentReinforcementLearningwithMeltingPotJoelZ.Leibo1EdgarDue´n˜ez-Guzma´n1AlexanderSashaVezhnevets1JohnP.Agapiou1PeterSunehag1RaphaelKoster1JaydMatyas1CharlesBeattie1Ig...
Sample-OptimalPACLearningofHalfspaceswithMaliciousNoiseJieShen1AbstractGenerallyspeaking,alargebodyofexistingworksstudytheproblemofLearninghalfspacesunderlabelnoise.Thisin-WestudyefficientPAClearni...