Documents related to "Learning" - 文库宝

Documents tagged "Learning": 905 results

Quasi-Global Momentum: Accelerating Decentralized Deep Learning on Heterogeneous Data
Quasi-Global Momentum: Accelerating Decentralized Deep Learning on Heterogeneous Data Tao Lin1 Sai Praneeth Karimireddy1 Sebastian U. Stich1 Martin Jaggi1 Abstract is kept locally (never transmitted during training). Decentr...
Learning on Deep Momentum Decentralized
2023-11-16 19:28:37 17598.84 MB 1
Quantum Algorithms for Reinforcement Learning with a Generative Model
Quantum Algorithms for Reinforcement Learning with a Generative Model Daochen Wang1 Aarthi Sundaram2 Robin Kothari2 Ashish Kapoor3 Martin Roetteler2 Abstract faster algorithms for certain tasks like search and factoring (Grov...
Learning for Generative Algorithms with
2023-11-16 19:28:37 1531405.27 KB 14
Quantifying the Benefit of Using Differentiable Learning over Tangent Kernels
Quantifying the Benefit of Using Differentiable Learning over Tangent Kernels Eran Malach1 Pritish Kamath2 Emmanuel Abbe3 Nathan Srebro2 Abstract Can all the success of deep learning be explained using the NTK? This would imply tha...
Learning of Using the Differentiable
2023-11-16 19:28:36 1725506.02 KB 16
PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning
PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning Angelos Filos1 Clare Lyle1 Yarin Gal1 Sergey Levine2 Natasha Jaques23 Gregory Farquhar4 Abstract...
Learning Using with Reinforcement Demonstrations
2023-11-16 19:28:35 9524.23 MB 7
Putting the “Learning” into Learning-Augmented Algorithms for Frequency Estimation
Putting the “Learning” into Learning-Augmented Algorithms for Frequency Estimation Elbert Du12 Franklyn Wang12 Michael Mitzenmacher2 Abstract by θ and an algorithm A In learning-augmented algorithms, algorithms max M(θ, A...
Learning for Algorithms the into
2023-11-16 19:28:35 1711744.13 KB 28
Proximal Causal Learning with Kernels: Two-Stage Estimation and Moment Restriction
Proximal Causal Learning with Kernels: Two-Stage Estimation and Moment Restriction Afsaneh Mastouri1 Yuchen Zhu1 Limor Gultchin23 Anna Korba4 Ricardo Silva1 Matt J. Kusner1 Arthur Gretton†1 Krikamol Muandet†5 Abstract Goal A...
Learning with Estimation Proximal Causal
2023-11-16 19:28:35 8811.16 MB 6
Provably End-to-end Label-noise Learning without Anchor Points
Provably End-to-end Label-noise Learning without Anchor Points Xuefeng Li12 Tongliang Liu2 Bo Han3 Gang Niu4 Masashi Sugiyama45 Abstract (Arpit et al., 2017; Zhang et al., 2017; Xia et al., 2021; Wu et al., 2021). In label-noise lear...
Learning without Provably End-to-End Points
2023-11-16 19:28:34 877944.8 KB 3
Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping
Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping Dongruo Zhou1 Jiafan He1 Quanquan Gu1 Abstract linear functions or neural networks to map states and actions to a low-dimensional space and solve t...
Learning for Efficient Reinforcement Provably
2023-11-16 19:28:34 1192361.96 KB 29
Provably Efficient Learning of Transferable Rewards
Provably Efficient Learning of Transferable Rewards Alberto Maria Metelli1 Giorgia Ramponi1 Alessandro Concetti1 Marcello Restelli1 Abstract theoretically, under the strong assumption of reward uniqueness (Abbeel & Ng, 2004...
Learning of Efficient Provably Rewards
2023-11-16 19:28:34 1016561.02 KB 20
Provable Robustness of Adversarial Training for Learning Halfspaces with Noise
Provable Robustness of Adversarial Training for Learning Halfspaces with Noise Difan Zou1 Spencer Frei2 Quanquan Gu1 Abstract To formalize the above comment, let us define the ro- We analyze the properties of adversarial train- bus...
Learning of for Adversarial Robustness
2023-11-16 19:28:33 1989450.02 KB 17
Preferential Temporal Difference Learning
Preferential Temporal Difference Learning Nishanth Anand12 Doina Precup123 Abstract TD-learning can be viewed as a way to approximate dynamic programming algorithms in Markovian environ- Temporal-Difference (TD) learning is...
Learning Temporal Preferential Difference
2023-11-16 19:28:30 11046.93 MB 6
Practical and Private (Deep) Learning Without Sampling or Shuffling
Practical and Private (Deep) Learning Without Sampling or Shuffling Peter Kairouz1 Brendan McMahan1 Shuang Song1 Om Thakkar1 Abhradeep Thakurta1 Zheng Xu1 Abstract in the context of distributed settings like federated learning...
Learning Sampling and Deep without
2023-11-16 19:28:30 19304.83 MB 5
Prediction-Centric Learning of Independent Cascade Dynamics from Partial Observations
Prediction-Centric Learning of Independent Cascade Dynamics from Partial Observations Mateusz Wilinski1 Andrey Y. Lokhov1 Abstract This motivates the development of efficient algorithms that can infer the effective spreading...
Learning of from Dynamics Independent
2023-11-16 19:28:30 7713.27 MB 26
Personalized Federated Learning using Hypernetworks
Personalized Federated Learning using Hypernetworks Aviv Shamsian∗1 Aviv Navon∗1 Ethan Fetaya1 Gal Chechik12 Abstract while minimizing communication. Unfortunately, learning a single global model may fail when the data dist...
Learning Using Personalized Federated Hypernetworks
2023-11-16 19:28:28 19692.01 MB 7
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training Kimin Lee1 Laura Smith1 Pieter Abbeel1 Abstract Kober & Peters, 2011; Kober et al., 2013; Silver et al., 201...
Learning Reinforcement via Interactive PEBBLE
2023-11-16 19:28:28 16906.86 MB 12
PC-MLP: Model-based Reinforcement Learning with Policy Cover Guided Exploration
PC-MLP: Model-based Reinforcement Learning with Policy Cover Guided Exploration Yuda Song1 Wen Sun2 Abstract Model-based Reinforcement Learning (RL) is a popular learning parad...
Learning with Reinforcement Cover Model-Based
2023-11-16 19:28:28 17363.18 MB 23
Parameterless Transductive Feature Re-representation for Few-Shot Learning
Parameterless Transductive Feature Re-representation for Few-Shot Learning Wentao Cui1 Yuhong Guo12 Abstract FSL solutions are generally developed in two branches: meta-learning based methods and finetuning based meth- Recen...
Learning for Feature Few-shot Transductive
2023-11-16 19:28:28 756544.85 KB 18
On-Policy Deep Reinforcement Learning for the Average-Reward Criterion
On-Policy Deep Reinforcement Learning for the Average-Reward Criterion Yiming Zhang1 Keith W. Ross21 Abstract Haarnoja et al., 2018) or in a queuing scenario (Tadepalli & Ok, 1994; Sutton & Barto, 2018), there is no natural sep- We de...
Learning for Reinforcement Deep the
2023-11-16 19:28:24 1004805.15 KB 2
Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with $\sqrt{T}$ Regret
Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with $\sqrt{T}$ Regret Asaf Cassel1 Tomer Koren12 Abstract Model-based methods, which perform planning based on a system identification procedure that estima...
Learning for Online Gradient Model
2023-11-16 19:28:24 1211289.14 KB 25
Online Learning for Load Balancing of Unknown Monotone Resource Allocation Games
Online Learning for Load Balancing of Unknown Monotone Resource Allocation Games Abstract Ilai Bistritz1 Nicholas Bambos1 2018; Boursier & Perchet, 2019; Mehrabian et al., 2020; Consider N players that each uses a mixture of Nayyar...
Learning of for Online Balancing
2023-11-16 19:28:24 1114613.47 KB 22
