Google Scholar

Regret analysis of stochastic and nonstochastic multi-armed bandit problems

S Bubeck, N Cesa-Bianchi - Foundations and Trends® in …, 2012 - nowpublishers.com

Multi-armed bandit problems are the most basic examples of sequential decision problems
with an exploration-exploitation trade-off. This is the balance between staying with the option …

Save Cite Cited by 3102 Related articles All 26 versions Library Search View as HTML

[PDF] arxiv.org

Stochastic first-and zeroth-order methods for nonconvex stochastic programming

S Ghadimi, G Lan - SIAM journal on optimization, 2013 - SIAM

In this paper, we introduce a new stochastic approximation type algorithm, namely, the
randomized stochastic gradient (RSG) method, for solving an important class of nonlinear …

Save Cite Cited by 1562 Related articles All 13 versions

[PDF] researchgate.net

Robust stochastic approximation approach to stochastic programming

A Nemirovski, A Juditsky, G Lan, A Shapiro - SIAM Journal on optimization, 2009 - SIAM

In this paper we consider optimization problems where the objective function is given in a
form of the expectation. A basic difficulty of solving such stochastic optimization problems is …

Save Cite Cited by 2698 Related articles All 36 versions

[PDF] neurips.cc

Dual averaging method for regularized stochastic learning and online optimization

L Xiao - Advances in Neural Information Processing …, 2009 - proceedings.neurips.cc

We consider regularized stochastic learning and online optimization problems, where the
objective function is the sum of two convex terms: one is the loss function of the learning …

Save Cite Cited by 1031 Related articles All 17 versions View as HTML

[PDF] researchgate.net

An optimal method for stochastic composite optimization

G Lan - Mathematical Programming, 2012 - Springer

This paper considers an important class of convex programming (CP) problems, namely, the
stochastic composite optimization (SCO), whose objective function is given by the …

Save Cite Cited by 591 Related articles All 12 versions

[PDF] psu.edu

Optimal stochastic approximation algorithms for strongly convex stochastic composite optimization i: A generic algorithmic framework

S Ghadimi, G Lan - SIAM Journal on Optimization, 2012 - SIAM

In this paper we present a generic algorithmic framework, namely, the accelerated stochastic
approximation (AC-SA) algorithm, for solving strongly convex stochastic composite …

Save Cite Cited by 381 Related articles All 10 versions

[PDF] arxiv.org

Stochastic quasi-Newton methods for nonconvex stochastic optimization

X Wang, S Ma, D Goldfarb, W Liu - SIAM Journal on Optimization, 2017 - SIAM

In this paper we study stochastic quasi-Newton methods for nonconvex stochastic
optimization, where we assume that noisy information about the gradients of the objective …

Save Cite Cited by 208 Related articles All 14 versions

[PDF] projecteuclid.org

Aggregation for Gaussian regression

F Bunea, AB Tsybakov, MH Wegkamp - 2007 - projecteuclid.org

This paper studies statistical aggregation procedures in the regression setting. A motivating
factor is the existence of many different methods of estimation, leading to possibly competing …

Save Cite Cited by 405 Related articles All 16 versions

[PDF] researchgate.net

Regularization in statistics

PJ Bickel, B Li, AB Tsybakov, SA van de Geer, B Yu… - Test, 2006 - Springer

This paper is a selective review of the regularization methods scattered in statistics literature.
We introduce a general conceptual approach to regularization and fit most existing methods …

Save Cite Cited by 304 Related articles All 19 versions

[PDF] springer.com

Aggregation by exponential weighting, sharp PAC-Bayesian bounds and sparsity

A Dalalyan, AB Tsybakov - Machine Learning, 2008 - Springer

We study the problem of aggregation under the squared loss in the model of regression with
deterministic design. We obtain sharp PAC-Bayesian risk bounds for aggregates defined via …

Save Cite Cited by 223 Related articles All 30 versions

Cite

Advanced search

Saved to My library

Regret analysis of stochastic and nonstochastic multi-armed bandit problems

Stochastic first-and zeroth-order methods for nonconvex stochastic programming

Robust stochastic approximation approach to stochastic programming

Dual averaging method for regularized stochastic learning and online optimization

An optimal method for stochastic composite optimization

Optimal stochastic approximation algorithms for strongly convex stochastic composite optimization i: A generic algorithmic framework

Stochastic quasi-Newton methods for nonconvex stochastic optimization

Aggregation for Gaussian regression

Regularization in statistics

Aggregation by exponential weighting, sharp PAC-Bayesian bounds and sparsity