Thinh T. Doan
Title · Cited by · Year
Finite-Time Analysis of Distributed TD(0) with Linear Function Approximation for Multi-Agent Reinforcement Learning
TT Doan, ST Maguluri, J Romberg
International Conference on Machine Learning, 2019
Cited by 141 · 2019
Performance of Q-learning with Linear Function Approximation: Stability and Finite-Time Analysis
Z Chen, S Zhang, TT Doan, ST Maguluri, JP Clarke
arXiv preprint arXiv:1905.11425, 2019
Cited by 117* · 2019
Fast Convergence Rates of Distributed Subgradient Methods with Adaptive Quantization
TT Doan, ST Maguluri, J Romberg
arXiv preprint arXiv:1810.13245, 2018
Cited by 67 · 2018
Convergence rates of distributed gradient methods under random quantization: A stochastic approximation approach
TT Doan, ST Maguluri, J Romberg
IEEE Transactions on Automatic Control 66 (10), 4469-4484, 2020
Cited by 59* · 2020
Distributed resource allocation on dynamic networks in quadratic time
TT Doan, A Olshevsky
https://arxiv.org/abs/1507.07850, 2015
Cited by 56 · 2015
Convergence of the iterates in mirror descent methods
TT Doan, S Bose, DH Nguyen, CL Beck
IEEE Control Systems Letters 3 (1), 114-119, 2018
Cited by 52 · 2018
Distributed Lagrangian Methods for Network Resource Allocation
TT Doan, CL Beck
arXiv preprint arXiv:1609.06287, 2016
Cited by 45* · 2016
On the convergence rate of distributed gradient methods for finite-sum optimization under communication delays
TT Doan, CL Beck, R Srikant
arXiv preprint arXiv:1708.03277, 2017
Cited by 44* · 2017
Finite-time performance of distributed temporal-difference learning with linear function approximation
TT Doan, ST Maguluri, J Romberg
SIAM Journal on Mathematics of Data Science 3 (1), 298-320, 2021
Cited by 43 · 2021
Finite sample analysis of two-time-scale natural actor-critic algorithm
S Khodadadian, TT Doan, J Romberg, ST Maguluri
IEEE Transactions on Automatic Control, 2022
Cited by 38 · 2022
On the Convergence Rate of Distributed Gradient Methods for Finite-Sum Optimization under Communication Delays
TT Doan, CL Beck, R Srikant
Proceedings of the ACM on Measurement and Analysis of Computing Systems 1 (2 …, 2017
Cited by 35 · 2017
Finite-time analysis and restarting scheme for linear two-time-scale stochastic approximation
TT Doan
SIAM Journal on Control and Optimization 59 (4), 2798-2819, 2021
Cited by 33 · 2021
A decentralized policy gradient approach to multi-task reinforcement learning
S Zeng, MA Anwar, TT Doan, A Raychowdhury, J Romberg
Uncertainty in Artificial Intelligence, 1002-1012, 2021
Cited by 32 · 2021
Distributed resource allocation over dynamic networks with uncertainty
TT Doan, CL Beck
IEEE Transactions on Automatic Control, 2020
Cited by 30 · 2020
On the geometric convergence rate of distributed economic dispatch/demand response in power networks
TT Doan, A Olshevsky
arXiv preprint arXiv:1609.06660, 2016
Cited by 29 · 2016
Nonlinear two-time-scale stochastic approximation convergence and finite-time performance
TT Doan
IEEE Transactions on Automatic Control, 2022
Cited by 28 · 2022
Byzantine fault-tolerance in decentralized optimization under 2f-redundancy
N Gupta, TT Doan, NH Vaidya
2021 American Control Conference (ACC), 3632-3637, 2021
Cited by 24 · 2021
Convergence rates of accelerated Markov gradient descent with applications in reinforcement learning
TT Doan, LM Nguyen, NH Pham, J Romberg
arXiv preprint arXiv:2002.02873, 2020
Cited by 24 · 2020
Finite-time analysis of stochastic gradient descent under Markov randomness
TT Doan, LM Nguyen, NH Pham, J Romberg
arXiv preprint arXiv:2003.10973, 2020
Cited by 22 · 2020
A two-time-scale stochastic optimization framework with applications in control and reinforcement learning
S Zeng, TT Doan, J Romberg
SIAM Journal on Optimization 34 (1), 946-976, 2024
Cited by 16 · 2024