publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2024
- Federated Online Prediction from Experts with Differential Privacy: Separations and Regret Speed-ups. In The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024
- Temporal-distributed backdoor attack against video based action recognition. In Proceedings of the AAAI Conference on Artificial Intelligence, 2024
- Provably efficient UCB-type algorithms for learning predictive state representations. 12th International Conference on Learning Representations (ICLR), 2024
- Non-asymptotic Convergence of Training Transformers for Next-token Prediction. The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024
- Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes. 12th International Conference on Learning Representations (ICLR), 2024
- Robust Offline Reinforcement Learning for Non-Markovian Decision Processes. arXiv preprint arXiv:2411.07514, 2024
- Towards General Function Approximation in Nonstationary Reinforcement Learning. IEEE Journal on Selected Areas in Information Theory, 2024
2023
- Federated linear contextual bandits with user-level differential privacy. In International Conference on Machine Learning (ICML), 2023
- Improved sample complexity for reward-free reinforcement learning under low-rank MDPs. 11th International Conference on Learning Representations (ICLR), 2023
- Non-stationary reinforcement learning under general function approximation. In International Conference on Machine Learning (ICML), 2023
- Safe exploration incurs nearly no additional sample complexity for reward-free RL. 11th International Conference on Learning Representations (ICLR), 2023
- FLORAS: Differentially private wireless federated learning using orthogonal sequences. In ICC 2023-IEEE International Conference on Communications, 2023
- Near-optimal conservative exploration in reinforcement learning under episode-wise constraints. In International Conference on Machine Learning (ICML), 2023
2022
- Cascading bandits with two-level feedback. In 2022 IEEE International Symposium on Information Theory (ISIT), 2022
2021
- Federated linear contextual bandits. Advances in Neural Information Processing Systems, 2021