|
HACMan: Learning Hybrid Actor-Critic Maps for 6D Non-Prehensile Manipulation
Wenxuan Zhou, Bowen Jiang, Fan Yang, Chris Paxton*, David Held*
Conference of Robot Learning (Oral), 2023
We propose a spatially-grounded and temporally-abstracted action representation with a hybrid discrete-continuous reinforcement learning framework.
Keywords: RL with 3D Vision, Action Representation, Contact-rich manipulation
[Paper]
[Code]
[Website]
[BibTex]
|
|
Self-Paced Policy Optimization with Safety Constraints
Fan Yang, Wenxuan Zhou, Harshit Sikchi, David Held
ICML Workshop, Safe Learning for Autonomous Driving, 2022
[BibTex]
The method of graduating incurring a harder safety constraints can lead to a better performance in safe RL tasks.
|
|
Evaluations of the Gap between Supervised and Reinforcement Lifelong Learning on Robotic Manipulation Tasks
Fan Yang, Chao Yang, Huaping Liu, Fuchun Sun
Conference on Robot Learning (CORL), 2021
[BibTex]
We develop a benchmark and evaluate the state-of-the-art lifelong learning method on reinforcement learning tasks, especially robotic manipulation tasks.
|
|
Fault-aware robust control via adversarial reinforcement learning
Fan Yang, Chao Yang, Di Guo, Huaping Liu, Fuchun Sun
IEEE 11th Annual International Conference on CYBER Technology in Automation, Control, and Intelligent Systems (CYBER), 2021
[BibTex]
An adversarial training algorithm is used to increase the robustness of robot joint damage.
|
|
RAIN: Reinforced hybrid attention inference network for motion forecasting
Jiachen Li, Fan Yang, Hengbo Ma, Srikanth Malla, Masayoshi Tomizuka, Chiho Choi
International Conference on Computer Vision (ICCV), 2021
[BibTex]
We develop a method that uses RL agent to select important interactions for trajectory prediction in a highly interactive environment.
|
|
Evolvegraph: Multi-agent trajectory prediction with dynamic relational reasoning
Jiachen Li*, Fan Yang*, Masayoshi Tomizuka, Chiho Choi
Neural Information Processing Systems (NeurIPS), 2020
[BibTex]
We develop a Graph-Neural-Network-based method captures the interactions between different agents for trajectory prediction in a highly interactive environment.
|
|