Ph.D. student at Peking University. Interested in Reinforcement Learning & Multi-Agent Systems & Distributed Computing. Working on LLMs and AI Alignment.
-
CFCS @ PKU
- Peking University, Beijing
-
18:57
(UTC +08:00) - in/xuahai-pan
- @XuehaiPan
Highlights
- Pro
Block or Report
Block or report XuehaiPan
Report abuse
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abusePinned
-
PKU-Alignment/safe-rlhf
PKU-Alignment/safe-rlhf PublicSafe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
-
metaopt/torchopt
metaopt/torchopt PublicTorchOpt is an efficient library for differentiable optimization built upon PyTorch.
-
PKU-Alignment/omnisafe
PKU-Alignment/omnisafe PublicOmniSafe is an infrastructural framework for accelerating SafeRL research.
-
pytorch/pytorch
pytorch/pytorch PublicTensors and Dynamic neural networks in Python with strong GPU acceleration
-
ray-project/ray
ray-project/ray PublicRay is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.