garrett4wade

Follow

Wei Fu garrett4wade

Follow

Ph.D. student in Tsinghua

23 followers · 6 following

Tsinghua University
Beijing, China

Achievements

Achievements

Pinned Loading

openpsi-project/ReaLHF openpsi-project/ReaLHF Public

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Python 109 4
revisiting_marl revisiting_marl Public

Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)

Python 21 1
cugae cugae Public

CUDA implementation of Generalized Advantage Estimation (GAE)

Python
scaling_marl scaling_marl Public

Python