PhD Candidate @ Tsinghua University
- Tsinghua University
- Beijing, China
- https://shenzhi-wang.netlify.app/
- @ShenzhiWang_THU
- https://huggingface.co/shenzhi-wang
Highlights
- Pro
Pinned
- Llama3-Chinese-Chat (Public): This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model.
- LeapLabTHU/FamO2O (Public): Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)