https://github.com/lucidrains/llama-qrlhf

GitHub - lucidrains/llama-qrlhf: Implementation of the Llama architecture with RLHF Q-learning

Implementation of the Llama architecture with RLHF + Q-learning - GitHub - lucidrains/llama-qrlhf: Implementation of the Llama architecture with RLHF + Q-learning

github.com



딥러닝 관련 새로운 기술이나 논문 나오면 ㅈㄴ빠른속도로 구현하는걸로 유명한 사람인데

이번에 Q스타 루머보고 LLaMA에 Q러닝 붙여서 구현하고있나봄 ㅋㅋㅋ