Is there any way to use RL for decoder-only models?

11 Views Asked by rohit jindal At 22 March 2024 at 16:05

Could you please assist me in finding resources related to training models like BERT using RLHF? Additionally, I'm curious about the scarcity of research on applying reinforcement learning to decoder-only models. Are there any specific challenges or issues associated with this area that limit the research? Any guidance or insights would be greatly appreciated. Thank you!

While exploring the trl library, I found this issue (https://github.com/huggingface/trl/issues/747), and I have been brainstorming a lot about how to define the trajectories for this.


