RLHF
ThoughtStorms Wiki
Context : ArtificialIntelligence, MachineLearning, LanguageModels
Reinforcement Learning from Human Feedback
Most AI models are first trained on large amounts of human-generated data. Human feedback on the model's outputs is then used to improve it and to overcome the problems inherited from that data.
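As a rough illustration of the "human feedback" step, here is a minimal sketch of one common ingredient: fitting a reward model from pairwise human preferences using a Bradley-Terry style logistic loss. Everything here is an assumption made for illustration: the feature vectors, the toy data, and the names `train_reward_model` and `reward` are hypothetical, and real systems use a neural network over text rather than a linear model over hand-made features.

```python
import math

def train_reward_model(preferences, n_features, lr=0.1, epochs=200):
    """Fit linear reward weights from (preferred, rejected) feature-vector pairs."""
    w = [0.0] * n_features
    for _ in range(epochs):
        for preferred, rejected in preferences:
            # Reward gap between the response the human preferred and the one rejected.
            diff = sum(wi * (p - r) for wi, p, r in zip(w, preferred, rejected))
            # Bradley-Terry assumption: P(preferred wins) = sigmoid(reward gap).
            p_win = 1.0 / (1.0 + math.exp(-diff))
            # Gradient ascent on the log-likelihood of the human's choice.
            for i in range(n_features):
                w[i] += lr * (1.0 - p_win) * (preferred[i] - rejected[i])
    return w

def reward(w, features):
    """Score a response's features with the learned weights."""
    return sum(wi * fi for wi, fi in zip(w, features))

# Hypothetical toy data: each response is described as [helpfulness, rudeness].
# Each pair is (response the human preferred, response the human rejected).
preferences = [
    ([0.9, 0.1], [0.4, 0.8]),
    ([0.7, 0.0], [0.8, 0.9]),
    ([0.6, 0.2], [0.2, 0.3]),
]

weights = train_reward_model(preferences, n_features=2)
for preferred, rejected in preferences:
    print(reward(weights, preferred) > reward(weights, rejected))
```

In a full RLHF pipeline, a reward model like this would then be used as the training signal for a reinforcement learning step that adjusts the language model itself. That step is not shown here.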