{Reference Type}: Journal Article {Title}: A review of reinforcement learning based hyper-heuristics. {Author}: Li C;Wei X;Wang J;Wang S;Zhang S; {Journal}: PeerJ Comput Sci {Volume}: 10 {Issue}: 0 {Year}: 2024 {Factor}: 2.411 {DOI}: 10.7717/peerj-cs.2141 {Abstract}: The reinforcement learning based hyper-heuristics (RL-HH) is a popular trend in the field of optimization. RL-HH combines the global search ability of hyper-heuristics (HH) with the learning ability of reinforcement learning (RL). This synergy allows the agent to dynamically adjust its own strategy, leading to a gradual optimization of the solution. Existing researches have shown the effectiveness of RL-HH in solving complex real-world problems. However, a comprehensive introduction and summary of the RL-HH field is still blank. This research reviews currently existing RL-HHs and presents a general framework for RL-HHs. This article categorizes the type of algorithms into two categories: value-based reinforcement learning hyper-heuristics and policy-based reinforcement learning hyper-heuristics. Typical algorithms in each category are summarized and described in detail. Finally, the shortcomings in existing researches on RL-HH and future research directions are discussed.