Baselines
These tutorials show how to configure and compare MASA baselines in small, readable experiments.
For a verified list of the algorithms currently present in the codebase, start with Algorithms Overview.
Tabular Safe RL Baselines compares q_learning, q_learning_lambda, lcrl, sem, and recreg on one constrained tabular environment.
Continuous Safe RL Baselines is a stub for PPO, CPO, and PPO Lagrangian continuous-control baselines.
Runnable notebooks: