Tabular AlgorithmsΒΆ
This section covers the tabular algorithms currently documented in MASA. These methods assume discrete state and action spaces and are implemented under masa/algorithms/tabular/.
QL is the baseline tabular learner. The other pages in this section describe variants that add safety-related penalties, auxiliary tables, or action overrides.