A PyTorch implementation of the flow policy with invalid action rejection for large discrete (categorical) action space with constraints.
Latest commits.
Builders behind this project.