RLlib Examples

This page is an index of examples for the various use cases and features of RLlib.

If any example is broken, or if you’d like to add an example to this page, feel free to raise an issue on our Github repository.

Tuned Examples

Training Workflows

  • Custom training workflows:
    Example of how to use Tune’s support for custom training functions to implement custom training workflows.
  • Curriculum learning:
    Example of how to adjust the configuration of an environment over time.
  • Custom metrics:
    Example of how to output custom training metrics to TensorBoard.

Custom Envs and Models

Serving and Offline

  • CartPole server:
    Example of online serving of predictions for a simple CartPole policy.
  • Saving experiences:
    Example of how to externally generate experience batches in RLlib-compatible format.

Multi-Agent and Hierarchical

Community Examples