TensorBoard
The training scripts write TensorBoard logs to /results during training.
To monitor training using TensorBoard, launch the port-forwarding TensorBoard container in another terminal:
./scripts/docker/launch_tb.sh <results_dir1> [results_dir2 ...] [--port PORT] [--samples NUM]
If --port isn’t passed then it defaults to port 6010. --samples is the
number of steps that TensorBoard will sample from the log and plot. It defaults
to 1000.
Then navigate to http://traininghostname:<OPTIONAL PORT NUMBER> in a web browser.
If a connection dies and you can’t reconnect to your port because it’s already allocated, run:
docker ps
docker stop <name of docker container with port forwarding>