They are talking about layers. The visualizations are not just raw weights from nodes converted to images.
The paper refered in the book list the details of the processing done to get the visualizations (1).
From the paper: “We show the top 9 activations in a random subset of feature maps across the validation data, projected down to pixel space using our deconvolutional network approach.”