Most of our mental representations are constructed from our sensory experiences. For exemple we have self talk (auditory representations), internal images (visual representations) or imagined physical movement (kinesthetic representations).
As a "Reductio ad absurdum", try to solve this problem without pronouncing a single word in your head:
Continue the series without visualy reproducing any internal images:
Find the hidden face of the die without imagining any movement:
As you can see, it is impossible; each kind of problem requires a specific type of representation.
Try the .