For random accesses, I recommend to use TMU because it's more flexible than VPM DMA. To do TMU read, you do: 1. Write memory address (aligned with 4 byte) to TMU[01]_S. The address can be different across QPU threads, that is, you can read up to 16 x 4bytes of memory on a TMU read. 2. Signal the TMU...