On Wed, Jul 20, 2011 at 10:47 AM, Robin Gareus wrote:
In addition... inspecting the disassembly is helpful (-S -o
myprogram.s). Rule of thumb is that you should have `movaps` (MOVe
Aligned Packed-Storage) and `mulps` (MULtiply Packed Storage)
instructions for multiplying vectors of single-precision floats.
In addition... profiling with valgrind/callgrind is helpful (esp. if
you have it dump instructions/assembly)...
$ valgrind --tool=callgrind --dump-instr=yes ./myprogram
Open the output file with kcachegrind and it'll save you a lot of time.
Linux-audio-dev mailing list