likwid-bench: improve peakflops micro benchmark
Created by: melven
- add different variants AVX/AVX-512 with/without FMAs, similar to the other benchmarks
- higher unrolling of AVX/SSE variants to obtain better results (on Haswell)
- add missing description and annotation (number of instructions etc)