Search found 2 matches

by pauljurczak
2023-12-25, 7:35:06
Forum: Agner's CPU blog
Topic: Is using BSF instruction instead of using GNU C __builtin_ctz inefficient?
Replies: 1
Views: 12709

Is using BSF instruction instead of using GNU C __builtin_ctz inefficient?

I posted this question on stackoverflow: https://stackoverflow.com/questions/77707238/looking-for-an-efficient-function-to-find-an-index-of-max-element-in-simd-vector. The answer claims, among others, that using "legacy BSF instruction (slow on AMD), instead of using GNU C __builtin_ctz on non-MSVC"...
by pauljurczak
2023-12-23, 6:33:45
Forum: Agner's CPU blog
Topic: Efficiency of array<Vec32uc, 8> vs. ContainerV<Vec32uc, 8>
Replies: 1
Views: 12152

Efficiency of array<Vec32uc, 8> vs. ContainerV<Vec32uc, 8>

Is there an obvious performance penalty in using array<Vec32uc, 8> instead of ContainerV<Vec32uc, 8>? One reason for this choice is https://godbolt.org/ not having vector_containers.h available.