On Wed, Nov 26, 2008 at 06:50:53AM +0100, Manuel Lauss wrote:
> > The probe routines in cpu-probe.c should get at least some of that correct.
> > How about just overriding the things that cpu-probe.c doesn't get right?
>
> CPU detection gets them all right, it's just that somehow GCC does not use
> the information correctly; i.e. in the __fls() case it blindly falls back
> on the C version instead of using the asm macro with clz in it. I scanned
> a few callsites of __fls() and there's not 'clz' to be found anywhere. With
> this addition the clz is used and the binary is a _lot_ smaller.
>
You should define all values as constants, as far as known. GCC will
then be able to use constant propagation and dead code elemination to
optimize the code for a particular target system.
The way fls() is written it will only use of CLZ if the expression
cpu_has_mips_r is a constant, that is if the kernel is being built
exclusivly for MIPS32 / MIPS64 revision 1 or higher. The reason that
__fls is written this way is that both it's legacy and R1 variants using
CLZ/DCLZ the function body will be compiled into something relativly small.
There is not such much point in adding even more code for a runtime
decission between two variants.
> I believe this is a gcc thing, but this seemed to be the obvious quick
> remedy.
GCC does correct.
Ralf
|