> ----- Forwarded message from Jun Sun <email@example.com> -----
Sep 17, 2002 at 03:58:54PM -0700, Greg Lindahl wrote:
> > The only good test is Linux with and without lazy saves. Throwing in a
> > new OS complicates matters. It sounds like Jun already has working
> > code for (1) and (3), so he can do a good test.
> I actually have 2) and 3). 1) is easy to do, though.
> Anyone can recommand some test programs to try?
> A while back, I tried lmbench which is not very telling.
> I think the reason is that most of the tests are not using
> FPU at all.
"Not very telling?" Sounds to me as if it confirms the
hypothesis that the benefits of these optimizations are
> However I might try it again anyway. It might tell the
> difference between 1) and 2)&3) easily.
If I wanted to see the effect at its strongest, I'd whip
up an FP-intensive, low-I/O program along the lines
of the old fashioned Whetstone benchmark that runs
for at least a few seconds, then time a script that
forks off N of them in parallel with M instances of
a program that does no FP. You can then play with
M and N to see where a hack becomes advantageous.
If all runnable programs are using the FPU, there's
clearly no benefit from the optimization.
Are you able to test this stuff on a proper SMP
system, by the way? The efficiency of the code
that manipulates interprocessor control variables
can reasonably be expected to drop off a bit
in a system with MP cache invalidations blasting
left and right.