Subject: Re: stmw/lmw [Was: CVS commit: src/sys/arch/powerpc/powerpc]
To: Simon Burge <simonb@wasabisystems.com>
From: Matt Thomas <matt@3am-software.com>
List: source-changes
Date: 08/03/2003 18:41:30
On Sunday, August 3, 2003, at 06:31 PM, Simon Burge wrote:
> Matt Thomas wrote:
>
>> Module Name: src
>> Committed By: matt
>> Date: Mon Aug 4 00:32:50 UTC 2003
>>
>> Modified Files:
>>
>> src/sys/arch/powerpc/powerpc: locore_subr.S trap_subr.S
>>
>> Log Message:
>>
>> Eliminate stmw/lmw substituting the individual load/store
>> instructions.
>
> Does this have any measurable impact on trap response and context
> switch
> times (simply because of more instruction fetches)? It's also larger
> code, so we have greater I$ pollution...
Actually, when I was doing my internet checksum tests, I found
individual loads to be *faster* (by 20-25%) than lmw/stmw. This may be
why gcc doesn't emit them for saving/restoring callframes.
--
Matt Thomas email: matt@3am-software.com
3am Software Foundry www:
http://3am-software.com/bio/matt/ Cupertino, CA
disclaimer: I avow all knowledge of this message.