Re: [PATCH] Mantaining turnstile aligned to 128 bytes in i386 CPUs



Kip Macy wrote:
On 1/16/07, Ivan Voras <ivoras@xxxxxx> wrote:
But it does seem to hurt the performance a bit - maybe it's time to add
another CPU option like I586_CPU and I686_CPU?

Unless there is a compelling reason not to do so, I think that that
would be a good idea.

Maybe even someone finds a way to get optimized versions of memcpy in
the kernel :)

I was thinking: AFAIK the only major stopper is context saving of the
various "auxiliary" registers - FPU, MMX, SSE, right? But is it an
all-or-nothing situation? I.e. does it make sense (can it be done?) to
just elect to save the MMX context? (AFAIK they are different registers
than SSE, but overlay FPU registers?) The idea is to save something
smaller than the full set.


Attachment: signature.asc
Description: OpenPGP digital signature



Relevant Pages

  • Re: SIMD/Vectors/Matrices
    ... AVX, LRBni, various GPUs, etc.) in Forth. ... registers that SSE, AltiVec, etc provide. ... it leads to the problem again that the programmer has to write ...
    (comp.lang.forth)
  • Re: SSE
    ... > I have an interesting application and I am wondering if SSE will do it. ... > In chinese chess the board is bigger, namely it would be 90 bits. ... > reading calls these floating point registers and the operators work on ... There are shift instructions that move the scalars ...
    (comp.programming)
  • Re: SSE
    ... > I have an interesting application and I am wondering if SSE will do it. ... > In chinese chess the board is bigger, namely it would be 90 bits. ... > reading calls these floating point registers and the operators work on ... Although they are semantically identical to the original instructions SSE ...
    (comp.programming)
  • Re: Where do I start (over)?
    ... Availability of 64-bit wide general-purpose registers. ... the things to cross out are all the MMX instructions. ... one of them has an SSE counterpart, so there's little need to use MMX ...
    (comp.lang.asm.x86)
  • Re: SSE
    ... >>I have an interesting application and I am wondering if SSE will do it. ... >>In chinese chess the board is bigger, namely it would be 90 bits. ... >>reading calls these floating point registers and the operators work on ... memory is an issue but so is speed. ...
    (comp.programming)