Commit Graph

9 Commits

Author SHA1 Message Date
David Major
f55a74b44b Backed out changeset 4914871dbd96 (bug 1163740) now that the diagnostic patch has served its purpose 2015-05-26 13:07:26 -04:00
David Major
18a26dc4d5 Backed out changeset a620a422367f (bug 1163740) now that the diagnostic patch has served its purpose 2015-05-26 13:06:34 -04:00
David Major
2c7c9c21b7 Bug 1163740: Keep |result| in a register where we can see it in minidumps. r=seth 2015-05-22 16:49:37 -04:00
David Major
49271789f7 Bug 1163740: Keep |result| in a register where we can see it in minidumps. r=seth 2015-05-20 20:48:57 -04:00
Jeff Muizelaar
e14722f2c7 Bug 791422. Support more flexibiltiy qcms output format. r=BenWa
This will help support Chrome and should also let us output directly to a cairo
compatible format.
2012-10-03 19:04:25 -04:00
Jeff Muizelaar
c8721bef06 Bug 548612. qcms: Reduce size of output lookup tables. r=bgirard
We currently use a larger output lookup table than we probably need. Switch to
a common define for the table size and lower it. The should also give a small
improvement to startup time because we have fewer lookup table entries to
compute.
2010-06-04 14:48:30 -04:00
Steve Snyder
222246b6c4 Bug 512865. qcms: Improve SSE2 performance, add SSE support. r=jrmuizel
This patch greatly improves the performance of QCMS transformations on x86 &
x86_64 systems.  Some notes:

0. On 32-bit x86 systems it does runtime selection between non-SIMD, SSE, and
SSE2 code paths.

1. On x86_64 systems the SSE2 code path is always taken.  The non-SIMD and SSE
code paths are left intact, but contemporary versions of the GCC and MSVC
compilers will see that they cannot be reached and optimize them away.

2. The execution of the SSE2 code path is reduced by 67%, relative to the
original Intel/Microsoft formatted ASM code.  The relative performance is seen
on a Pentium4 (Northwood) 2.4GHz CPU with DDR1 RAM.

3. The SSE code path provides a 80% reduction in execution time, relative to
the non-SIMD code path.  The relative performance is seen on a Pentium3
(Coppermine) 1.26GHz CPU with SDRAM.

4. The code has been split out into separate files so that it can be built
with different cflags (-msse, and -msse2) when using gcc.

5. Try to land again, this time with __attribute__((__force_align_arg_pointer__))
to avoid crashes on linux.
2009-10-01 14:19:41 -04:00
Jeff Muizelaar
25f9fdb935 Backed out changeset cb4f078cc8cb (bug 512865)
Was causing crashes on the leak test box.
2009-09-24 23:35:53 -04:00
Steve Snyder
e24422b587 Bug 512865. qcms: Improve SSE2 performance, add SSE support. r=jrmuizel
This patch greatly improves the performance of QCMS transformations on x86 &
x86_64 systems.  Some notes:

0. On 32-bit x86 systems it does runtime selection between non-SIMD, SSE, and
SSE2 code paths.

1. On x86_64 systems the SSE2 code path is always taken.  The non-SIMD and SSE
code paths are left intact, but contemporary versions of the GCC and MSVC
compilers will see that they cannot be reached and optimize them away.

2. The execution of the SSE2 code path is reduced by 67%, relative to the
original Intel/Microsoft formatted ASM code.  The relative performance is seen
on a Pentium4 (Northwood) 2.4GHz CPU with DDR1 RAM.

3. The SSE code path provides a 80% reduction in execution time, relative to
the non-SIMD code path.  The relative performance is seen on a Pentium3
(Coppermine) 1.26GHz CPU with SDRAM.

4. The code has been split out into separate files so that it can be built
with different cflags (-msse, and -msse2) when using gcc.
2009-09-24 11:58:24 -04:00