From 1632782fe9a845e545f0ebd05fb4dc304adb5bd2 Mon Sep 17 00:00:00 2001
From: Evan Cheng <evan.cheng@apple.com>
Date: Wed, 28 Jan 2009 08:35:02 +0000
Subject: The memory alignment requirement on some of the mov{h|l}p{d|s}
 patterns are 16-byte. That is overly strict. These instructions read / write
 f64 memory locations without alignment requirement.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@63195 91177308-0d34-0410-b5e6-96231b3b80d8
---
 lib/Target/X86/README-SSE.txt | 5 +++++
 1 file changed, 5 insertions(+)

(limited to 'lib/Target/X86/README-SSE.txt')

diff --git a/lib/Target/X86/README-SSE.txt b/lib/Target/X86/README-SSE.txt
index 7110b31487..bc51b53482 100644
--- a/lib/Target/X86/README-SSE.txt
+++ b/lib/Target/X86/README-SSE.txt
@@ -907,3 +907,8 @@ We should be able to use:
   cvtsi2ss 8($esp), %xmm0
 since we know the stack slot is already zext'd.
 
+//===---------------------------------------------------------------------===//
+
+Consider using movlps instead of movsd to implement (scalar_to_vector (loadf64))
+when code size is critical. movlps is slower than movsd on core2 but it's one
+byte shorter.
-- 
cgit v1.2.3