NVIDIA GeForce GTX 460M SLI vs NVIDIA GeForce GTX 480M vs NVIDIA GeForce GTX 485M
NVIDIA GeForce GTX 460M SLI► remove from comparison
The NVIDIA GeForce GTX 460M SLI consists of two high end DirectX 11 capable Geforce GTX460M cards connected with an SLI bridge. Therefore, it offers 2x 192 cores of the GF106 chip. Due to the alternating frame rendering (AFR), the SLI solution may suffer from micro stuttering in low fps ranges (around 30fps). Furthermore, the 2x1536 MB GDDR5 cant be added together as each card uses the same date in the graphics memory.
The GF106 core of the GTX 460M is related to the GF100 core of the GeFore GTX 480M and offers 192 shaders and a 192 Bit memory bus for GDDR5. Except for the memory controllers, the GF106 can basically be considered a halved GF104. Therefore, the architecture is not comparable to the old GT215 (e.g., GeForce GTS 350M) or GT216 (e.g., GeForce GT 330M) cores.
Unlike the GF100, the smaller GF104, GF106, and GF108 cores have all been shortened and considerably modified. In contrast to the GF100, which was designed for professional applications, these GF10x chips target the consumer market. They feature more shaders (3x16 instead of 2x16), more texture units (8 instead of 4) and SFUs per streaming multi-processor (SM).
As there are still only 2 warp schedulers (versus 3 shader groups), Nvidia now uses superscalar execution to use the higher amount of shaders per SM more efficiently. In theory, the shaders can thereby be utilized more efficiently and the performance per core should be improved. However, in worst case scenarios the performance can also be worse than of the GF100 (and its predecessors). The ECC memory protection, which is important for professional applications, was completely omitted and the FP64 hardware shortened (only 1/3 of the shaders are FP64-capable and thereby only 1/12 of the FP32’s performance). Because of these cutbacks, the size of the SM grew only by 25% despite the higher number of shaders and larger warp schedulers with superscalar dispatch capabilities. Due to the different shader architectures and the higher clock rate of the shader domain, the core count can not be directly compared to AMD cores of the Radeon 5000 series (e.g. HD 5850).
Detailed information on the GF104 architecture (and by extension the GF106 and GF108) can be found in the desktop GTX 460 article by Anandtech.
The SLI performance with the current drivers (mid 2010) should be about 15-20% faster than a single GTX 460M in high settings and resolutions. Therefore, all current games (as of 2011) should run fluently in highest detail settings with antialiasing and Full HD resolution, with the exception of Metro 2033 and Crysis.
A novelty of the GF104/106/108 chips is the support for Bitstream HD Audio (Blu-Ray) output via HDMI. Similar to the Radeon HD 5850, the GTX 460M can transfer Dolby True HD and DTS-HD bitstream-wise without quality loss to a HiFi receiver.
The GTX460M offers the PureVideo HD technology for video decoding. The included Video Processor 4 (VP4) supports feature set C and therefore the GPU is able to fully decode MPEG-1, MPEG-2, MPEG-4 Part 2 (MPEG-4 ASP - e.g., DivX or Xvid), VC-1/WMV9, and H.264 (VLD, IDCT, Motion Compensation, and Deblocking). The X500 tester was able to decode the VC-1 encoded Elephants Dream video with about 3-6% CPU load (according to the task manager). The H.264 coded Big Buck Bunny video was played back with 1-3% CPU load (both 1080p videos).
Furthermore, the GPU is able to decode two 1080p streams simultaneously (e.g. for Blu-Ray Picture-in-Picture).
Through CUDA, OpenCL, and DirectCompute 2.1 support, the GeForce GTX 460M can used for general calculations. For example, the stream processor can encode videos considerably faster than can many modern CPUs. Furthermore, physics calculations can be done by the GPU using PhysX (e.g. supported by Mafia 2 or Metro 2033). A single GTX 460M of the SLI combination can also be used for dedicated calculations.
According to Nvidia, support for 3D Vision on the GTX graphics cards is also enabled. It allows the laptop to send 3D content (3D games, 3D Web Streaming, 3D photos, 3D Blu-Rays) to a supported 3D enabled screen or an external 3D TV if compatible.
The power consumption of a single GeForce GTX 460M is supposedly about 72 Watts (TDP including the MXM board and memory). By extension, the GTX 460M SLI should use twice as much power. However, a single 460M in the SLI setup can be deactivated in the drivers to save power. If idle, each chip is clocked at 50/100 MHz and 200/400MHz (chip/shader) in 2D mode and 3D mode, respectively, to save power.
The similar desktop GeForce GTX 460 SLI is based on the GF104 chip and offers significantly more shader cores at 2x336 cores. Therefore, the desktop setup is significantly faster than the 460M SLI and even the 470M SLI graphics solutions.
NVIDIA GeForce GTX 480M► remove from comparison
The NVIDIA GeForce GTX 480M is a high-end graphics card for laptops based on the Fermi architecture. As a result, it supports DirectX 11 and is produced in 40nm at TSMC. Featuring 352 cores, the GTX480M is similar to the desktop GTX 465 but with a lower clock rate. The GeForce GTX 480M is combined with 2 GB of fast dedicated GDDR5 graphics memory and the overall performance should be generally superior to the Mobility Radeon HD 5870.
The GF100 (aka. Fermi) chip was redesigned from scratch and features 3 billion transistors, assuming all 512 shaders are present. Compared to the desktop HD 5870 with 2.13 billion transistors or the Mobility Radeon HD 5870 (RV870) with 1.04 billion transistors, the GTX 480M die is physically quite large.
The mobile Fermi chip is stripped down to 352 shader cores (1-dimensional) with 32 ROPs and 44 Texture Units. The memory bus is 256 bit wide, but due to the fast GDDR5 memory, this should not be a bottleneck. The power consumption has been specified to 100 Watt TDP when taking into account the MXM board and the 2 GB GDDR5. AMD usually specifies the TDP of the chip alone, therefore this value is not directly comparable with competing Radeon cards. Still, the GTX 480M is only suited for very large desktop replacement (DTR) laptops with an excellent cooling solution. Originally, only Clevo was offering the GTX 480M in its 17" (D901F) and 18" (X8100) barebones.
The performance of the Nvidia GeForce GTX 480M is about 10% better on average than the ATI Mobility Radeon HD 5870. In demanding DirectX 11 games and benchmarks (that make use of the Tessellation engine), the GTX 480M is even faster. That means the GTX480M is the fastest single GPU graphics solution for laptops as of early to mid 2010. Modern DirectX 10 games should run fluently in high resolutions and details with antialiasing. Only very demanding games like Crysis or Metro 2033 may need some reduced settings (see list below). Due to the DirectX 11 hardware support (and the good tessellation implementation), the Fermi based laptop graphics card should be well suited for upcoming DirectX 11 games.
As the GeForce 300M series, the GeForce GTX 480M supports PureVideo HD with VideoProcessor 4 (VP4 with Feature Set C). This means the GPU is able to fully decode HD videos in H.254, VC-1, MPEG-2, and MPEG-4 ASP. The graphics card can also accelerate Flash videos by using Flash 10.1 or greater.
The rendering cores of the Nvidia GeForce GTX 480M can be used for general calculations with CUDA or DirectCompute. For example, the encoding of HD videos can be done significantly faster than most modern CPUs by using the shader cores of the GPU instead. PhysX is also supported by the mobile Fermi chip to calculate physics effects in supported games.
Compared to desktop GPUs, the Geforce GTX 480M is comparable to a nunderclocked Nvidia GeForce GTX 465 (607/1200 clock speed) and is therefore on par with a desktop Radeon HD 5770.
NVIDIA GeForce GTX 485M► remove from comparison
The Nvidia GeForce GTX 485M is the fastest graphics card for laptops at the time of announcement (Q1 2011). It is based on the GF104 chip and offers all 384 shader cores and the full 256 Bit memory bus. Due to the high clock rate of 575 MHz, it is significantly faster than the old GeForce GTX 480M of which it replaces. It also supports DirectX 11 and OpenGL 4.0.
Other than the GeForce GTX 480M, the 485M is no longer based on a trimmed down GF100 chip, but on the related GF104 instead. The GF104 is designed for the consumer sector and has a total of 384 cores. A number of cores may be disabled, for example the 470M with only 288 active cores.
The technology of the GF104 differs quite a bit from the GF100 chip (which was actually designed for professional use). The GF104 has more shaders (3x16 vs. 2x16), texture units (8 vs. 4) and SFUs (Special-Funciton-Units) per Streaming-Multiprocessors (SM). Nvidia now uses the superscalar architecture as there are still only two warp schedulers supporting three shader blocks. In theory, this helps to utilize the shaders more efficiently and increases the performance per core.
However, in the worst case, the performance can drop below the GF100 architecture (and its predecessors). The ECC memory protection, important in professional applications, was completely omitted and the FP64 was trimmed down (only 1/3 of the shaders are FP64-capable, only 1/12 of the FP32 performance). Because of these reductions in the GF104, the size of a SM increased only by 25% despite the higher number of shaders.
Note that it is not possible to directly compare the number of cores to the AMD Radeon graphics cards (e.g. HD 5870) or even to Nvidia's own predecessors (e.g., G92b), because shader architecture and clock rates are significantly different in the GF104 chip.
In our extensive test of the GeForce GTX 485M, we found that the GTX485M is significantly faster than the old GeForce GTX 480M (at the same TDP rating). The performance is on a level with two GeForce GTX 460M in SLI mode. Nearly all games are therefore playable in highest details and resolutions. Even demanding games like Mafia 2 or Battlefield Bad Company 2 can run fluently in 1080p with maximus detail settings. Detailed benchmarks can be found at the end of this page.
What's new compared to the GF100 is support for Bitstream transfer of HD Audio (Blu-Ray) via HDMI in the GF104 chips. Similar to the HD 5850, the GTX 485M can transmit Dolby True HD and DTS-HD via Bitstream to compatible receivers without quality loss.
For decoding HD videos, the GTX485M supports PureVideo HD. The built-in Processor 4 (VP4) handles Feature Set C. As a result, MPEG-1, MPEG-2, MPEG-4 Part 2 (MPEG-4 ASP - z.B. DivX or Xvid), VC-1/WMV9 and H.264 can be fully decoded by the graphics card (VLD, IDCT, Motion Compensation, and Deblocking). Furthermore, two streams can be simultaneously decoded in realtime, e.g. Blu-Ray Picture-in-Picture (2x1080p lt DXVAChecker). In addition, PureVideo HD indicates HDCP encoding for digital interfaces.
The shader cores (also called CUDA cores) can also be used for general computations (e.g. Video Transcoding) by using the interfaces CUDA, DirectCompute 2.1 or OpenCL. Thanks to PhysX, the 485M can also perform physics calculations.
According to Nvidia, the support for 3D Vision includes support for the recent HDMI 1.4a standard as well. If enabled by the laptop manufacturer, content such as 3D games, 3D web streaming videos, 3D pictures and 3D Blu-Ray videos can be displayed on a 3D-capable TV (via discrete 3DTV Play) or on the internal notebook 3D display.
With regards to energy demand, the GTX 485M should be on par with the GeForce GTX 480M. In other words, both graphics cards should draw about 100 Watts each when including their respective memory and MXM boards. Due to the higher performance from the 485M, the performance/power efficiency here has clearly been improved.
Compared to desktop graphics cards, the performance of the 485M should be on par with a GeForce GTX 460 768MB which features less cores but operates on higher clock rates.
|NVIDIA GeForce GTX 460M SLI||NVIDIA GeForce GTX 480M||NVIDIA GeForce GTX 485M|
|GeForce GTX 400M Series|
|Pipelines||384 - unified||352 - unified||384 - unified|
|Core Speed||675 MHz||425 MHz||575 MHz|
|Shader Speed||1350 MHz||850 MHz||1150 MHz|
|Memory Speed||1250 MHz||1200 MHz||1500 MHz|
|Memory Bus Width||192 Bit||256 Bit||256 Bit|
|Max. Amount of Memory||2x1536 MB||2048 MB|
|DirectX||DirectX 11, Shader 5.0||DirectX 11, Shader 5.0||DirectX 11, Shader 5.0|
|technology||40 nm||40 nm||40 nm|
|Features||Optimus Support, PureVideo HD VP4, 3D Vision, Bitstream HD Audio, CUDA, DirectCompute, OpenCL, OpenGL 4.0, DirectX 11, SLI Supported|
|Date of Announcement||03.09.2010||25.05.2010||06.01.2010|
|Link to Manufacturer Page||http://www.nvidia.com/object/product-gef...||http://www.nvidia.com/object/geforce_gtx...|