Quantcast

26 February, 2012

Kepler GK104 cards in production

We got word that Nvidia finally talked to its partners about its highly confidential Kepler products.

They got briefed, they got the reference design and the production of the cards is staring as we speak. It looks like most of the cards available at launch are going to look alike, with different cooling from some partners, but the non-reference partner cards will come a bit later.

Nvidia should be ready to release GK104 by April, but we are not aware of any solid date at this time. Jensen did mention that Nvidia had some lower than expected 28nm yields but also clarified that it is nothing like the transition to 40nm a few years back.
Partners got the briefing

We got word that Nvidia finally talked to its partners about its highly confidential Kepler products.

They got briefed, they got the reference design and the production of the cards is staring as we speak. It looks like most of the cards available at launch are going to look alike, with different cooling from some partners, but the non-reference partner cards will come a bit later.

Nvidia should be ready to release GK104 by April, but we are not aware of any solid date at this time. Jensen did mention that Nvidia had some lower than expected 28nm yields but also clarified that it is nothing like the transition to 40nm a few years back.

Despite some rumors that Kepler might be a good performer, AMD is still enjoying quite good sales of its 28nm products. It is getting ready to finish launching top to bottom portfolio from entry level all the way to high-end.

Source: fudzilla.com
(Read more)

AMD approached Nvidia before ATI acquisition in 2006

News has emerged that Advanced Micro Devices also held talks with graphics processor designer Nvidia about acquiring them before they went ahead and purchased ATI in 2006, according to former AMD employees speaking to Forbes.

At the time many experts in the processor industry believed that shrinking transistors would create an opportunity to add new capabilities to processors designed and built for PCs and servers. AMD’s then chief executive Hector Ruiz bet that his company could overtake arch rival Intel by grabbing a slice of the graphics processing unit (GPU) market.

According to those former employees, Nvidia was the first target, but talks broke down after the video card giant’s chief Executive Jen-Hsun Huang insisted on being made chief executive of the combined company if the acquisition went ahead.
News has emerged that Advanced Micro Devices also held talks with graphics processor designer Nvidia about acquiring them before they went ahead and purchased ATI in 2006, according to former AMD employees speaking to Forbes.

At the time many experts in the processor industry believed that shrinking transistors would create an opportunity to add new capabilities to processors designed and built for PCs and servers. AMD’s then chief executive Hector Ruiz bet that his company could overtake arch rival Intel by grabbing a slice of the graphics processing unit (GPU) market.

According to those former employees, Nvidia was the first target, but talks broke down after the video card giant’s chief Executive Jen-Hsun Huang insisted on being made chief executive of the combined company if the acquisition went ahead.

This resulted in the CPU maker approaching ATI and after talks subsequently purchased them in July 2006 for $5.4 billion. Initially the firm struggled with integrating its newly acquired graphics business, amid multiple launches of strong products from Nvidia, which further eroded ATI’s market share.

AMD has worked hard to fight back, with several very strong GPU releases, including the current world’s fastest single GPU graphics card, the new AMD Radeon HD 7970, which took the spot from Nvidia’s GTX 580.

At the same time Nvidia has been pushing hard into the mobile processors market, a move they were rewarded with last year when Microsoft announced that Windows 8 will run on their ARM processors, alongside processors from Qualcomm and Texas Instruments.

Whilst AMD’s graphics video unit has been achieving good results with several GPU releases, including the AMD Radeon HD6870 and HD6970 models, as well as the dual-GPU HD6990 model, the processor division has been struggling against Intel.

The “bulldozer” AM3+ platform hit retail much later than planned after multiple delays, and despite claims throughout its development cycle that it would compete with the fastest Intel products, it ultimately struggled against Intel’s “Sandy Bridge” line of i5 and i7 quad core processors.

Today, AMD has a market capitalization of just $5.2 billion. Nvidia’s sits at nearly twice that figure, with 9.7 billion.

Source: techspot.com
(Read more)

AMD seizes more GPU market share

According to the latest market share data from Jon Peddie Research, folks in AMD’s graphics division have a good reason to pop open the champers over the weekend.

AMD ended Q4 2011 with a 24.8 percent market share, growing 7.8 percent sequentially over Q3. On the other hand, Nvidia’s share dipped by 3.1 percent, from 16.1 percent to 15.7 sequentially. Intel’s growth also came to a halt. Chipzilla dropped from 60.4 to 59.1 percent.

Looking at the year-on-year figures, the result is even better for AMD. In Q4 of 2010 AMD commanded a 24.2 percent share, while Nvidia was in a close second with 22.5 percent. Now AMD is in a comfortable lead, shipping 58 percent more GPUs than Nvidia. Nvidia also lost share to Intel, as Intel had an overall share of 52.5 percent in Q4 10, and now it is at 59.1 percent.
Extends lead over Nvidia

According to the latest market share data from Jon Peddie Research, folks in AMD’s graphics division have a good reason to pop open the champers over the weekend.

AMD ended Q4 2011 with a 24.8 percent market share, growing 7.8 percent sequentially over Q3. On the other hand, Nvidia’s share dipped by 3.1 percent, from 16.1 percent to 15.7 sequentially. Intel’s growth also came to a halt. Chipzilla dropped from 60.4 to 59.1 percent.

Looking at the year-on-year figures, the result is even better for AMD. In Q4 of 2010 AMD commanded a 24.2 percent share, while Nvidia was in a close second with 22.5 percent. Now AMD is in a comfortable lead, shipping 58 percent more GPUs than Nvidia. Nvidia also lost share to Intel, as Intel had an overall share of 52.5 percent in Q4 10, and now it is at 59.1 percent.

Source: fudzilla.com
(Read more)

AMD Working on Graphics Processor for PlayStation 4 - Report



It is well known that Sony Corp. is developing the next iteration of the popular PlayStation console, the PlayStation 4. What remains unknown is the hardware that powers it. According to a media report, the PlayStation 4 will utilize graphics processing technology designed by Advanced Micro Devices.

A former employees of AMD told Forbes web-site that the company is working on a graphics processing technology for the next-generation PlayStation 4 video game console. The ex-employees did not provide any actual details or evidence about the actual proceedings and also naturally remained anonymous.


Sony PlayStation 4 May Be Powered by AMD

It is well known that Sony Corp. is developing the next iteration of the popular PlayStation console, the PlayStation 4. What remains unknown is the hardware that powers it. According to a media report, the PlayStation 4 will utilize graphics processing technology designed by Advanced Micro Devices.

A former employees of AMD told Forbes web-site that the company is working on a graphics processing technology for the next-generation PlayStation 4 video game console. The ex-employees did not provide any actual details or evidence about the actual proceedings and also naturally remained anonymous.

At present the information should be considered as a rumour as a custom AMD Radeon graphics chip inside the PS4 means that Sony will either have to drop compatibility with PS3 titles on its new consoles, or pay additional royalties to Nvidia Corp., whose chips power the current PlayStation 3 system.

In case the rumours about AMD's custom Radeon graphics processors inside Xbox Next (Loop, Durango) as well as PlayStation 4 are correct, then the company has a reason to celebrate: it is a massive success to power all three next-generation consoles from all three major platform holders, Microsoft, Nintendo, Sony). Such position on the market may be very favourable for AMD as it will allow it to scale its graphics processing architecture beyond consoles, which are the primary game platforms nowadays, to new types of hardware that will be the gaming platforms of tomorrow.

Among other advantages, Radeon HD architecture inside every next-generation consoles will provide AMD a major advantage on the market of personal computers as all major game designers will have to optimize their titles for AMD architecture and therefore the Radeon graphics chips for PCs will have an advantage over competing solutions.

However, developing of three separate graphics cores for the next-gen video game consoles means that AMD will have to offload resources from other projects, such as next-generation GPUs for PC or ultra-portables, and dedicate them to development of new solutions for console platform holders.

Source: xbitlabs.com
(Read more)

Nvidia comes somewhat clean on their quarterly conference call

Nvidia seems to be so far ahead of the curve that they are experiencing problems that are unique in the industry. In their recent year end financial conference call, there was enough said to draw some very grim conclusions.

Today’s conference call was a near complete validation of all the things SemiAccurate has been saying about Nvidia. Remember when we asked if Nvidia could supply Apple? Anyone notice the part about dumping early 28nm capacity, and the disappearance of 28nm Fermi shrinks? Remember how 28nm was not an issue for Nvidia, even if their product roadmap slips said otherwise. How well does this mesh with the quotes from Jen-Hsun himself on the topic?
Some might call the news grim

Nvidia seems to be so far ahead of the curve that they are experiencing problems that are unique in the industry. In their recent year end financial conference call, there was enough said to draw some very grim conclusions.

Today’s conference call was a near complete validation of all the things SemiAccurate has been saying about Nvidia. Remember when we asked if Nvidia could supply Apple? Anyone notice the part about dumping early 28nm capacity, and the disappearance of 28nm Fermi shrinks? Remember how 28nm was not an issue for Nvidia, even if their product roadmap slips said otherwise. How well does this mesh with the quotes from Jen-Hsun himself on the topic?

The financial update from Nvidia a few weeks ago mentioned two factors for the lowering of guidance, the shortage of hard drives, and the pre-Tegra3 drop in Tegra2 sales. The latter is to be expected, especially in light of the awful reception Tegra3 is getting from OEMs. On the other hand, the hard drive shortage that should be affecting AMD GPUs as well as Nvidia GPUs seems to only have hit one company. Funny that.

Lets look at some of the problems that surfaced during the call in more detail, as well as a few other gems that leaked out. On 28nm, the first hint at problems came out a few weeks ago when mysterious analyst reports came out of nowhere saying that many companies were having problems with yields on 28nm. This raised red flags for two reasons, first was the single source in the financial community, something that positively screams ‘planted leak’. Next is the fact that SemiAccurate talks to several companies making 28nm chips at TSMC regularly, and yield is the first topic that comes up.

There was a major change/tweak to the 28nm process last summer. Before that, companies openly grumbled about the process, and could not be described as happy campers. With the wafers that came out last summer, the frowns turned upside down. The mood wasn’t exactly joyous, but things were upbeat and yields were said to be ‘as expected’ or even good by multiple companies. Until the recent ‘revelation’ about yields, that was how it remained.

As usual, SemiAccurate immediately checked out the 28nm yield claims, and found absolutely no change. If anything, yields were better than the last time we checked on them, and no one queried could explain the reports. TSMC has publicly stated that yields are better than expected, and AMD has publicly said similar things about their 28nm experiences. For the record, Nvidia was not one of the companies queried. Ironically, that same report came out five days before Nvidia lowered Q4/2012 guidance citing 28nm yields as one of the causes. While not definitive in any way, the timing sure looks a bit odd.

During today’s call, Nvidia once again blamed TSMC for their yield problems. As we have said, Nvidia seems to be alone in having these problems. Again. The yield problems, combined with the gem that the 40nm per-chip pricing had expired or did not carry over to 28nm, dropped Nvidia’s numbers quite a bit. The view at SemiAccurate is the same as it was before, Nvidia does not have a process, 28nm or 40nm, problem, it has a management problem. Instead of addressing that side of things, once again, the company blames external forces that no one else seems to be running into.

This brings up the question of why if yields are lower than hoped for, and wafer start capacity is desperately short, did Nvidia dump all their early 28nm capacity? Related to that, what happened to the 28nm Fermi shrinks that were on the roadmap? This is pertinent because on the call, the possibility of substituting 40nm chips for 28nm shortfalls was brought up.

That answers our question about Nvidia being able to supply Apple, it looks like they will even if they can’t get enough 28nm wafers. Logic dictates that others won’t be so lucky, and that brings up some rather nasty political ramifications. If AMD can supply 28nm GPUs,one has to wonder about the design wins Nvidia is counting on for Ivy Bridge. Where the chips end up lying is unknown, but low yields and wafer shortages to satisfy high demands are not a recipe for keeping customers.

Two things will solve this problem, TSMC ‘fixing’ their ‘yield problems’ and more wafer supply. Nvidia’s 40nm woes were twofold. Early on, TSMC did have a contamination problem that affected all customers, but once that was solved, Nvidia could never get GF100/110 yields to acceptable levels. Although the company denies it, every industry insider queried by SemiAccurate pointed to design problems that were never rectified. 28nm seems to be a rerun, blaming design problems that are potentially not economically viable to fix on external forces. How much can be done and how quickly is for the moment, unanswerable.

From there, wafer supply comes in to the picture. As AMD’s Llano proved, low yields can eventually be papered over with more wafer starts. If you essentially have a fab to yourself, and wafer starts are gated by tools being rolled on to the floor and calibrated, this is a solvable problem. If you are fighting with many other players for a limited amount of capacity, it becomes an economic problem. He who is willing to pay more gets more. We will leave out the intangible cost of publicly blaming others for your own faults, it is likely unquantifiable.

Once again, Nvidia has a serious problem on its hands. It was allocated capacity and then gave it up mysteriously. How much capacity and for how long is not publicly known though, and could very well be a non-issue going forward. At the time, there were few customers that needed the early starts, a normal situation early in a process ramp. By the spring, Qualcomm is going to be ramping S4/Krait chips on 28nm, and those likely have a very high ASP per mm^2 of silicon. They will also need a lot of wafer starts too, and will be vying for the same fixed capacity.

AMD is also in the game, and the decision about where to make Kaveri and Kabini is said to be made late in this quarter. If you don’t think that will influence wafer supply right about now, please write the author for the full portfolio of for sale bridges in the vast SemiAccurate real estate holdings, the good ones are going fast.

The worst part is Apple, you know, the ones that Nvidia is likely going to burn a few bridges not owned by SemiAccurate to supply GPUs to. Apple is strongly rumored to be ramping A6 production at TSMC right about now. If true, Apple is a new account for TSMC, and they are notoriously finicky about their supply chain. On top of this, the fruit themed widget purveyor is legendary for it’s ironclad contracts that are said to guarantee preferential supply. TSMC is unlikely to step on Apple’s toes for anyone, Nvidia notwithstanding.

Add it all up, and you come to one conclusion, don’t expect 28nm wafer supply to get better for Nvidia any time soon. Same for the 28nm ‘yield problems’, design issues take months to correct if it is possible. 28nm is shaping up to be a mess for Nvidia.

The last gem thrown out on today’s conference call is an absolute shocker, Nvidia is more or less permanently out at Samsung. We say shocker not because it is a surprise to anyone following the mobile market, but because Nvidia actually admitted it in public. Samsung has had a lot of teething problems with it’s Exynos chip line, but those are very likely a thing of the past now. The company has been quite open about its plan to use in-house CPUs/SoCs preferentially, and now has the parts to do it with. For the short and mid-term, this door is closed to any volume for Tegra chips.

If you recall the Q3 financial conference call, Jen-Hsun was quite open about going after the next Kindle Fire design. SemiAccurate did some checking around that and found two pertinent details. First is the 10″ Fire prototypes floating around Taiwan late last year did not have Tegra in them. The roadmaps have definitely changed since then, so Tegra could be in the rumored 8.9″ variant. We suspect this is more of a change to the roadmap of the rumored supplier rather than actions by either Amazon or Nvidia.

On top of this is the other tidbit. A few months ago Nvidia sales people were said to be ordered to win the next Fire design at any cost. This is curious in light of Nvidia IR telling any analyst who didn’t run away fast enough how unpalatable the Fire was. The purported reason is that TI was selling Amazon OMAP chips for the Fire at $7-8, and at those prices, Nvidia was not interested in the design. TI has strongly denied these numbers, but the important point is not the actual sales numbers, it is cost.

It is SemiAccurate’s opinion that Nvidia does not have a product that can compete with TI’s OMAP in this space for a number of reasons. Tegra 3 has a bloated die and is made on an expensive process, Tegra2 has some power use figures that make it unpalatable for the Fire’s target market without massive BoM bloat, and both chips badly lag TI and Qualcomm on I/O. If Nvidia can get a win on the next Fire, it will likely come at a high cost.

That leaves the Tegra line with a very big problem, the number one and number two phone makers, Apple and Samsung not necessarily respectively, are closed to them. The number one and number three tablet maker, Apple and Samsung, this time respectively, are also closed to Nvidia. The number two tablet maker, Amazon, if technically open, is likely to have painfully low margins. The remaining tablet sales can be considered rounding error, but only if you are feeling charitable.

Phones are the remaining potential bright spot. The non-Apple, non-Samsung smartphone market is around 50% of unit sales and growing like mad. Unfortunately, Nvidia is facing a 2012 market where A15 based CPUs will dominate the high (read profitable) end, and Nvidia doesn’t have an A15 based part coming soon. On the low end, the same problems that make the Fire unpalatable apply, high costs, high power use, and bad design choices. Nvidia does not look to have any advantage to speak of in the phone market.

To make matters yet worse, Qualcomm and TI both have A15 based chips coming soon. On top of that, Intel is making a renewed effort to enter phones with their Atom line, and several others are unveiling very competitive SoCs. The once wide open phone market is rapidly becoming overcrowded, and most players have parts much better suited to it. Throw in one major player that has yet to announce their part, and things look ripe for a shakeout.

Getting back to Nvidia, the conference call that just ended was quite grim. According to the words spoken, it is a perfect storm of uncontrollable external forces. If you listen to how those words were spoken, you might notice some voices that normally don’t waver did, and volumes went way down when certain topics were discussed. This could be coincidence, but yield problems that only seem to affect one company and lopsided hard drive shortage problems say otherwise. Don’t look for things to get better from here.S|A

Editor’s note: A quick conversation with Charlie yielded this tidbit about Kepler. His previous article about Kepler did not include yield problems, which at the time of publication were not evident whatsoever. At the time he researched that story he says that everyone, including public statements by Nvidia, said there were no yield problems on TSMC 28nm. I asked his opinion on Kepler still being a winner and he said “It all comes down to execution”. Yield problems change the equation and an update will be forthcoming if Charlie gets solid answers to some questions he asked in return.

Source: semiaccurate.com
(Read more)

13 February, 2012

Possible AMD Radeon HD 7700 Series Specs Revealed

As report first of February in a leaked roadmap, AMD is set to release its HD 7700 series graphics cards based on the Cape Verde. Based on information coming out of German-based 3dcenter.org, we may have our first glimpse at the full specifications for the new Cape Verde series. The HD 7700 series will utilize AMD's GCN stream processors seen with the HD 7900 series. With the HD 7770 series, you'll have a base core-clock speed of 1 GHz, with performance falling between the HD 6850 and HD 6790, at a suggested price around $150 dollars. The HD 7750 should equal performance of the HD 6770/5770 in performance, at a suggested price of around $125 dollars.
With AMD reportedly set to release its Radeon HD 7700 series next week, we get our first peek at the specifications of the HD 7770 and HD 7750.

As report first of February in a leaked roadmap, AMD is set to release its HD 7700 series graphics cards based on the Cape Verde. Based on information coming out of German-based 3dcenter.org, we may have our first glimpse at the full specifications for the new Cape Verde series. The HD 7700 series will utilize AMD's GCN stream processors seen with the HD 7900 series. With the HD 7770 series, you'll have a base core-clock speed of 1 GHz, with performance falling between the HD 6850 and HD 6790, at a suggested price around $150 dollars. The HD 7750 should equal performance of the HD 6770/5770 in performance, at a suggested price of around $125 dollars.

Cape Verde Physical
•Built on TSMC 28 nm process, ~1.5 billion transistors
•10 Graphics CoreNext Compute Units (CUs)
•640 stream processors
•40 TMUs, 16 ROPs
•128-bit wide GDDR5 memory interface

Radeon HD 7770
•All CUs enabled, 640 stream processors
•1 GB GDDR5 memory
•40 TMUs, 16 ROPs
•1000 MHz core clock-speed
•1125 MHz (actual), 4500 MHz (effective) memory clock-speed
•72 GB/s memory bandwidth
•1280 GFLOP/s single-precision floating-point performance
•Typical board power: 80W

Radeon HD 7750
•8 CUs enabled, 512 stream processors
•1 GB GDDR5 memory
•32 TMUs, 16 ROPs
•800 MHz core clock-speed
•1125 MHz (actual), 4500 MHz (effective) memory clock-speed
•72 GB/s memory bandwidth
•819 GFLOP/s single-precision floating-point performance
•Typical board power: 55W

Please keep in mind, of course, that these specifications are from 3dcenter's supposed reliable source. We won't know for sure until AMD shows its hand. Stay tuned!

Source: tomshardware.com
(Read more)

There are two GK104/Kepler variants

Sources are now telling SemiAccurate that Nvidia has two variants of the GK104 in the pipe. These two variants hint at a finer grained fusing ability for the end product.

The two siblings are said to be GK104-400 and GK104-335, basically a full working and partially fused off version of the same chip. The -400 is said to be an “8 group” device, the -335 described as “7 group’. If you recall the sad tale of Fermi/GF100, the chip had large swathes of shaders turned off, the ability to do less radical surgery was not there. This is a fairly painful way to deal with defects, the more granular you can make the disabling, the better off you are.
Not much between them this time

Sources are now telling SemiAccurate that Nvidia has two variants of the GK104 in the pipe. These two variants hint at a finer grained fusing ability for the end product.

The two siblings are said to be GK104-400 and GK104-335, basically a full working and partially fused off version of the same chip. The -400 is said to be an “8 group” device, the -335 described as “7 group’. If you recall the sad tale of Fermi/GF100, the chip had large swathes of shaders turned off, the ability to do less radical surgery was not there. This is a fairly painful way to deal with defects, the more granular you can make the disabling, the better off you are.

Nothing comes for free in the silicon world, and the art of chip design is balancing granularity with cost. Nvidia botched this badly in Fermi, and paid a high price. The only good that came of it was the entertainment in seeing their spokespeople spin ever increasing leaps of logic in public. This however doesn’t placate investors much, even if they do smile.

With this new description of the -400 and -335 variant of GK104, it looks like Nvidia has implemented what AMD has been doing since at least the R700 (HD4000) chips, if not earlier. Instead of being forced to fuse off large blocks of shaders as a minimum, it looks like they can now do much smaller chunks. In Fermi terms, instead of taking a CU at a whack, they can now do portions of a CU too.

This should greatly improve yields, allow for endless SKU variations, and generally make things better. Of course, it comes at a die size penalty, but after the last learning experience, it would be foolish to do otherwise.

Source: semiaccurate.com
(Read more)

10 February, 2012

AMD Radeon HD 7950 Game Benchmarks



A collections of real-world gaming tests using AMD Radeon HD 7950 Game Benchmarks, listed alphabetically.


A collections of real-world gaming tests using AMD Radeon HD 7950 Game Benchmarks, listed alphabetically.
"At an anticipated price tag of $450, it's a cheaper, cooler, and more power-friendly alternative to GeForce GTX 580. In games, there's really no contest in a decision between the two." -tomshardware.com
(Read more)

09 February, 2012

Rambus and Nvidia sign patent agreement

Rambus and Nvidia have signed a patent license agreement that covers the use of Ramubs patented innovations in a brad range of integrated circuit products offered by Nvidia. This patent agreement also settles all claims that Rambus had been pushing against Nvidia for years.

The details regarding the new agreement have been safely locked away and kept confidential. The only thing that was released to public is the actual duration of the agreement that will cover a period of five years. Harold Hughes, president and CEO of Rambus didn't hesitate to point out the importance of this licence agreement that also allows Rambus to move forward with Nvidia and "focus on developing innovative solutions in concert with our licensees to help bring compelling, innovative products to market."
Burry the lawsuit hatchet

Rambus and Nvidia have signed a patent license agreement that covers the use of Ramubs patented innovations in a brad range of integrated circuit products offered by Nvidia. This patent agreement also settles all claims that Rambus had been pushing against Nvidia for years.

The details regarding the new agreement have been safely locked away and kept confidential. The only thing that was released to public is the actual duration of the agreement that will cover a period of five years. Harold Hughes, president and CEO of Rambus didn't hesitate to point out the importance of this licence agreement that also allows Rambus to move forward with Nvidia and "focus on developing innovative solutions in concert with our licensees to help bring compelling, innovative products to market."

You can check out more here.

Source: fudzilla.com
(Read more)

AMD Announces Turks Based FirePro V3900



While AMD’s consumer GPU division is well into its deployment of their first 28nm products, the long validation and certification period for business hardware means that AMD’s business GPU division is still in the process of wrapping up the last of their 40nm product launches. In November AMD launched the Turks based FirePro V4900, and today they’re launching the final member of the current generation FirePro product stack: the FirePro V3900.

If the FirePro V4900 was a business version of the Radeon HD 6670, then the FirePro V3900 is a business version of the Radeon HD 6570 DDR3. Clocked at 650MHz and coupled with 1GB of 900MHz DDR3, the hardware specs are identical to the DDR3 version of the Radeon HD 6570.


While AMD’s consumer GPU division is well into its deployment of their first 28nm products, the long validation and certification period for business hardware means that AMD’s business GPU division is still in the process of wrapping up the last of their 40nm product launches. In November AMD launched the Turks based FirePro V4900, and today they’re launching the final member of the current generation FirePro product stack: the FirePro V3900.


 

AMD FirePro V7900

AMD FirePro V5900

AMD FirePro V4900

AMD FirePro V3900

Stream Processors

1280

512

480

480

Texture Units

80

32

24

24

ROPs

32

32

8

8

Core Clock

725MHz

600MHz

800MHz

650MHz

Memory Clock

1.25GHz (5GHz data rate) GDDR5

500MHz (2GHz data rate) GDDR5

1GHz (4GHz data rate) GDDR5

900MHz (1.8GHz data rate) DDR3

Memory Bus Width

256-bit

256-bit

128-bit

128-bit

VRAM

2GB

2GB

1GB

1GB

FP64

N/A

N/A

N/A

N/A

Transistor Count

2.64B

2.64B

716M

716M

TDP

<150W

<75W

<75W

<50W

Manufacturing Process

TSMC 40nm

TSMC 40nm

TSMC 40nm

TSMC 40nm

Price Point

N/A

N/A

$189

$119

If the FirePro V4900 was a business version of the Radeon HD 6670, then the FirePro V3900 is a business version of the Radeon HD 6570 DDR3. Clocked at 650MHz and coupled with 1GB of 900MHz DDR3, the hardware specs are identical to the DDR3 version of the Radeon HD 6570.

For this reason the FirePro V3900 compares to the V4900 in much the same way the 6570 and 6670 do. While the V3900 has a lower core clock (650MHz vs. 800MHz), it’s otherwise a fully functional Turks GPU just like the V4900. The bigger reason for their performance difference is that while the V4900 uses GDDR5, the V3900 uses DDR3, giving it less than half the memory bandwidth and a similar overall performance drop compared to the V4900.

Of course the tradeoff for this drop in performance is size and power consumption. While the V4900 was a full profile card rated for 75W the V3900 is a low-profile card rated for 50W, with most of those power savings coming from switching out GDDR5 for DDR3. This makes the V3900 unique in that it’s the only low-profile FirePro card in AMD’s lineup – though it should be noted that for compatibility purposes it will be shipping with its full-profile bracket installed while the low-profile bracket will be in the box.

AMD will be releasing the V3900 today, with a price of $119. This positions it directly against NVIDIA’s GT216 based Quadro 400, and roughly $50 below NVIDIA’s GF108 based Quadro 600. For the V3900 AMD will be heavily leaning upon the fact that Turks can drive 5 monitors. However as with the V4900 this feature is effectively MIA until DisplayPort MST hubs ship this summer, as without the hub the card can only drive up to 2 monitors via its DP 1.2 and DL-DVI ports.

More immediately, on paper the V3900 should be far more powerful than the architecturally ancient Quadro 400. But as this is the professional market AMD’s real competition is NVIDIA’s certification and support, more so than their performance at any given price.

On that note, since AMD already launched a Turks based FirePro last year the certification process should be rather straightforward. Products (rather than GPUs) are individually certified, but as AMD already worked out any Turks driver kinks for the V4900 there shouldn’t be any surprises in store for the V3900.

Finally, it’s interesting to note that with this launch AMD has effectively committed to keeping Turks around for quite some time. AMD’s 3 year FirePro lifecycle means that the V3900 will be available until at least February of 2015, some 4 years after the first Turks products launched. Given Turks’ continual recurrence through 2012 in OEM laptops, desktops, and now professional cards, it’s clear that it’s living up to its position of being AMD’s low cost, high volume anchor GPU for the 40nm generation.

Source: anandtech.com
(Read more)

04 February, 2012

HD 7950 in healthy quantity in EU, US

AMD’s new HD 7950 is the talk of town these days and for good reason, too. The second iteration of AMD’s Tahiti GPU costs quite a bit less than the HD 7970, yet it’s fast enough to wipe the floor with Nvidia’s GTX 580.

From what we’ve been hearing, demand is good, but luckily there is no shortage of HD 7950’s in retail. A few dozen shops in mainland Europe have plenty of cards in stock and the prices are pretty good for a brand new product. In Germany, you can pick up a PowerColor reference card for as little as €386, or a Sapphire card for €397. You can even get a slightly overclocked 830MHz card from MSI for €393. In case you are looking for something in the 880MHz – 900MHz range, you have a choice of overclocked cards from Gigabyte, MSI, Sappire, PowerColor and XFX, with prices starting at €441. The HD 7970 sells for €519+, while the cheapest GTX 580 3GB cards go for €460+.
Prices starting at $449, sub-€400

AMD’s new HD 7950 is the talk of town these days and for good reason, too. The second iteration of AMD’s Tahiti GPU costs quite a bit less than the HD 7970, yet it’s fast enough to wipe the floor with Nvidia’s GTX 580.

From what we’ve been hearing, demand is good, but luckily there is no shortage of HD 7950’s in retail. A few dozen shops in mainland Europe have plenty of cards in stock and the prices are pretty good for a brand new product. In Germany, you can pick up a PowerColor reference card for as little as €386, or a Sapphire card for €397. You can even get a slightly overclocked 830MHz card from MSI for €393. In case you are looking for something in the 880MHz – 900MHz range, you have a choice of overclocked cards from Gigabyte, MSI, Sappire, PowerColor and XFX, with prices starting at €441. The HD 7970 sells for €519+, while the cheapest GTX 580 3GB cards go for €460+.

In the US, prices start at $449 for reference cards, while overclocked models retail for €489 or €499, depending on the clock and vendor. This is a full $100 less than the HD 7970 or GTX 580 3GB.

All in all AMD has done a pretty good job with the HD 7950. Quite a few consumers find the HD 7970 a tad too expensive, so the HD 7950 looks like a great deal, particularly in Europe, as it is up to 30 percent cheaper than the flagship HD 7970.

Source: fudzilla.com
(Read more)

Physics hardware makes Kepler/GK104 like crazy

Nvidia’s Kepler/GK104 chip has an interesting secret, a claimed Ageia PhysX hardware block that really isn’t. If you were wondering why Nvidia has been beating the dead horse called PhysX for so long, now you know, but it only gets more interesting from there.

Sources tell SemiAccurate that the ‘big secret’ lurking in the Kepler chips are optimisations for physics calculations. Some are calling this PhysX block a dedicated chunk of hardware, but more sources have been saying that it is simply shaders, optimisations, and likely a dedicated few new ops. In short, marketing may say it is, but under the heat spreader, it is simply shaders and optimisations.
That is the marketing claim anyway

Nvidia’s Kepler/GK104 chip has an interesting secret, a claimed Ageia PhysX hardware block that really isn’t. If you were wondering why Nvidia has been beating the dead horse called PhysX for so long, now you know, but it only gets more interesting from there.

Sources tell SemiAccurate that the ‘big secret’ lurking in the Kepler chips are optimisations for physics calculations. Some are calling this PhysX block a dedicated chunk of hardware, but more sources have been saying that it is simply shaders, optimisations, and likely a dedicated few new ops. In short, marketing may say it is, but under the heat spreader, it is simply shaders and optimisations.

The market has treated hardware PhysX like an unexplained sore that shows up a week after a night you can’t remember through a tequila induced haze. Numbers vary about the absolute magnitude of PhysX’s overwhelming success, but counts of 2011 game releases supporting hardware acceleration range from a low of two to a high of six. The snowball has pretty much stopped rolling, or to be more accurate, it never started, all the developers SemiAccurate spoke with indicate that their use of PhysX hardware acceleration was a cash flow positive experience, but we didn’t talk to all six listed.

With this new bit of information, one big question is answered, but specific hardware implementations details are a bit murky. Is the ‘hardware block’ dedicated to physics calculations when there are some being issued, or is it a AMD/GCN like multiple instruction issue? Is it just shaders with an added op or two that speed up math routines heavily used by physics simulations? How much die area is spent on this functionality? This isn’t very clear, and given the marketing materials Semiaccurate has seen, explanations will only serve to impede the impending hype.

That said, SemiAccurate is told Kepler/GK104 will be marketed as having a dedicated block, and this will undoubtedly be repeated everywhere, truth not withstanding. Luckily, since most of the target audience isn’t technically literate, it may “become fact” through the VIECOOCDF (Vast Internet Echo Chamber Of Often Repeated Dubious Facts). Lowering the collective intelligence can be profitable if not ethical. Luckily, the story doesn’t end here, it gets much worse.

This part ties in to the story SemiAccurate published a few weeks ago saying that Nvidia would win this generation. A lot of people have been asking about Kepler/GK104 performance and if it is really that good. The short story is yes and no, depending on your views on some very creative ‘optimisations’ around physics.

We stated earlier, Kepler wins in most ways vs the current AMD video cards. How does Nvidia do it with a $299 card? Is it raw performance? Massive die size? Performance per metric? The PhysX ‘hardware block’? Cheating? The easy answer is yes, but lets go in to a lot more detail.

GK104 is the mid-range GPU in Nvidia’s Kepler family, has a very small die, and the power consumption is far lower than the reported 225W. How low depends on what is released and what clock bins are supported by the final silicon. A1 stepping cards seen by SemiAccurate had much larger heatsinks than the A2 versions, and recent rumours suggest there may be an A3 to fix persistent PCIe3 headaches.

To date, an A3 spin has not been confirmed, but if it is necessary, it will likely push out the late March/early April release date by at least a month. One other possibility is for Nvidia to pull an Intel and release cards without the official PCI SIG stamp, adding it when A3 silicon is available. In any case, the number of PCIe3 supporting computers on the market is minimal, so functionally speaking, it doesn’t matter. You may loose a small bit of theoretical performance, but for a mid-range part, it is unlikely to be noticeable. Marketing is a completely different story though, one not closely tied to the reality most of us live in.

The architecture itself is very different from Fermi, SemiAccurate’s sources point to a near 3TF card with a 256-bit memory bus. Kepler is said to have a very different shader architecture from Fermi, going to much more AMD-like units, caches optimised for physics/computation, and clocks said to be close to the Cayman/Tahiti chips. The initial target floating among the informed is in the 900-1000MHz range. Rumours have it running anywhere from about 800MHz in early silicon to 1.1+GHz later on, with early stepping being not far off later ones. Contrary to some floating rumours, yields are not a problem for either GK104 or TSMC’s 28nm process in general.

Performance is likewise said to be a tiny bit under 3TF from a much larger shader count than previous architectures. This is comparable to the 3.79TF and 2048 shaders on AMD’s Tahiti, GK104 isn’t far off either number. With the loss of the so called “Hot Clocked” shaders, this leaves two main paths to go down, two CUs plus hardware PhysX unit or three. Since there is no dedicated hardware physics block, the math says each shader unit will probably do two SP FLOPs per clock or one DP FLOP.

This would be in line with the company’s earlier claims of a large jump in compute capabilities, but also leads to questions of how those shaders will be fed with only a 256-bit memory path. Given the small die sizes floating around, it is unlikely to be Itanium-esque brute forcing through large caches. The net result is that shader utilisation is likely to fall dramatically, with a commensurate loss of real world performance compared to theoretical peak.

In the same way that AMD’s Fusion chips count GPU FLOPS the same way they do CPU FLOPS in some marketing materials, Kepler’s 3TF won’t measure up close to AMD’s 3TF parts. Benchmarks for GK104 shown to SemiAccurate have the card running about 10-20% slower than Tahiti. On games that both heavily use physics related number crunching and have the code paths to do so on Kepler hardware, performance should seem to be well above what is expected from a generic 3TF card. That brings up the fundamental question of whether the card is really performing to that level?

This is where the plot gets interesting. How applicable is the “PhysX block”/shader optimisations to the general case? If physics code is the bottleneck in your app, A goal Nvidia appears to actively code for, then uncorking that artificial impediment should make an app positively fly. On applications that are written correctly without artificial performance limits, Kepler’s performance should be much more marginal. Since Nvidia is pricing GK104 against AMD’s mid-range Pitcairn ASIC, you can reasonably conclude that the performance will line up against that card, possibly a bit higher. If it could reasonably defeat everything on the market in a non-stacked deck comparison, it would be priced accordingly, at least until the high end part is released.

All of the benchmark numbers shown by Nvidia, and later to SemiAccurate, were overwhelmingly positive. How overwhelmingly positive? Far faster than an AMD HD7970/Tahiti for a chip with far less die area and power use, and it blew an overclocked 580GTX out of the water by unbelievable margins. That is why we wrote this article. Before you take that as a backpedal, we still think those numbers are real, the card will achieve that level of performance in the real world on some programs.

The problem for Nvidia is that once you venture outside of that narrow list of tailored programs, performance is likely to fall off a cliff, with peaky performance the likes of which haven’t been seen in a long time. On some games, GK104 will handily trounce a 7970, on others, it will probably lose to a Pitcairn. Does this mean it won’t actually do what is promised? No, it will. Is this a problem? Depends on how far review sites dare to step outside of the ‘recommenced’ list of games to benchmark in the reviewers guide.

Ethically, this could go either way, and in a vacuum, we would be more than willing to say that the cards are capable of very high performance. The problem is that the numbers that Nvidia will likely show off at the launch are not in a vacuum, nor are they very real, even considering the above caveats. Nvidia is going out of their way to have patches coded for games that tend to be used as benchmarks by popular sites.

Once again, this is nothing new, and has been done many times before. One example that is often mentioned is Starcraft II’s use of stencil buffers. People with inside knowledge of that game’s development have said that Nvidia gave Blizzard help in coding some parts of the game during the final ‘crunch’ period. The code is said to heavily use stencil buffers to fix some issues and patch over minor glitches. Again nothing unusual, AMD, Intel, and almost everyone else does this on a case by case basis, especially for AAA titles released in conjunction with new hardware.

Since Nvidia’s Fermi generation GPUs are very good at handling stencil buffers, they perform very well on this code. Again, this is normal practice, Nvidia put in the effort and now reaps the benefits, good for them. What is odd about this case, is that several knowledgeable sources have said that the code actually net decreases performance on both cards. The above tale may be anecdotal, but Starcraft 2′s release code sure seemed to use stencil buffers a lot more than you would expect, unreasonably so according to many coders. This however doesn’t constitute proof in any way, but it fits what SemiAccurate has seen Nvidia do in prior cases.

More to the point is antialiasing (AA) in Batman: Arkham Asylum. If you recall, AMD stated complaining about that game’s AA routines upon release. They directly stated that if AMD cards were detected, the game would disable AA for non-technical reasons. (Note: The original post that TechPowerUp refers to has the pertinent sections in the comments, not on the front page. It takes a little searching to find the post that also talks about several other games having similar ‘bugs’.) It goes on to state that if the card IDs were changed, the AA in the game functioned correctly on ATI hardware.

Short story, this turned in to the proverbial “epic pissing match”, with Nvidia claiming that it was Eidos that owned the code, and they were free to do with it as they feel fit. This is technically true. Unfortunately, emails seen by SemiAccurate directly contradict this. Those emails state unequivocally that Eidos should not change code written by Nvidia and provided to Eidos as a part of Batman: Arkham Asylum. At the point they were questioned on why, Eidos says they could not do anything due to advice of their attorneys.

Since it was the attorneys objecting, not the coders, we can only speculate that this was due to Nvidia’s financial sponsorship of the game, not any technical reason. Since sources tell SemiAccurate that Batman: Arkham Asylum only uses standard DirectX calls to implement AA, and it appears to function if the graphics card IDs are changed, this seems to be nothing other than Nvidia directly sabotaging their competition and not allowing AMD remove the lockout. Go and re-read the statements from AMD/ATI, Nvidia, and Eidos, then draw your own conclusions.

Why do we bring these two cases up in a Kepler article? Well, we hear that it is happening again. Both AMD and Nvidia have developers that they can and do ‘embed’ at game companies. This is an old and quite legitimate practice for GPU and non-GPU hardware companies. Everyone does it. It can be done ethically or not, with net performance gains for the end user or not, and with the intent to hurt or harm. In general, the more marketing money involved, the more most developers are willing to go out on a shaky ethical limbs.

One last really good example, tesselation. High end Fermi cards, GF100/110/GTX480/GTX580 are heavily biased toward geometry performance. Since most modern GPUs can compute multiple triangles per displayable pixel on any currently available monitor, usually multiple monitors, doubling that performance is a rather dubious win. Doubling it again makes you wonder why so die area was wasted.

Since Nvidia did waste that die area, helping games show that prowess off is a good thing for users, right? Look at Crysis 2, a AAA title that is heavily promoted by Nvidia, it positively flies on Fermi based cards, but performance on AMD GPUs is far less impressive. Why? The amazing detail in things like the concrete blocks, brick walls, and vast expanses of realistically modelled water. Breathtaking isn’t it? All thanks to Nvidia’s efforts to make the game experience better on their hardware. How could this be interpreted as anything but a win for users by a reasonable observer?

Nvidia is said to have around 15 developers they can embed at companies to help ‘optimise’ their code, ‘fix bugs’, and work out ‘performance problems’, even if those problems are not on Nvidia hardware. The count for other companies is less clear, but unlikely to be much different. Sources tell SemiAccurate that about half of them are currently working at Eidos on, wait for it, a patch for the recently released Batman: Arkham City game. Since Both the original and and the new Batman games are flag bearers for Nvidia’s hardware/GPU PhysX acceleration, it doesn’t take a genius to connect the dots. Since neither the patch or Kepler based video cards are out yet, we can only wait to see what the end result is.

If the purported patch does change performance radically on specific cards, is this legitimate GPU performance? Yes. How about if it raises performance on Kepler cards while decreasing performance on non-Kepler cards to a point lower than pre-patch levels? How about if it raises performance on Kepler cards while decreasing performance only on non-Nvidia cards? Which scenario will it be? Time will tell.

How many other games have had this level of attention and optimisation gifted upon them is another open question. One thing we can say is that the list of benchmarks shown off by Nvidia where Kepler has an overwhelming advantage all support PhysX. This is not to say that they are all hardware/GPU PhysX accelerated, they are not, most use the software API.

This is important because it strongly suggests that Nvidia is accelerating their own software APIs on Kepler without pointing it out explicitly. Since Kepler is a new card with new drivers, there is no foul play here, and it is a quite legitimate use of the available hardware. Then again, they have been proven to degrade the performance of the competition through either passive or active methods. Since Nvidia controls the APIs and middleware used, the competition can not ‘fix’ these ‘problems with the performance of their hardware’.

Going back to Kepler, we see that this happy and completely ethical game is going to be starting round 3, or round 17, depending on how you count. Nvidia appears to be stacking the playing field to both cripple the competition and raise their own performance. Is the performance of Kepler cards legitimate? Yes. Is it the general case? No. If you look at the most comprehensive list of supported titles we can find, it is long, but the number of titles released per year isn’t all that impressive, and anecdotally speaking, appears to be slowing.

When Kepler is released, you can reasonably expect extremely peaky performance. For some games, specifically those running Nvidia middleware, it should fly. For the rest, performance is likely to fall off the proverbial cliff. Hard. So hard that it will likely be hard pressed to beat AMD’s mid-range card.

What does this mean in the end? Is it cheating? Is it ethical? Is Kepler/GK104 going to be worth the money? Will it beat AMD’s 7970? These are all subjective decisions for you to make. What software will Nvidia show off as benchmarks to promote Kepler’s performance? That list is a little narrower. What will happen to sites that dare to test software that is not ‘legitimately accelerated’? No idea, but history offers some clues. One thing you can say for sure is that the information released prior to and with the card is unlikely to be the whole story. Legitimacy, performance, honesty, and ethics are unlikely to resemble the official talking points, and the whole truth is likely to be hidden from prying eyes for very partisan reasons. Big grains of salt around this one people, be very skeptical of everything you hear, and take nothing at face value.

Source: semiaccurate.com
(Read more)

Random Post