The Async Compute problem is probably one of the most controversial issues surrounding the older generation of Geforce graphics cards from Nvidia. Something very interesting, however, is present in the press release that they sent out to, well, the press. According to the official statement, the GTX 1080 is fully capable of performing Async Compute. If this turns out to be true, then this will give negate a major edge that Radeon graphics cards from AMD have enjoyed this past year.

Nvidia: GTX 1080 is capable of Async Compute

Keep in mind however, that even Maxwell featured Asynchronous Compute on paper. Unfortunately, due to the fact that expensive software based context switching had to be employed before it could be used, (since it did not have a dedicated hardware scheduler like AMD’s GCN) resulted in lowered performance on Maxwell based graphics cards. Nvidia’s style had been a technique called preemption, which it had perfected to an impressive degree. The reviews are going to be out on May 27th and if independent reviews confirm this fact, than it will be a huge win for the green camp.

Asynchronous compute has been a deal sweetener for Radeon buyers ever since the DirectX 12 API hit the stage. AMD currently leads in Hitman and AOTS which utilize their Asynchronous shader technology developed around DirectX 12 API. Interestingly, Nvidia GPUs historically perform much better without ASync turned on. This is probably due to the fact that Nvidia had apparently disabled ASync from their driver suite. The rationale given for that move is that its GPUs cannot process ASync concurrently on the hardware level, rather they need context switching which is expensive in terms of frame rate.

The Async Compute Story Distilled Down To Its Core

Async Compute has been a hot subject of debate ever since gamers became aware of its very existence. We dove deep a couple of months ago into this peculiar DirectX 12 feature in our two thousand word analysis piece dubbed “AMD’s Secret DirectX 12 Weapon That Nvidia Had To Trade Off – Demystifying Async Compute“. We explained the inherent architectural differences between Nvidia and AMD graphics cards and distilled the key reasons as to why they deal and perform so differently with asynchronous game code. We’d highly recommend giving it a read if you’re looking to wrap your head around this topic and get down to the core of the issue before proceeding.

The following is the relevant extract from the press release:

Five Marvels of Pascal: NVIDIA engineered the Pascal architecture to handle the massive computing demands of technologies like VR. It incorporates five transformational technologies:

Next-Gen GPU Architecture. Pascal is optimized for performance per watt. The GTX 1080 is 3x more power efficient than the Maxwell Architecture.

16nm FinFET Process. The GTX 1080 is the first gaming GPUs designed for the 16nm FinFET process, which uses smaller, faster transistors that can be packed together more densely. Its 7.2 billion transistors deliver a dramatic increase in performance and efficiency.

Advanced Memory. Pascal-based GPUs are the first to harness the power of 8GB of Micron’s GDDR5X memory. The 256-bit memory interface runs at 10Gb/sec., helping to drive 1.7x higher effective memory bandwidth than that delivered by regular GDDR5.

Superb Craftsmanship. Increases in bandwidth and power efficiency allow the GTX 1080 to run at clock speeds never before possible — over 1700 MHz — while consuming only 180 watts of power. “New asynchronous compute advances improve efficiency and gaming performance.” And new GPU Boost™ 3 technology supports advanced overclocking functionality.

Groundbreaking Gaming Technology. NVIDIA is changing the face of gaming from development to play to sharing. New NVIDIA VRWorks™ software features let game developers bring unprecedented immersiveness to gaming environments. NVIDIA’s Ansel™ technology lets gamers share their gaming experiences and explore gaming worlds in new ways.

Async Compute on the GTX 1080 will allow developers to execute some tasks that would otherwise be allocated to the CPU, on the GPU. This means that if a game is being CPU-bound (that is to say the CPU is the bottlenec present), it will drastically increase frame rates. It may even improve performance in games that are GPU bound, by allowing full use of the GPUs resources. The thing we have to keep in mind however that Preemption and Asynchronous compute are both different approaches to achieve the same end result: maximizing the utilization of a GPU. And while AMD will have you believe Async is drastically superior choice, badly implemented Async will fare much worse than properly implemented preemption.

Due to the pressure exerted by the industry to make Async compatible graphics cards however, Nvidia has been working actively to implement Async in their GPUs but were held back due to the fact that this was something that had to be implemented at a hardware level. Chip design usually takes a lot of time (in the lieu of many years) and if Nvidia has actually managed to properly implement Async in the Pascal based GTX 1080 – that would be quite an accomplishment. Something their CEO stated a few weeks back (regarding the P100 being capable of advanced preemption) made us think that Pascal might stick with the Preemption approach for now, but the press release from Nvidia states otherwise. So consider us pleasantly surprised!

NVIDIA launched their ultimate graphics card, the GeForce GTX 1080 today. The GeForce GTX 1080 is part of the GeForce 1000 series which marks the arrival of the first 16nm FinFET based graphics cards and the card can now be bought directly from NVIDIA and a select number of partners. NVIDIA’s partners will also ready custom cards which will be available from 27th May.

NVIDIA GeForce GTX 1080 and 1070 are the most advanced GPU for gamers!

NVIDIA GeForce GTX 1080 Founders Edition Listed For Pre-Order at $699 US – Review Go Live on 17th May

The NVIDIA GeForce GTX 1080 and GeForce GTX 1070 are the first gaming cards to feature the Pascal GPU. We have already covered the features in detail. The latest FinFET architecture dramatically improves performance and efficiency to the GeForce 1000 series cards. Key technologies which include the 16nm FinFET based Pascal chip, the GDDR5X memory components, superior build design and new gaming initiatives combine to make the GTX 1080 the most advanced graphics card ever to be designed by NVIDIA.

“It’s insane. The 1080 is insane, It’s almost irresponsible amounts of performance,” Huang, wearing a black leather jacket and huge grin, told the crowd of hundreds of gamers at Emo’s, one of music-mad Austin’s top entertainment venues. via NVIDIA

ZOTAC’s GeForce GTX 1080 Founders Edition pictured in detail.

GeForce GTX 1080 retail availability from the world’s leading add-in card suppliers, including ASUS, Colorful, EVGA, Gainward, Galaxy, Gigabyte, Innovision 3D, MSI, Palit, PNY and Zotac, will begin at the end of May. Partner boards will vary by region, and pricing is expected to start at $599. If you want to try your hand at winning a GeForce GTX 1080, or even a custom gaming rig, test your puzzle-solving skills in our Order of 10 Challenge.

The NVIDIA GeForce GTX 1080 has now been listed by NVIDIA’s AIB partner ZOTAC where it should soon be made available for users to purchase. Furthermore, OCUK has already made a page where they have listed down the GeForce GTX 1080 and GeForce GTX 1070 for pre-order so expect them to go live really soon.

Our friends at Videocardz also pointed out that the reviews for the GeForce GTX 1080 would go live before the official launch on 27th May allowing users to analyze all metrics of the card before purchase. This indicates NVIDIA being confident about their upcoming release which is something good to see.

The NVIDIA GeForce GTX 1080 and GTX 1070 Pre-Order Page has gone live on OCUK.

Using the same air cooler, NVIDIA was able to overclock their GeForce GTX 1080 to 2.1 GHz. At these insane clocks, the cooler managed to keep the temperature stable at 67C which is insane knowing that this is a reference design. But users who are willing to purchase the non-reference, custom models will be able to do so on 27th May with powerful PCBs and beefy coolers (Air/Water/Hybrid). The non-reference coolers will start at pricing of $599 US.

“NVIDIA’s flagship GeForce GTX 1080 is the most advanced gaming GPU ever created, powered by the high-performance, energy-efficient new NVIDIA Pascal architecture. Pascal enables truly game-changing performance with captivating sights and sounds, as well as next-generation VR performance that offers the lowest latency and plug-and-play compatibility with leading headsets.” via ZOTAC

NVIDIA GeForce GTX 1080 Specifications at a Glance – FinFET, GDDR5X, Async Compute and 9 TFLOPs Compute

The NVIDIA GeForce GTX 1080 features the GP104 GPU (GP104-400-A1) core which comprises of 2560 CUDA cores. The chip houses 7.2 Billion transistors that is really close to the 8 Billion transistor GM200 GPU. The key difference is that GM200 is a 601mm2 chip while the GP104 is a chip that spans less than 320mm2. but comes with vastly improved performance.

The clock speeds are maintained at 1607 MHz base and 1733 MHz boost but the FinFET architecture allows for extreme overclocking as demoed at the event. The chip is built to deliver 9 TFLOPs of compute horserpower. NVIDIA went with 8 GB of GDDR5X (next-gen) memory on the GeForce GTX 1080 graphics card. The memory is featured across a 256-bit bus and clocked at 10 GB/s. This leads to a total bandwidth of 320 GB/s. Like the core, the memory also features great overclocking potential with clock speeds that can be pushed to around 11-12 GB/s.

The NVIDIA GeForce GTX 1080 features a TDP of just 180W. The rated TDP is just 15W higher than the GeForce GTX 980 but at the same time, delivers performance better than dual 980s. Coupled with a very smooth power delivery system to avoid leakage, the chip is one of the most efficient GPU architecture ever designed for PC gamers. Power is fed through a single 8-Pin connector.

NVIDIA GeForce GTX 1080 "Preliminary" Specs:

Graphics Card Name NVIDIA GeForce GTX 980 NVIDIA GeForce GTX 980 Ti NVIDIA GeForce GTX Titan X NVIDIA GeForce GTX 1070 NVIDIA GeForce GTX 1080 NVIDIA GP100

Graphics Core GM204 GM200 GM200 GP104 GP104 GP100

Process Node 28nm 28nm 28nm 16nm FinFET 16nm FinFET 16nm FinFET

Transistors 5.2 Billion 8 Billion 8 Billion 7.2 Billion 7.2 Billion 15.3 Billion

CUDA Cores 2048 CUDA Cores 2816 CUDA Cores 3072 CUDA Cores 2048 CUDA Cores 2560 CUDA Cores 3840 CUDA Cores

VRAM 4 GB GDDR5 6 GB GDDR5 12 GB GDDR5 8 GB GDDR5 8 GB GDDR5X 16/32 GB HBM2

Bus Interface 256-bit bus 384-bit bus 384-bit bus 256-bit bus 256-bit bus 4096-bit bus

Power Connector 6+6 Pin Power 8+6 Pin Power 8+6 Pin Power Single 8-Pin Power Single 8-Pin Power 8+6 Pin Power

TDP 165W 250W 250W 150W 180W 300W (Tesla P100)

Display Outputs 3x Display Port
1x HDMI 2.0
1x DVI 3x Display Port
1x HDMI 2.0
1x DVI 3x Display Port
1x HDMI 2.0
1x DVI 3x Display Port 1.4
1x HDMI 2.0b
1x DVI 3x Display Port 1.4
1x HDMI 2.0b
1x DVI TBC

Launch Date September 2014 May 2015 March 2015 10th June 2016 27th May 2016 TBC

Launch Price $549 US $649 US $999 US $379 US $599 US N/A

Graphics Card Name	NVIDIA GeForce GTX 980	NVIDIA GeForce GTX 980 Ti	NVIDIA GeForce GTX Titan X	NVIDIA GeForce GTX 1070	NVIDIA GeForce GTX 1080	NVIDIA GP100
Graphics Core	GM204	GM200	GM200	GP104	GP104	GP100
Process Node	28nm	28nm	28nm	16nm FinFET	16nm FinFET	16nm FinFET
Transistors	5.2 Billion	8 Billion	8 Billion	7.2 Billion	7.2 Billion	15.3 Billion
CUDA Cores	2048 CUDA Cores	2816 CUDA Cores	3072 CUDA Cores	2048 CUDA Cores	2560 CUDA Cores	3840 CUDA Cores
VRAM	4 GB GDDR5	6 GB GDDR5	12 GB GDDR5	8 GB GDDR5	8 GB GDDR5X	16/32 GB HBM2
Bus Interface	256-bit bus	384-bit bus	384-bit bus	256-bit bus	256-bit bus	4096-bit bus
Power Connector	6+6 Pin Power	8+6 Pin Power	8+6 Pin Power	Single 8-Pin Power	Single 8-Pin Power	8+6 Pin Power
TDP	165W	250W	250W	150W	180W	300W (Tesla P100)
Display Outputs	3x Display Port 1x HDMI 2.0 1x DVI	3x Display Port 1x HDMI 2.0 1x DVI	3x Display Port 1x HDMI 2.0 1x DVI	3x Display Port 1.4 1x HDMI 2.0b 1x DVI	3x Display Port 1.4 1x HDMI 2.0b 1x DVI	TBC
Launch Date	September 2014	May 2015	March 2015	10th June 2016	27th May 2016	TBC
Launch Price	$549 US	$649 US	$999 US	$379 US	$599 US	N/A

The display out puts for the card include Display Port 1.4 (4K @ 120 Hz / 8K @ 60 Hz), HDMI 2.0b (4K @ 60 Hz) and DL-DVI which means that it is capable to support all next-gen displays with new standards. Along with new technologies implemented for gamers such as SMTP, multi-monitors gamers will be able to extract more image quality and better performance from their PCs.NVIDIA GeForce GTX 1080 and GTX 1070 Designed For HDR, VR and 4K Gaming – Support HDMI 2.0b and DP 1.4

The NVIDIA GeForce GTX 1080 and GeForce GTX 1070 fully support HDR technologies which will be arriving in the latest iteration of display panels from various manufacturers.

A Beautiful New Cooler Design – Founders Edition Features Great Acoustics and Advanced Vapor Chamber Cooling

One of the key features of the NVIDIA GeForce GTX 1080 Founders Edition and the GeForce GTX 1070 Founders Edition is that both cards are designed to look great. The founders edition features a beautifully crafted cooling shroud that is built with aluminum alloy. The craftsmanship extends beyond the shroud as the 1080 deploys an advanced vapor chamber to cool the graphics chip. Featuring a dense aluminum fin array, the heatsink allows a cool design that maintains low temperatures and huge room for overclocking.

NVIDIA has also designed a top-notch PCB for the GeForce GTX 1080 to match the looks of the card. This PCB features an array of high-quality components that deliver smooth current to the chip and memory for stable operation while gaming. The PCB also allows enthusiasts and overclockers to get the best clock frequencies out of the GeForce GTX 1080 and GeForce GTX 1070 reference cards.

The Overclockers Dream – 2.1 GHz Core and 5.5 GHz Memory Clocks on Air Cooling

The NVIDIA GeForce 10 series cards are insane in design and performance. As demoed by NVIDIA, the GeForce 10 series cards are a dream come true for overclockers. Even with higher base and boost clocks, the GeForce 10 series cards have a ton of headroom for overclocks. A small demonstration with a high-quality character model revealed that the GeForce GTX 1080 was able to mange overclocked speeds of up to 2.11 GHz for the core and 5.5 GHz (11 GHz effective) clock speed for the GDDR5X memory. All of this was achieved at temperatures of 67 degrees Celsius and at stable 60 FPS.

NVIDIA will be launching the GeForce GTX 1080 Founders Edition on 27th May for $699 US while the GeForce GTX 1070 Founders Edition will be showing up on 10th June for $449 US. AIBs will also be producing reference designs similar to the Founders Edition (same cooler design) and custom models at the launch day for the cards with MSRPs set at $379 US for the GTX 1070 and $599 US for the GTX 1080. Stay tuned for more details!