GTC2013

Cray colaborating with Nvidia for GPU-based XE6 blades

| 3 October, 2010

At the recent 2010 GPU Technology Conference in San Jose, California, Cray announced that it is developing blades based on the Nvidia Tesla 20-Series GPUs for use in the Cray XE6 product line. The combination of Cray’s new Gemini system interconnect (which is featured in the Cray XE6 system) paired with Nvidia GPUs will give Cray XE6 customers a powerful combination of scalability and production-quality GPU-based HPC in a single system.

‘We spend a significant amount of time meeting with our customers and talking about how we can build the most scalable HPC systems that blend industry-leading technologies designed to meet their supercomputing needs,’ said Barry Bolding, vice president of Cray’s products division. ‘Our customers have expressed a growing interest in having accelerator technology in Cray systems, and we believe a Cray XE6 blade with Nvidia GPUs will provide the performance, scalability, and reliability that a growing segment of our customer base is looking for. The use of GPUs as accelerators in the HPC marketplace is maturing, and we are excited to be working with Nvidia to offer this functionality to our customers.’

Cray XE6 Specifications

Processor Eight or 12-core 64-bit AMD Opteron 6100 Series processors; up to 192 per cabinet
64k L1 instruction cache, 64k L1 data cache, 512 KB L2 cache per processor core, 12 MB shared L3 cache
Memory 32 GB or 64 GB registered ECC DDR3 SDRAM per compute node
Memory Bandwidth: 85.3 GB/s per compute node
Compute Cabinet Cores: 1,536 or 2,304 processor cores per system cabinet
Peak Performance: 12.2 to 20.2 teraflops per system cabinet
Interconnect 1 Gemini routing and communications ASIC per two compute nodes
  48 switch ports per Gemini chip (160 GB/s switching capacity per chip)
  3-D torus interconnect
System Administration Cray System Management workstation
Graphical and command line system administration
  Single-system view for system administration
  System software rollback capability
Reliability Features (Hardware) Cray Hardware Supervisory System (HSS) with independent 100 Mb/s management fabric between all system blades and cabinet-level controllers
  Full ECC protection of all packet traffic in the Gemini network
  Redundant power supplies; redundant voltage regulator modules
  Redundant paths to all system RAID
  Variable-speed axial turbofan with integrated pressure and temperature sensors
Reliability Features (Software) HSS system monitors operation of all operating system kernels
Lustre file system object storage target failover; Lustre metadata server failover
  Software failover for critical system services including system database, system logger, and batch subsystems
NodeKARE (Node Knowledge and Reconfiguration)
Operating System Cray Linux Environment (components include SUSE Linux SLES11, HSS and SMW software)
Extreme Scalability Mode (ESM) and Cluster Compatibility Mode (CCM)
Compilers, Libraries & Tools PGI compilers, Cray Compiler Environment, PathScale
Support for Fortran 77, 90, 95; C/C++, UPC, Co-Array Fortran
MPI 2.0, Cray SHMEM, other standard MPI libraries using CCM
Cray Apprentice, Cray PAT and Cray Compiler included with systems
Job Management PBS Professional job management system
Moab Adaptive Computing Suite job management system
Platform LSF job management system
External I/O Interface InfiniBand, 10 Gigabit Ethernet, Fibre Channel (FC) and Ethernet
Disk Storage Full line of FC-attached disk arrays with support for FC and SATA disk drives
Parallel File System Lustre, Data Virtualization Service allows support for NFS, external Lustre and other file systems
Power 45-54.1 kW (45.9 – 55.2 kVA) per cabinet, depending on configuration
Circuit requirements: three-phase wye, 100 AMP at 480/277 and 125 AMP at 400/230 (three-phase, neutral and ground)
Cooling Air-cooled, air flow: 3,000 cfm (1.41 m3/s); intake: bottom; exhaust: top
Optional ECOphlex liquid cooling
Dimensions (Cabinet) H 93 in. (2,362 mm) x W 22.50 in. (572 mm) x D 56.75 in. (1,441 mm)
Weight (Maximum) 1,600 lbs. per cabinet (725 kg) air cooled; 2,000 lbs. per cabinet (907 kg) liquid cooled
Acoustical Noise Level 75 dBA at 3.3 ft (1.0 m)
Regulatory Compliance UL 60950-1, CAN/CSA – C 22.2 No. 60950-1, CE-mark, RoHS, WEEE
Safety FCC Class A, VCCI Class A, ICES-003, EN 50022:2006 Class A, AS/NZS CISPR 22:2006, EN 55024: 1998 +A1:2001 +A2:2003

With this future addition, Cray will be able to offer its customers a full range of accelerator solutions for HPC – from the deskside to the supercomputer. The Cray CX line of deskside and midrange systems, including the Cray CX1 and Cray CX1000 systems, are currently available with Nvidia Tesla GPUs.

Cray and Nvidia continue to work together on the development of future GPU accelerator technologies in HPC as a potential path towards exascale computing. Cray is partnering with Nvidia on a team that was recently awarded a $25m research grant from the Defense Advanced Research Projects Agency (DARPA) as part of its Ubiquitous High Performance Computing (UHPC) programme.

‘The Cray XE6 is designed to solve the system challenges of petascale computing just as Tesla GPUs are providing the computational power efficiency to bring this level of computing to the larger HPC community,’ said Andy Keane, general manager, Tesla business at Nvidia. ‘Cray’s planned Tesla GPU-based system will be a powerful combination of technologies that will provide the highest level of performance and scalability for power efficient supercomputing.

Tags: , ,

Category: HPC

Comments are closed.