  • EPS
    Members: Free
    IEEE Members: Free
    Non-members: Free
    Length: 0:52:46
12 Sep 2024

Thermal management is becoming an ever more critical challenge for AI chips as power density increases. Both chip-level and facility-level cooling solutions need to be developed and optimized to keep up with demand.

At the chip level, advanced packaging technologies, such as chiplet architectures and heterogeneous architectures like 2.5D, 3D, and 3.5D hybrid-bonded technologies, are becoming increasingly popular for driving performance and cost improvements in AI/ML hardware. However, these solutions also introduce additional complexity and thermal challenges. ASIC cooling technology development is therefore a key strategic enabler for the competitiveness and scalability of AI/ML hardware roadmaps; these technologies aim to address the high total power and increased power density of AI/ML systems.

At the facility level, various cold plate designs and liquid cooling solutions are being developed and are maturing toward large-scale deployment.

This presentation identifies areas for future thermal technology exploration, at both the ASIC and facility levels, that require investment to extend the cooling capabilities of future AI/ML roadmaps. These areas include:

  • Thermal characterization of on-die thermal models
  • Exploration of thermal interface materials
  • Optimization of cold plate performance
  • Evaluation of future embedded cooling solutions
  • Air Assisted Liquid Cooling (AALC) and liquid cooling solutions at the rack level

Investing in these areas will help ensure the continued development of high-performance and scalable AI/ML hardware.
