Funktionen

Print[PRINT]
.  Home  .  Lehre  .  Vorlesungen  .  Wintersemester 2019/20  .  Introdution to Power-Aware HPC

Introdution to Power-Aware HPC

Lecture in the winter-term 2019/20
Prof. Dr. D. Kranzlmüller,
Dr. Hayk Shoukourian

This course will be held in English!

Welcome to the course webpage for Introdution to Power-Aware HPC for winter-term 2019/20 at LMU Munich. Here you will be able to find the details on the lecture and the accompanying practical project.

News

27.08.2020
The results of the retake exam have been published via UNI2WORK.
30.06.2020
The retake of the exam is now set for Monday, 20.07.2020 from 14:30 to 15:30 CEST. Room: M 018. See the Exam section for more details.
11.03.2020
The retake of the exam is postponed until further notice.
06.03.2020
The retake of the exam is scheduled for Wednesday, 01.04.2020 from 10:15 to 11:15. Room: B U101 (Oettingenstr 67). See the Exam section for more details.
27.02.2020
The examination results have been published via UNI2WORK. The review of the exam results (Klausureinsicht) is scheduled for Friday, 06.03.2020 from 10:00 - 12:00 (Location: Oettingenstrasse 67, Room 056)
30.01.2020
The guest lecture is scheduled for next Wednesday, 05.02.2020.
20.12.2019
The final exam date is set for Tuesday, 11.02.2020. See the exam section for more details.
19.12.2019
As announced during the lecture sessions, the guided tour will take place on Wednesday, 08.01.2020 at 18:15. We will meet in seminar room 2 (LRZ, 85748 Garching bei Muenchen) instead of the lecture session in Oettingenstrasse 67. There will be signs guiding your way to the seminar room once you enter LRZ from the main enterance.

Please don't forget your IDs and please don't be late - we are going to start the tour at 18:15 sharp.

Merry Christmas and a Happy New Year!

30.10.2019
There will be no lecture on Wednesday, 13.11.2019 due to international business travels.
21.10.2019
The guided tour will take place on Wednesday, 08.01.2020 at 18:15. We will meet in seminar room 2 (LRZ, 85748 Garching bei Muenchen) instead of the lecture session in Oettingenstrasse 67. There will be signs guiding your way to the seminar room once you enter LRZ from the main enterance.
06.08.2019
Welcome to the course webpage Introdution to Power-Aware HPC for winter-term 2019/20 at LMU Munich.
  • Registration will be opened on the 15th of August via UNI2WORK
    (NOTE: registration closes on 17.10.2019 at 13:00)
  • The lectures are scheduled for Wednesdays from 18:00 to 20:00. The first lecture will take place on 16.10.2019 at 18:00. The room number is 115 in Oettingenstrasse 67

Contents of the lecture

Some of the current High Performance Computing (HPC) systems already consume more than 15 MW of power - a sufficient amount of power for sustaining a small city. Energy consumption is becoming a dominating factor for the Total Cost of Ownership of many HPC systems, making high-performance design and energy-efficient design in many ways synonymous.

Apart from the high power bills, power consumptions of these magnitudes act as a limiting factor in building and operating Exascale systems, i.e. next generation of HPC systems that are capable of performing 1018 floating point operations per second. This could already cause the entire data center's power delivery and cooling infrastructures to breach the safety limits as well as affect the environmental sustainability by producing high carbon footprint. Therefore, it is important to be preemptive in improving energy/power efficiency of HPC data centers.

This course explores different energy consumption issues in modern HPC data centers, discusses their impacts on the design of new computing systems and presents different strategies that aim to reduce the overall power consumption.

The lecture will cover the main concepts of energy consumption paradigms that should remain valid despite the continuous technological changes in the area.

Upon completion of this course the participants should acquire knowledge on:

  • the importance of power/energy-efficiency for modern data centers
  • the theory behind a variety of impacts that power dissipation in a CMOS chip has on HPC data centers
  • contemporary tools for monitoring different power consumption related metrics
  • diverse techniques on energy-efficiency tuning
  • power-related challenges for next generation HPC systems
  • contemporary resource management and scheduling techniques that are tuned for energy-efficiency
  • power variation in homogeneous HPC systems and the potential of possible cost savings
  • Intel's Model Specific Registers (MSRs) used for power management support
  • principles of various machine learning techniques and their applications for intelligent power management
  • high-frequency data collection techniques
  • datacenter basics (understand the building blocks of modern datacenters and learn about possible architectures)

Audience

The course is intended for master students of computer science and related fields. The lecture and the project work have a cumulative weight of 6 ECTS.

More formally, in German:
Die Vorlesung richtet sich an Master-Studierende der Informatik. Für die Vorlesung und die Projektarbeit werden 6 ECTS-Punkte vergeben.

The number of students will be limited to 20. The registration will open 15.08.2019 from 8:00 via UNI2WORK and will close on 17.10.2019 at 13:00.

Prerequisites:

  • Python knowledge
  • Interest in energy-efficient supercomputing
  • Interest in developing machine-learning frameworks

Dates

  • Lecture: Wednesdays, 18:15 to 19:45 in room 115 in Oettingenstraße 67. The first lecture is held on 16th of October 2019.
  • Guided Tour at Leibniz Supercomputing Centre (LRZ): Wednesday, 08.01.2020, 18:15 to 19:45. Meeting point: seminar room 2, LRZ, 85748 Garching bei Muenchen.
  • Exam: Tuesday, 11.02.2020 (see the exam section for more details)
  • Repeat Exam: Wednesday, 01.04.2020 (see the exam section for more details)

Project: "Increasing Cooling Efficiency of a Data Center"

LRZ_TwinCube

This project aims at building Machine-Learning (ML) based models for predicting the power consumption of a HPC data center's cooling loop. Participants will form groups, where each group will be assigned with an annual operational data obtained at Leibniz Supercomputing Centre (LRZ).

The provided data will contain various sensor measurements from LRZ's building infrastructure.

Each group of students would need to analyze the data, design and develop a ML-based model capable of predicting the power consumption of LRZ's warm-water cooling loop.

During this project students will gain an experience that could be applied not only to HPC data centers but also to other domains involving ML-based modeling.

The detailed description of the project assignment will follow during the lecture.

The training data can be found here Project Section.

Exam

There will be a written examination (closed book) which will be held in February 2020. The exact time and room will be published as soon as possible.

DATE: Tuesday, 11.02.2020

TIME: 10:30 - 11:30

ROOM: A 021, see: LMU Room Finder


Don't forget to register for the exam via Uni2Work. Registration and de-registration are possible before Friday, 31.01.2020 23:59:59 CET

The retake of the exam is scheduled for:

DATE: Wednesday, 01.04.2020

TIME: 10:15 - 11:15

ROOM: B U101, Oettingenstraße 67

Don't forget to register via Uni2Work. Registration and de-registration are possible: before Thursday, 26.03.2020

DATE: Monday, 20.07.2020

TIME: 14:30 - 15:30 CEST

ROOM: M 018, LMU

Don't forget to register via Uni2Work. Registration and de-registration are possible: before Wednesday, 15.07.2020 23:59:59 CEST

Scripts

The lecture notes are available in the Download Section.

Literature

Book

CMOS VLSI Design: A Circuits and Systems Perspective (4th Edition) by Neil Weste, David Harris

Book

Computer Organization and Design RISC-V Edition: The Hardware Software Interface by David A. Patterson, John L. Hennessy

Book

Energy-Efficient Distributed Computing Systems by Albert Y. Zomaya, Young Choon Lee

Book

Machine Learning: A Probabilistic Perspective by Kevin P. Murphy

Book

Machine Learning: An Algorithmic Perspective, second edition by Stephen Marsland

Book

Introduction to Apache Flink: Stream Processing for Real Time and Beyond By Ellen, M.D. Friedman, Kostas Tzoumas

Book

The Data Center as a Computer by Luiz André Barroso, Jimmy Clidaras, Urs Hölzle

Book

Additional scholary articles: sources will be indicated in the course slides






Contact

Via email, or per appointment, or after lectures.