Abstract
There is a growing awareness that high-end performance evaluation and tuning require holistic program analysis. In addition to CPU performance characterization, observation of memory, network, and input/output (I/O) performance can help identify execution bottlenecks related to these factors. Correctness of memory and communication operations is also a concern and can affect performance indirectly. This paper describes extensions to the TAU performance system that incorporate direct source-level code instrumentation for tracking dynamic memory management in FORTRAN codes that use allocate and deallocate statements. TAU's lightweight profiling can then generate a detailed report of memory usage, including the sizes of memory blocks allocated and deallocated, with precise program attribution: variable name, source line number, and file name. We report on results and experiences in applying TAU to the PTURBO application.