Profiling Your First CUDA App with GPUFlight Trace
A practical walkthrough of using gpufl trace to capture a CUDA application and upload the result to the GPUFlight dashboard.
GPUFlight engineering notes
Deep dives on building GPU software with C++, Python, CUDA, ROCm, and GPUFlight.
A practical walkthrough of using gpufl trace to capture a CUDA application and upload the result to the GPUFlight dashboard.
GPUFlight adds AMD GPU profiling with ROCm, HIP kernel tracing, telemetry, occupancy analysis, ISA disassembly, and hardware metrics.