Profiling

CPU Profiling

CPU 프로파일링 줄리아 코드를 위한 두 가지 주요 접근 방식이 있습니다:

Via `@profile`

주어진 호출에 대해 @profile 매크로를 통해 프로파일링이 활성화된 곳.

julia> using Profile

julia> @profile foo()

julia> Profile.print()
Overhead ╎ [+additional indent] Count File:Line; Function
=========================================================
    ╎147  @Base/client.jl:506; _start()
        ╎ 147  @Base/client.jl:318; exec_options(opts::Base.JLOptions)
...

Triggered During Execution

이미 실행 중인 작업은 사용자 트리거 시간에 고정된 시간 동안 프로파일링할 수 있습니다.

프로파일링을 트리거하려면:

MacOS 및 FreeBSD (BSD 기반 플랫폼): ctrl-t를 사용하거나 SIGINFO 신호를 julia 프로세스에 전달합니다. 즉, % kill -INFO $julia_pid
Linux: julia 프로세스에 SIGUSR1 신호를 전달합니다. 즉, % kill -USR1 $julia_pid
Windows: 현재 지원되지 않음.

먼저, 신호가 발생한 순간의 단일 스택 추적이 표시되고, 그 다음 1초 프로파일이 수집되며, 다음 양보 지점에서 프로파일 보고서가 제공됩니다. 이는 양보 지점이 없는 코드(예: 타이트 루프)의 경우 작업 완료 시점일 수 있습니다.

Optionally set environment variable JULIA_PROFILE_PEEK_HEAP_SNAPSHOT to 1 to also automatically collect a heap snapshot.

julia> foo()
##== the user sends a trigger while foo is running ==##
load: 2.53  cmd: julia 88903 running 6.16u 0.97s

======================================================================================
Information request received. A stacktrace will print followed by a 1.0 second profile
======================================================================================

signal (29): Information request: 29
__psynch_cvwait at /usr/lib/system/libsystem_kernel.dylib (unknown line)
_pthread_cond_wait at /usr/lib/system/libsystem_pthread.dylib (unknown line)
...

======================================================================
Profile collected. A report will print if the Profile module is loaded
======================================================================

Overhead ╎ [+additional indent] Count File:Line; Function
=========================================================
Thread 1 Task 0x000000011687c010 Total snapshots: 572. Utilization: 100%
   ╎147 @Base/client.jl:506; _start()
       ╎ 147 @Base/client.jl:318; exec_options(opts::Base.JLOptions)
...

Thread 2 Task 0x0000000116960010 Total snapshots: 572. Utilization: 0%
   ╎572 @Base/task.jl:587; task_done_hook(t::Task)
      ╎ 572 @Base/task.jl:879; wait()
...

Customization

프로파일링의 지속 시간은 Profile.set_peek_duration를 통해 조정할 수 있습니다.

프로필 보고서는 스레드와 작업별로 나뉩니다. 이를 재정의하려면 인수가 없는 함수를 Profile.peek_report[]에 전달하세요. 즉, Profile.peek_report[] = () -> Profile.print()를 사용하여 그룹화를 제거할 수 있습니다. 이는 외부 프로필 데이터 소비자에 의해 재정의될 수도 있습니다.

Reference

Profile.@profile — Macro

@profile

@profile <expression>는 주기적으로 백트레이스를 가져오면서 표현식을 실행합니다. 이러한 백트레이스는 내부 백트레이스 버퍼에 추가됩니다.