Profile your application using the Performance Estimator¶
Huxelerate Performance Estimator does not require any source code upload online, to guarantee the privacy of your code. To generate a profiling report, you need to slightly modify the way your source code is compiled, but do not worry, only 5 steps are required:
Compile each C/C++ source file to LLVM Intermediate Representation (IR)
Link all LLVM IR files to a single one
Instrument your LLVM IR linked file
Compile the instrumented LLVM IR file to obtain your executable
Execute your application with a representative dataset to obtain an accurate estimate
Compile sources to LLVM Intermediate Representation (IR)¶
We assume you have a Makefile to compile your application, if not, you can simply perform the same steps by running the commands manually. First of all, we define a convenient variable inside the Makefile to run Huxelerate Performance Estimator Docker Container:
HPE=docker run --rm -v $(shell pwd):/data huxelerate/performance_estimator
The /data
directory is the default working directory inside Huxelerate Performance Estimator Docker Container.
As you can see, when running the Docker Container we map the current working directory
(the one from which the makefile is run) to the /data
directory inside the Docker Container.
With this command we assume that the Makefile is located in the main directory of your application containing
all the source code and relevant files for compilation.
If this is not the case, you can simply change $(shell pwd)
with a specific path in your file system.
Then you need to change your default compiler to the clang compiler inside the Docker Container:
CC= $(HPE) clang
Finally, to compile into LLVM IR, it is enough to add -emit-llvm
flag during object generation:
%.o : %.c
$(CC) $(OTHER_FLAGS) -c -emit-llvm $< -o $@
Important
The Performance Estimator currently does not support profiling of C++ code that makes use of exceptions.
To profile C++ code, make sure to compile it using the -fno-exceptions
compiler flag.
Note
When running commands in the context of the Docker Container, absolute paths will be different from your local machine! Whenever possible try to use relative paths when compiling your code. If you really need absolute paths, make sure to change them so that they refer the same files inside the Docker Container.
Link all IR files to a single one¶
Link all LLVM IR files to a single LLVM IR file (linked.ll) using the llvm-link
command:
linked.ll : $(OBJS)
$(HPE) llvm-link $^ -o linked.ll
Instrument your linked file¶
Use hux-roofline-instrument
command to instrument the linked.ll file.
In this step, you also need to specify the function to profile using the --function
flag:
instrumented.ll : linked.ll
$(HPE) hux-roofline-instrument -i linked.ll -o instrumented.ll --function my_function_to_profile
Compile to obtain your executable¶
The last compiling step is to generate the executable from the instrumented.ll file.
The command to do so, is no different from compiling any other object file.
The only requirement here, is to make sure that the dl
and stdc++
libraries gets linked since they are needed for instrumentation purposes:
executable_name : instrumented.ll
$(CC) $(OTHER_FLAGS) -ldl -lstdc++ instrumented.ll -o $@
Finally, if using a Makefile, a simple make
command with your default option is enough to generate the specific executable file.
Execute your application with a representative dataset to allow a more accurate estimate¶
For performance Estimation purposes, the execution of your application must be performed inside the Huxelerate Performance Estimator Docker Container:
docker run --rm -v $(pwd):/data huxelerate/performance_estimator ./executable_name [executable_options]
The output of the execution is a profiling report file with .hprof extension that you need to upload online to obtain the performance estimation.