Monitoring GPU load and temperature programmatically
Hi, I'm using OpenCL in computer vision applications and because of the context of unmanned vehicles, I need to monitor the GPU load and temperature. There is a way to do this with amdconfig --dc...
VGPRs randomly change
I have a function A which I use in 2 separate kernels. I am analyzing the kernels with CodeXL Analyzer. Kernel 1 is using 227 VGPRs and I am trying to reduce that somehow. When I comment out function A in...
What is the right order to queue multiple kernels in a loop
Hello everyone! My code is unstable; what is wrong with it? for(int i=0;i<numIter;i++){ // status =...
Problem with printf extension
I wrote an OpenCL program on Kaveri (A10-7850K), but I want to printf messages in the kernel when I use the device type "CL_DEVICE_TYPE_GPU". I added the "#pragma OPENCL EXTENSION cl_amd_printf : enable" in...
HD 7970 6GB memory allocation
Recently bought an HD 7970 6GB Vapor-X but encountered an issue: using APP SDK 2.9 and 13.12 drivers, the max alloc memory is 1766 MB even after GPU_MAX_HEAP_SIZE=100 GPU_MAX_ALLOC_SIZE=100...
2D array using clSVMAlloc?
Hi, I can write a 2D dynamic array as follows: """int size = 256; int *temp[5]; temp[0] = (int *)malloc(sizeof(int) * size);""" But when I want to dynamically create a 2D array using clSVMAlloc as...
SGPR usage tripled on GCN-1.2 (v8) GPUs
Hi, I've analyzed an OpenCL kernel using CodeXL and I am quite happy with the register usage: on GCN 1.0/1.1 devices the maximum of 10 wavefronts per SIMD can be queued, so hopefully memory latencies...
clCreateKernel SIGSEGVs if used with binary image in OpenCL 1.2
The problem happens only when used with a binary kernel image compiled for the same device. Building from kernel sources works fine. My program uses the FFT_Kernels.cl from your samples. Binary image was...
Is global synchronization in OpenCL possible?
Hello everyone! As is well known, the OpenCL barrier() function works only within a single workgroup, and there is no direct way to synchronize workgroups. If it is possible, what's the best approach for global...
Device queues
I don't mean to hijack your thread, but I'm having problems w/ clCreateCommandQueueWithProperties. Anytime I pass ANY properties through, the function returns CL_INVALID_QUEUE_PROPERTIES. I would...
Mistake in AMD GCN 3 ISA Reference (VOP_SDWA, VOP_DPP)
Hi, I am an amateur. I found a mistake in the AMD GCN3 ISA Reference in the VOP2, VOP1 and VOPC encoding formats. In these places the manual describes values in the SRC0 field and describes the 249 and 250 values as reserved....
Catalyst 14.12 OpenCL problems
Hey AMD Devs, I'm not sure if you know about the problems that were introduced with Catalyst 14.12 (compared to 14.9). I was able to track down at least one of them (the most important one) and wanted...
BSOD / hang-up for OpenCL program
Hi, I have a workstation with three 7970 GHz Tahitis on which self-written OpenCL simulations run. After weeks without problems for simulation #1, now for simulation #2 (most parts are identical, but...
Two S9150s, only one showing up in clGetDeviceIDs()
Hello, I have two S9150 GPUs and when I execute clGetDeviceIDs() I only see two devices: one Hawaii device and one CPU device. However, when I run "clinfo", it shows that I have three devices: two...
OpenCL optimization: removing conditional assignments
Here is a quote from the AMD OpenCL optimization guide:...
CPU device OpenCL C version issue
Hi all, I installed the latest AMD driver to support OpenCL 2.0. The samples work fine. However, when I checked clinfo, I found that my CPU device OpenCL C version is 1.2, while OpenCL 2.0...