Access to the clock register on GPU
I need to do some granular profiling of huge kernels. Nvidia provides access to its clock register via inline assembler: uint clock_time(){ uint clock_time; asm("mov.u32 %0,...
Texture reads return garbage for CLImage2D [with minimal example]
Hello everyone, first I should say that this is a cross-post from http://www.khronos.org/message_boards/viewtopic.php?f=56&t=5456 Since I didn't get an answer after two weeks, I figured I could post...
Size of arguments passed in shuffle2 operation
Can we pass arguments of any size to the shuffle2 operation? That is, if we have two arrays data1[1000] and data2[1000] and a mask mask[1000], can we declare it as...
Compilation under windows
Hi, I'm very new to OpenCL. I'm trying to compile C OpenCL code under Windows 7 without Visual Studio. I always use the console. How can I compile using only the console (cmd) under Windows 7? Can I use icl? I...
How do I parallelize the following sequential code?
I want to make the following kernel code parallel. In the code below, the size of a is n, the size of b and c is 8*n, and the size of d is some value less than n (e.g. 3*n/4): j=0; for(i=0;i<n;i++) { if...
Bug: compiler build option no-bin-llvmir
In the current version of AMD Accelerated Parallel Processing OpenCL (December 2012, rev2.4), section 2.1.4, AMD-Developed Supplemental Compiler Options, mentions the compiler build option...
Find Device Topology on Windows
When will the 'CL_DEVICE_TOPOLOGY_AMD' be available on a Windows machine? I am writing code for multiple GPUs and need to map the GPUs to the PCIe bus numbers. It looks like it is impossible on a...
OCL compile error
Hi, I'm running Win7/64, HD5770, Catalyst 13.1. When my program compiles I receive this output when targeting the GPU (runs fine on CPU): Select device - OpenCL Platform 1/1: Advanced Micro Devices,...
How to fill H264_picture_parameter_2 structure
Hi all, I'm trying to implement an H264 decoder using OpenCL 1.2 and I'm currently studying the OVDecodeRender sample project. Side note: I have been trying to compile the OVDecodeRender.zip sample, but...
OpenCL 8 GPU DGEMM (5.1 TFlop/s double precision). Programming infrastructure...
Anton Efremov, Institute of System Research Russian Academy of Science (NIISI), antonyef@mail.ru; Pavel Bogdanov, Institute of System Research Russian Academy of Science (NIISI), bogdanov@niisi.msk.ru...
amd app debug error
When I debug the project BasicDebug.sln, it shows an error. Why could this happen?
Error: Invalid Address Space
I have attempted to write radix sort code that is partly sequential and partly parallel. The code is giving a runtime error saying ... My code is as follows; I have highlighted the actual sequential part of...
How to bind same physical device between CAL and OpenCL ?
Some AMD GPUs support CAL only, and some support both OpenCL and CAL. There may also be new devices that support OpenCL only. So, if a mix of such devices is present in a PC, one should...
API design
Hi forum, OpenCL spec 1.2 is already out and NVIDIA has no plan so far to adopt it. I am planning to extend an existing API for OpenCL. Since I am provided with the NVIDIA resources for the...
HD7970ghz Peak TFLOPS calculation
In the document 'AMD Accelerated Parallel Processing OpenCL Programming Guide' provided here...
constant memory issue
Hi everyone, I already posted this on the Khronos forums since I'm currently working on an NVIDIA card (soon I'll get an HD Radeon 7950), but I need to resolve this and NVIDIA removed their OpenCL...
Problem with the radix sort code written
In the radix sort code that I have written, the second kernel is not working as I want it to work... In the first kernel I have successfully managed to sort numbers ending with LSB 1 and LSB 0 into...
Is HD Radeon 6630m missing OpenCL driver ?
I am using a 2011 Mac mini with an HD Radeon 6630m. After installing Boot Camp 4 in Windows 7, I cannot find OpenCL installed and cannot run OpenCL applications! I open Catalyst Control Center >...
Error in processing Image pixel values in a float4[image_size] array.
What is wrong with the code given below? Can we declare a large float4 array (within the limits of float, of course) and access it as min_weight_arr.x[id], as I've done below, or is there...
OpenGL object access
Hello. It's a silly question, I guess, but still: is there any way to access objects of OpenGL world from OpenCL code (light position for example)? All that I can touch are just VBOs and textures. What...