DS_WRITE GCN instruction
Hello! I've been trying to write GCN ISA assembly code by hand and I just can't get the "DS_" instructions to work.The docs said that the address shouldn't be the same in all threads, because it causes...
View ArticleWhat's wrong with this file?
THIS CODE IS FROM A PROGRAM CALLED BLENDER CYCLES...IT WORKS...BUT DOESN'T WORK WITH THE AMD OPENCL COMPILER,...IN THE BLENDER COMMUNITY SOME BELIEVE THAT THIS CODE IS PROGRAMMED USING CUDA SYNTAX,...
View Articlecompiling SDKUtil on cygwin with g++
Basically, my question in title. Is it possible?I have no problems compiling the library on mingw with g++, on cygwin however, it goes to cl. Is there a way to compile it with g++ on cygwin or are...
View ArticleFunction arguments appear to be ignored.
Test case attached, and reproduced below. The last one or two arguments to foo are always zero for some reason, regardless of the number of arguments. The number of valid arguments appears to always be...
View ArticleI have bought a Amd / Ati Brecht van Lommel (creator of cycles to Blender).
Ihavebought aAmd/AtiBrechtvan Lommel(creator ofcyclestoBlender).I'd thinkitwould be goodifamdwould givegraphicsprogrammersalguanleastlow-endso they canprovide solutions toproblems suchcycles.Andthe big...
View Articledata transfer between cpu and device
Hello forum, I guess There are two ways to transfer data between cpu and gpu in OpenCL. 1. The Natural way - clEnqueueWriteBuffer() / clEnqueueReadBuffer() 2. Map/Unmap way - clEnqueueMapBuffer()...
View ArticleAny hope of this running better?
I have an opencl kernel which reads from an opengl texture (the texture is a render target attached to a framebuffer object). Once the texture is rendered to, I need to run my kernel to calculate...
View ArticleLinux - 7790 memory clock doesn't rise up
I have installed AMD Radeon™ HD7790 Catalyst™ Performance Driver for Linux x86_64.What I have found is aticonfig works a bit different way than it used to and I can't find any guide.aticonfig --odsc...
View ArticleHelp with Aparapi
Hi, I have to design an implement a project that requires a GUI and takes advantage of parallel processing. I thought Aparapi will be the best option as it is easy to implement GUI's in Java. The...
View ArticleOpenCL kernel not run when passed a Renderbuffer
When an image2d_t created from an OpenGL Renderbuffer is passed to a kernel, the kernel completes instantly without actually running whenever it is enqueued. For example, in the following: __kernel...
View Articlemore on memory flags
Hi forum, I am some confusion on the use of memory flags that i want to be clear with. 1. CL_MEM_USE_HOST_PTR - it is only valid if the host pointer is not null. If, specified , it indicates that the...
View ArticlePlease help with clBuildProgram crash / segmentation fault
Hi, probably more for 'driver support' forum, but I would like here as well.I have a problem with openCL program, clBuildProgram does segfault / crash with newer drivers. Please help me improve the...
View ArticleWhy does increasing the number of kernel arguments impacts performance ?
GPU: 7970OS: Kubuntu 12.04 x64Driver: Catlyst 13.4 Hi, I know this sounds crazy but it is actually true. Increasing the number of kernel arguments beyond a certain number causes the performance to...
View ArticleProfiling time for blocking and non-blocking execution
Hi all, I'm new to OpenCL dev and i want to understand some mechanics. I've a simple matrix multiplication kernel, and i want to see the impact of the blocking option for the clEnqueue*...
View ArticleCan't start OpenCL application
Hello guysI'm new in OpenCL.I have Windows 7 64bit, processor amd phenom ii x4 955.When I try starting my project I get such window. And no errors in the log.
View ArticleOpencl code hangs compiler (code & KA2)
I just re-ordered the members of clScanlineContour (may not be aligned now, I haven't been able to test)and added Contour.mError += to lots of lines in MakeContoursFromScanlineGroups Now when I build...
View ArticleKernel performance variability with the same code and same data
Platform:OS: win7 64bitSDK: APP SDK 2.8.1CodeXL: latestedDriver: Catelyst 13.4 Problems:100 times kernel execution with same input data, there are 2-3 Kernel (besides the first one) execution time is...
View ArticleIncluding Extra kernels(user written) in addition to using clAmdBlas kernels.
Hi,i am using the clAmdBlas kernels for some matrix computations and they work very well. Questions:1. how do we include extra kernels(.cl) along with code which has already clAmdBlas functions...
View ArticleSDK 2.8.1 - Compiler crashes/Undeclared function index 1036 (with...
After upgrading drivers/catalyst control center (which allowed me to debug some kernels) I now get some compiler crashes.... The attached kernel code crashes in kernel analyzer (and crashes in code),...
View ArticleWhy discrete GPU works at the same low frequency as the integrated GPU in APU?
Hello, everyone Before describing my problem, I would like to provide my platform information first: OS: Windows 7 64 bit System: Motherboard: GA-A75M-UD2H...
View Article