Finding the maximum value of an array in CUDA

Answered by 折柳成萌 on 2018-06-01


For a 2D array, allocate with cudaMallocPitch() and copy with cudaMemcpy2D(). Use threadIdx and blockIdx to access each element. Check the definition of pitch before you start working on a 2D matrix: cudaMallocPitch() pads each row so that the allocation meets the memory alignment requirements, and the returned pitch is the resulting row width in bytes.

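// Device code (defined first so the launch below compiles)
__global__ void MyKernel(float* devPtr, size_t pitch, int width, int height)
{
    // Note: every thread walks the whole array; this only shows the access pattern
    for (int r = 0; r < height; ++r)
    {
        // Rows are pitch bytes apart, so offset through a char* before casting back
        float* row = (float*)((char*)devPtr + r * pitch);
        for (int c = 0; c < width; ++c)
        {
            float element = row[c];
            (void)element; // placeholder: use the element here
        }
    }
}

// Host code
int width = 64, height = 64;
float* devPtr;
size_t pitch;
// pitch receives the padded row width in bytes
cudaMallocPitch(&devPtr, &pitch, width * sizeof(float), height);
MyKernel<<<100, 512>>>(devPtr, pitch, width, height);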
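
To tie the pitched layout back to the original question, here is a minimal sketch, not the answerer's code, that copies a matrix with cudaMemcpy2D() and assigns one thread per row through blockIdx and threadIdx. The names rowMaxKernel and matrixMax are hypothetical. The sketch assumes a tightly packed row-major host matrix with at least one row and one column, and it omits error checking for brevity.

```cuda
#include <vector>
#include <cuda_runtime.h>

// One thread per row: each thread scans its own row of the pitched
// array and writes that row's maximum to rowMax[r].
__global__ void rowMaxKernel(const float* devPtr, size_t pitch,
                             int width, int height, float* rowMax)
{
    int r = blockIdx.x * blockDim.x + threadIdx.x;
    if (r >= height) return;  // surplus threads in the last block do nothing

    // Rows are pitch bytes apart, so offset in bytes before casting back.
    const float* row = (const float*)((const char*)devPtr + r * pitch);
    float m = row[0];
    for (int c = 1; c < width; ++c)
        m = fmaxf(m, row[c]);
    rowMax[r] = m;
}

// Hypothetical host helper: maximum of a width x height row-major matrix.
// Assumes width >= 1 and height >= 1; error checking omitted.
float matrixMax(const float* host, int width, int height)
{
    float* devPtr = nullptr;
    float* devRowMax = nullptr;
    size_t pitch = 0;
    cudaMallocPitch(&devPtr, &pitch, width * sizeof(float), height);
    cudaMalloc(&devRowMax, height * sizeof(float));

    // Host rows are width * sizeof(float) bytes apart;
    // device rows are pitch bytes apart.
    cudaMemcpy2D(devPtr, pitch, host, width * sizeof(float),
                 width * sizeof(float), height, cudaMemcpyHostToDevice);

    int threads = 128;
    int blocks = (height + threads - 1) / threads;
    rowMaxKernel<<<blocks, threads>>>(devPtr, pitch, width, height, devRowMax);

    // cudaMemcpy waits for the kernel to finish before copying.
    std::vector<float> rowMax(height);
    cudaMemcpy(rowMax.data(), devRowMax, height * sizeof(float),
               cudaMemcpyDeviceToHost);
    cudaFree(devPtr);
    cudaFree(devRowMax);

    // Final pass on the host over the per-row maxima.
    float m = rowMax[0];
    for (int r = 1; r < height; ++r)
        if (rowMax[r] > m) m = rowMax[r];
    return m;
}
```

Calling matrixMax(data, width, height) from host code returns the largest element. One thread per row is simple but leaves most of the GPU idle for small matrices; it is meant to show the indexing, not to be fast.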

It would be easier to start with a 1D array.
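
For the 1D case, one common approach is a shared-memory tree reduction. The following is a minimal sketch under stated assumptions rather than a tuned implementation: blockMaxKernel and arrayMax are hypothetical names, the block size is a power of two, the array has at least one element, and error checking is omitted.

```cuda
#include <cfloat>
#include <vector>
#include <cuda_runtime.h>

// Each block reduces its share of the input to a single maximum.
// Assumes blockDim.x is a power of two.
__global__ void blockMaxKernel(const float* in, int n, float* blockMax)
{
    extern __shared__ float sdata[];
    unsigned int tid = threadIdx.x;

    // Grid-stride loop: each thread folds several elements into a local max.
    float m = -FLT_MAX;
    for (int i = blockIdx.x * blockDim.x + tid; i < n;
         i += blockDim.x * gridDim.x)
        m = fmaxf(m, in[i]);
    sdata[tid] = m;
    __syncthreads();

    // Tree reduction: halve the number of active threads at each step.
    for (unsigned int s = blockDim.x / 2; s > 0; s >>= 1) {
        if (tid < s)
            sdata[tid] = fmaxf(sdata[tid], sdata[tid + s]);
        __syncthreads();
    }
    if (tid == 0)
        blockMax[blockIdx.x] = sdata[0];
}

// Hypothetical host helper: maximum of n floats (n >= 1).
float arrayMax(const float* host, int n)
{
    const int threads = 256;                    // power of two
    int blocks = (n + threads - 1) / threads;
    if (blocks > 64) blocks = 64;               // grid-stride loop covers the rest

    float* devIn = nullptr;
    float* devBlockMax = nullptr;
    cudaMalloc(&devIn, n * sizeof(float));
    cudaMalloc(&devBlockMax, blocks * sizeof(float));
    cudaMemcpy(devIn, host, n * sizeof(float), cudaMemcpyHostToDevice);

    // Third launch parameter: bytes of dynamic shared memory per block.
    blockMaxKernel<<<blocks, threads, threads * sizeof(float)>>>(
        devIn, n, devBlockMax);

    std::vector<float> blockMax(blocks);
    cudaMemcpy(blockMax.data(), devBlockMax, blocks * sizeof(float),
               cudaMemcpyDeviceToHost);
    cudaFree(devIn);
    cudaFree(devBlockMax);

    // Finish on the host: at most 64 partial maxima remain.
    float m = blockMax[0];
    for (int b = 1; b < blocks; ++b)
        if (blockMax[b] > m) m = blockMax[b];
    return m;
}
```

Finishing the last few partial maxima on the host keeps the sketch short; a second kernel launch over blockMax would be an alternative if the result must stay on the device.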


This answer was accepted by users.