Support Quantization #996

Open
qinyiqun wants to merge 17 commits into InfiniTensor:demo131 from qinyiqun:demo131_quant

Conversation

@qinyiqun
Collaborator

No description provided.

@qinyiqun qinyiqun requested a review from a team January 29, 2026 09:54
for (int ind = threadIdx.x; ind < K; ind += BLOCK_SIZE) {
thread_min = fminf(thread_min, (float)x[tid + ind]);
}
float local_min = BlockReduce(temp_storage).Reduce(thread_min, cub::Min());
Collaborator

Image

All cub calls need to be guarded.
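
For context on the hunk this comment is attached to: the loop gives each thread a strided slice of one row, and the block-wide reduce then combines the per-thread partial minima. Below is a minimal host-side C++ sketch of the same computation. It assumes `thread_min` starts at `FLT_MAX` and that `tid` is the row's starting offset (neither is visible in the hunk). `row_min_reference` is a hypothetical name, not part of the PR.

```cpp
#include <algorithm>
#include <cfloat>
#include <vector>

// Host-side reference for the per-row minimum computed by the kernel hunk.
// row_min_reference is a hypothetical helper; block_size mirrors BLOCK_SIZE
// and row_offset mirrors tid from the hunk. Assumes block_size > 0.
float row_min_reference(const std::vector<float> &x, int row_offset, int K, int block_size) {
    // Phase 1: each simulated thread t scans indices t, t + block_size, ...
    // Starting from FLT_MAX is an assumption; the hunk does not show the initial value.
    std::vector<float> thread_min(block_size, FLT_MAX);
    for (int t = 0; t < block_size; ++t) {
        for (int ind = t; ind < K; ind += block_size) {
            thread_min[t] = std::min(thread_min[t], x[row_offset + ind]);
        }
    }
    // Phase 2: combine the per-thread partial minima, which is the role
    // the block-wide reduce plays on the GPU.
    return *std::min_element(thread_min.begin(), thread_min.end());
}
```

A reference like this can be used to check the kernel's per-row output on small inputs, independently of how the cub call ends up being guarded.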

Collaborator

Is the plan for now that none of the quantization operators will support the compute graph?


void linear_w4a16_awq_(Tensor out, Tensor input, Tensor weight_packed, Tensor weight_scale, Tensor weight_zeros, std::optional<Tensor> bias);

} // namespace infinicore::op
(No newline at end of file)
Collaborator

Missing blank line at the end of the file.

DequantizeAWQ::dispatcher().registerAll(&calculate, false);
return true;
}();
} // namespace infinicore::op::dequantize_awq_impl::infiniop
(No newline at end of file)
Collaborator

Missing blank line at the end of the file.

Collaborator

These two files are swapped.
To support the compute graph, model the implementation on a neighboring operator such as gemm.

Collaborator

These are swapped as well; also consider writing them in a way that supports the compute graph.

Collaborator

These are swapped as well; also consider writing them in a way that supports the compute graph.

Collaborator

Don't the other two need to be added as well?

3 participants