Open
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
描述
Infinicore赛题T1-1-12: 算子minimum、atan2、addcdiv、bucketize、binary_cross_entropy。GPU使用ninetoothed实现,ntops仓库pr地址。记录
atan2定义计算常量,使用多项式逼近结果,投射到四个象限计算,计算密集型目前cpu端没有什么好的优化手段boundaries测试脚本是无序测试,无序测试nvidia返回的无定义与Pytorch无定义无法对比,强制修改测试输入为有序binary_cross_entropy的ntops实现中在torch中构造标量张量进行计算会有精度无法对齐情况,目前.to()接口只支持换设备,不支持换类型,设置多个kernels进行计算测试截图
署名
HONOR_CODE.md
REFERENCE.md