Skip to content

[2025秋季][T1-1-12] PPPoint-t#1005

Open
PPPoint-t wants to merge 3 commits intoInfiniTensor:mainfrom
PPPoint-t:2025-autumn-PPPoint-t-T1-1-12
Open

[2025秋季][T1-1-12] PPPoint-t#1005
PPPoint-t wants to merge 3 commits intoInfiniTensor:mainfrom
PPPoint-t:2025-autumn-PPPoint-t-T1-1-12

Conversation

@PPPoint-t
Copy link

描述

Infinicore 赛题 T1-1-12 : 算子 minimumatan2addcdivbucketizebinary_cross_entropy
GPU 使用 ninetoothed 实现,ntops 仓库pr地址

记录

  1. atan2 定义计算常量,使用多项式逼近结果,投射到四个象限计算,计算密集型目前 cpu 端没有什么好的优化手段
  2. boundaries 测试脚本是无序测试,无序测试 nvidia 返回的无定义与 Pytorch 无定义无法对比,强制修改测试输入为有序
  3. binary_cross_entropyntops 实现中在 torch 中构造标量张量进行计算会有精度无法对齐情况,目前 .to() 接口只支持换设备,不支持换类型,设置多个 kernels 进行计算

测试截图

算子 CPU NVIDIA
minimum image image
atan2 image image
addcdiv image image
bucketize image image
binary_cross_entropy image image

署名

HONOR_CODE.md
REFERENCE.md

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant