Dun Liang 2020-03-20 17:35:53 +08:00
parent bb16582683
commit 11c6a89dcc
1 changed file with 5 additions and 1 deletion


@@ -249,4 +249,8 @@ with open(report[1][1], 'r') as f:
Even faster than the previous implementation! From the output we can look at the function definition of `func0`. This is the main code of our convolution kernel, which is generated just-in-time. Because the compiler knows the shapes of the kernel, more optimizations are applied.
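To illustrate why knowing the kernel shape helps, here is a minimal sketch of shape-specialized code generation in plain Python. It is not Jittor's code generator: `make_conv1d` is a hypothetical helper that emits a 1D sliding-window product (no kernel flip) with the inner loop fully unrolled, which is only possible because the kernel size is known when the source is generated.

```python
def make_conv1d(kernel_size):
    """Generate a 1D 'valid' convolution specialized for one kernel size.

    Hypothetical illustration only; Jittor's real generated code is C++.
    """
    # The kernel size is a constant at generation time, so the inner loop
    # over the kernel can be written out term by term (fully unrolled).
    terms = " + ".join(f"x[i + {k}] * w[{k}]" for k in range(kernel_size))
    src = (
        "def conv1d(x, w):\n"
        f"    return [{terms} for i in range(len(x) - {kernel_size - 1})]\n"
    )
    namespace = {}
    exec(src, namespace)  # compile the specialized source at run time
    return namespace["conv1d"], src


conv3, source = make_conv1d(3)
print(source)  # inspect the generated function, much like reading func0
result = conv3([1, 2, 3, 4, 5], [1, 0, -1])
print(result)
```

Printing `source` plays the same role as reading `func0` in the tutorial output: the constants baked into the generated text show what the generator knew ahead of time.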
In this tutorial, Jittor simply demonstrates the use of meta-operators; it is not a real performance test, so a relatively small data size is used. If you need a performance test, set `jt.flags.enable_tuner = 1`, which makes Jittor try to use dedicated hardware libraries for acceleration.
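For an actual measurement, a small timing harness along the following lines may help. This is a sketch under assumptions: `benchmark` is a hypothetical helper, not part of Jittor, and the warm-up calls are there because the first call to a just-in-time compiled operator can include compilation time.

```python
import time


def benchmark(fn, warmup=1, repeat=10):
    """Return the mean wall-clock seconds per call of fn, excluding warm-up."""
    for _ in range(warmup):
        fn()  # warm-up: with a JIT, early calls may include compilation
    start = time.perf_counter()
    for _ in range(repeat):
        fn()
    return (time.perf_counter() - start) / repeat


# Hypothetical usage with Jittor (assumes jittor is installed; not run here):
#   import jittor as jt
#   jt.flags.enable_tuner = 1        # flag named in the tutorial text
#   seconds = benchmark(run_my_conv) # run_my_conv: your own function that
#                                    # runs the operator and forces evaluation
```

When timing, make sure the function being measured actually forces the result to be computed; otherwise the measurement may cover only the cost of building the computation.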