Commit Graph

341 Commits

Author SHA1 Message Date
xiaying 8ba2c49380 Fix bug for binary not equal float not imp 2024-04-19 12:54:34 +08:00
xiaying 5895243607 [MNN:Sync] Sync Internal 2.8.4 2024-04-19 11:58:21 +08:00
zhaode.wzd 2972fe71dc [MNN:Sync] Sync Internal 2.8.3 2024-03-13 14:55:54 +08:00
zhaode.wzd 970b63f3b4 [MNN:Sync] Sync Internal 2.8.2 2024-02-29 16:21:40 +08:00
zhaode.wzd 3b978d9d16 [MNN:Sync] Sync Internal 2.8.1 2023-12-27 17:26:44 +08:00
zhaode.wzd 72fa060815 [MNN::Bugfix] Some bugfix sync. 2023-12-12 10:49:15 +08:00
zhaode.wzd 387775be2a [MNN:Sync] Sync Internal 2.8.0 2023-12-04 11:19:10 +08:00
jxt1234 8d5d8b8844
Merge pull request #2645 from roachsinai/patch-1
fix: compile error: ‘int32_t’ is not a member of ‘std’
2023-11-15 15:39:01 +08:00
RoachZhao 476ee10e74
fix: compile error: ‘int32_t’ is not a member of ‘std’
> error: ‘int32_t’ is not a member of ‘std’; did you mean ‘int32_t’?
2023-11-03 17:55:20 +08:00
jokerz0624 3bf85d63b9 feat(iOS): add Apple A17 Pro support in CPU family 2023-10-23 18:54:57 +08:00
xiaying 476083aa8b [MNN:Bugfix] Fix bug for avx512 compile 2023-10-20 14:55:10 +08:00
xiaying b019bef591 [MNN:Bugfix] Fix bug of no sse poolgrad 2023-10-19 10:12:55 +08:00
xiaying a8c1f1ac20 [MNN:Bugfix] Fix compile bug of ios from CPUBinaryInt8 2023-10-18 11:36:32 +08:00
xiaying 067d0be896 [MNN:Bugfix] Fix bug for ios compile error of unary int8 2023-10-18 11:20:19 +08:00
xiaying 3ff49cbf4a [MNN:Sync] Sync Internal 2.7.2 2023-10-18 10:31:02 +08:00
zhaode.wzd bdf15442f4 [MNN:Sync] Sync Internal 2.7.1 2023-09-21 09:01:35 +08:00
xiaying ea4f13d3cf [MNN:Sync] Sync Internal 2.7.0 2023-09-04 10:42:11 +08:00
xiaying 98ba00c2f3 [MNN:Sync] Sync Internal 2.6.3 2023-08-21 14:53:27 +08:00
xiaying 1115d2f96f [MNN:Bugfix] Bugfix for sparse option not open for build ios package 2023-08-15 19:19:49 +08:00
xiaying 84d6bd75ae [MNN:Sync] Sync Internal 2.6.2 2023-07-31 14:24:48 +08:00
xiaying 78c664ede9 Fix bug for arm82 run fastrcnn crash 2023-07-23 11:32:13 +08:00
xiaying ac5b331845 [MNN:Sync] Sync Internal Gitlab 2023-07-18 09:36:26 +08:00
xiaying e1ae448030 [MNN:Bugfix] Fix bug for windows compiler error of ResizeFunction 2023-07-05 13:08:33 +08:00
xiaying 69130ebd78 [MNN:Sync] Sync Internal 2.6.0 2023-07-05 11:44:25 +08:00
zhaode.wzd 67eceb8abb [MNN:Sync] Sync Internal code, support low_memory for conv. 2023-06-27 10:33:16 +08:00
xiaying 930a9345c1 [MNN:Sync] Sync Internal 2.5.3 2023-06-16 09:42:45 +08:00
xiaying c70ecef660 [MNN:Sync] Sync Internal Gitlab: 2.5.1 2023-05-18 19:11:50 +08:00
zhaode.wzd 9fe780308c [MNN:Sync] Sync Internal 2.5.0 2023-04-27 15:11:05 +08:00
xiaying 39e2e7e45f [MNN:Sync] Sync Internal Gitlab 2.4.3 2023-04-18 18:54:46 +08:00
Colin Ian King 6fc74e29a0 Only yield at the end if the completion loop needs another pass
Currently the yield is occuring every time a completion loop
iterates and this is quite an expensive kernel system call. It is
not really required if we break out of the loop, so move the
yield to the end of the do-while loop to reduce the yielding overhead

Perf metrics show that the current code eats up ~2.4% CPU yielding
whereas this change reduces this down to ~0.6% of the total CPU run
time.

Signed-off-by: Colin Ian King <colin.king@intel.com>
2023-04-14 15:57:16 +01:00
xiaying f26fc2efd8 [MNN:Sync] Sync 2.4.2 2023-04-11 11:12:00 +08:00
xiaying 7629ba674e [MNN:Sync] Sync Internal 2.4.1 2023-03-20 11:32:29 +08:00
xiaying 4e2ad365e8 [MNN:Sync] Sync Internal Gitlab 2023-02-28 10:41:24 +08:00
王召德 86080f24c1
Merge pull request #2226 from DaydreamCoding/feature/fix_msvc_bf16
MSVC adapt for BF16
2023-02-17 10:09:56 +08:00
xiaying 4a609006eb [MNN:Sync] Sync Internal 2.3.1 2023-02-15 10:30:27 +08:00
xiaying d46b6b998d [MNN:Sync] Sync Internal 2.3.0 2022-12-30 15:18:58 +08:00
xiaying b1f5664ced [MNN:Internal] Sync to 2.2.3 2022-12-24 09:42:39 +08:00
xiaying ad5d243c9f [MNN:Sync] A few bugfixes
1. 支持 Onnx If 空子图的情况(这种情况是条件判断一定为真或假)
    2. 修正 Where 算子在 zeroshape 下维度计算出错的问题
    3. 修正 Reduce 计算 zeroshape 的非 prod 情况
    4. 修正 arch64-linux 上编译错误
    5. 修正 头文件 NNAPI 的注释错误
    6, 部分训练相关问题修正
2022-12-04 15:17:36 +08:00
xiaying 8908e80d44 [Sync] Sync Internal 2.2.2 2022-11-18 22:35:31 +08:00
zhaode.wzd c683c5c6c2 [Sync] Sync Internal Gitlab 2.2.1 2022-11-08 17:05:14 +08:00
xiaying acb3bb6c62 [Sync] Sync Internal Gitlab 2.2.0 2022-10-30 08:44:24 +08:00
xiaying db53f951e6 [Sync] Sync Internal 2.1.2 2022-09-30 10:02:52 +08:00
wtiandong 2768a397bd Merge remote-tracking branch 'origin/master' into interp3D 2022-09-27 17:13:37 +08:00
wtiandong 9e284352b5 merge interp3d_op_param into interp_op_param
merge interp3d_op_param into interp_op_param
2022-09-27 17:07:23 +08:00
jokerz0624 71d0975d37 improvement(TensorConvert): accelerate MNNPackTranspose with SIMD when channel=3 2022-09-24 16:49:20 +08:00
jxt1234 f001a65c81
Merge pull request #2051 from DaydreamCoding/patch-12
Fix ThreadPool behavior
2022-09-19 10:47:41 +08:00
jokerz0624 62d529379b feat(iOS/macOS): add Apple A16 and M2 support in CPU family 2022-09-18 12:58:17 +08:00
DaydreamCoding d5e08a913f Fix ThreadPool behavior : Pipeline::encode may anr when other session call ThreadPool::active()
Pipeline::encode -> GeometryComputerUtils::shapeComputeAndGeometryTransform -> onExecute
2022-09-08 14:37:31 +08:00
wtiandong 71aae927ff Add Interp3D Support
1. add PyTorch interpolation 3D to Onnx to MNN converter
2. add interpolation3D nearest CPU/OpenCL implementation

all added OPs are verified
update opencl_program.cc

update opencl_program.cc
2022-09-07 15:56:08 +08:00
jxt1234 41db47f2c6
Merge pull request #2038 from DaydreamCoding/patch-11
Fix VecHalf.hpp not include neon header
2022-09-05 10:33:40 +08:00