xiaying
8ba2c49380
Fix bug for binary not equal float not imp
2024-04-19 12:54:34 +08:00
xiaying
5895243607
[MNN:Sync] Sync Internal 2.8.4
2024-04-19 11:58:21 +08:00
zhaode.wzd
2972fe71dc
[MNN:Sync] Sync Internal 2.8.3
2024-03-13 14:55:54 +08:00
zhaode.wzd
970b63f3b4
[MNN:Sync] Sync Internal 2.8.2
2024-02-29 16:21:40 +08:00
zhaode.wzd
3b978d9d16
[MNN:Sync] Sync Internal 2.8.1
2023-12-27 17:26:44 +08:00
zhaode.wzd
72fa060815
[MNN::Bugfix] Some bugfix sync.
2023-12-12 10:49:15 +08:00
zhaode.wzd
387775be2a
[MNN:Sync] Sync Internal 2.8.0
2023-12-04 11:19:10 +08:00
jxt1234
8d5d8b8844
Merge pull request #2645 from roachsinai/patch-1
...
fix: compile error: ‘int32_t’ is not a member of ‘std’
2023-11-15 15:39:01 +08:00
RoachZhao
476ee10e74
fix: compile error: ‘int32_t’ is not a member of ‘std’
...
> error: ‘int32_t’ is not a member of ‘std’; did you mean ‘int32_t’?
2023-11-03 17:55:20 +08:00
jokerz0624
3bf85d63b9
feat(iOS): add Apple A17 Pro support in CPU family
2023-10-23 18:54:57 +08:00
xiaying
476083aa8b
[MNN:Bugfix] Fix bug for avx512 compile
2023-10-20 14:55:10 +08:00
xiaying
b019bef591
[MNN:Bugfix] Fix bug of no sse poolgrad
2023-10-19 10:12:55 +08:00
xiaying
a8c1f1ac20
[MNN:Bugfix] Fix compile bug of ios from CPUBinaryInt8
2023-10-18 11:36:32 +08:00
xiaying
067d0be896
[MNN:Bugfix] Fix bug for ios compile error of unary int8
2023-10-18 11:20:19 +08:00
xiaying
3ff49cbf4a
[MNN:Sync] Sync Internal 2.7.2
2023-10-18 10:31:02 +08:00
zhaode.wzd
bdf15442f4
[MNN:Sync] Sync Internal 2.7.1
2023-09-21 09:01:35 +08:00
xiaying
ea4f13d3cf
[MNN:Sync] Sync Internal 2.7.0
2023-09-04 10:42:11 +08:00
xiaying
98ba00c2f3
[MNN:Sync] Sync Internal 2.6.3
2023-08-21 14:53:27 +08:00
xiaying
1115d2f96f
[MNN:Bugfix] Bugfix for sparse option not open for build ios package
2023-08-15 19:19:49 +08:00
xiaying
84d6bd75ae
[MNN:Sync] Sync Internal 2.6.2
2023-07-31 14:24:48 +08:00
xiaying
78c664ede9
Fix bug for arm82 run fastrcnn crash
2023-07-23 11:32:13 +08:00
xiaying
ac5b331845
[MNN:Sync] Sync Internal Gitlab
2023-07-18 09:36:26 +08:00
xiaying
e1ae448030
[MNN:Bugfix] Fix bug for windows compiler error of ResizeFunction
2023-07-05 13:08:33 +08:00
xiaying
69130ebd78
[MNN:Sync] Sync Internal 2.6.0
2023-07-05 11:44:25 +08:00
zhaode.wzd
67eceb8abb
[MNN:Sync] Sync Internal code, support low_memory for conv.
2023-06-27 10:33:16 +08:00
xiaying
930a9345c1
[MNN:Sync] Sync Internal 2.5.3
2023-06-16 09:42:45 +08:00
xiaying
c70ecef660
[MNN:Sync] Sync Internal Gitlab: 2.5.1
2023-05-18 19:11:50 +08:00
zhaode.wzd
9fe780308c
[MNN:Sync] Sync Internal 2.5.0
2023-04-27 15:11:05 +08:00
xiaying
39e2e7e45f
[MNN:Sync] Sync Internal Gitlab 2.4.3
2023-04-18 18:54:46 +08:00
Colin Ian King
6fc74e29a0
Only yield at the end if the completion loop needs another pass
...
Currently the yield is occuring every time a completion loop
iterates and this is quite an expensive kernel system call. It is
not really required if we break out of the loop, so move the
yield to the end of the do-while loop to reduce the yielding overhead
Perf metrics show that the current code eats up ~2.4% CPU yielding
whereas this change reduces this down to ~0.6% of the total CPU run
time.
Signed-off-by: Colin Ian King <colin.king@intel.com>
2023-04-14 15:57:16 +01:00
xiaying
f26fc2efd8
[MNN:Sync] Sync 2.4.2
2023-04-11 11:12:00 +08:00
xiaying
7629ba674e
[MNN:Sync] Sync Internal 2.4.1
2023-03-20 11:32:29 +08:00
xiaying
4e2ad365e8
[MNN:Sync] Sync Internal Gitlab
2023-02-28 10:41:24 +08:00
王召德
86080f24c1
Merge pull request #2226 from DaydreamCoding/feature/fix_msvc_bf16
...
MSVC adapt for BF16
2023-02-17 10:09:56 +08:00
xiaying
4a609006eb
[MNN:Sync] Sync Internal 2.3.1
2023-02-15 10:30:27 +08:00
xiaying
d46b6b998d
[MNN:Sync] Sync Internal 2.3.0
2022-12-30 15:18:58 +08:00
xiaying
b1f5664ced
[MNN:Internal] Sync to 2.2.3
2022-12-24 09:42:39 +08:00
xiaying
ad5d243c9f
[MNN:Sync] A few bugfixes
...
1. 支持 Onnx If 空子图的情况(这种情况是条件判断一定为真或假)
2. 修正 Where 算子在 zeroshape 下维度计算出错的问题
3. 修正 Reduce 计算 zeroshape 的非 prod 情况
4. 修正 arch64-linux 上编译错误
5. 修正 头文件 NNAPI 的注释错误
6, 部分训练相关问题修正
2022-12-04 15:17:36 +08:00
xiaying
8908e80d44
[Sync] Sync Internal 2.2.2
2022-11-18 22:35:31 +08:00
zhaode.wzd
c683c5c6c2
[Sync] Sync Internal Gitlab 2.2.1
2022-11-08 17:05:14 +08:00
xiaying
acb3bb6c62
[Sync] Sync Internal Gitlab 2.2.0
2022-10-30 08:44:24 +08:00
xiaying
db53f951e6
[Sync] Sync Internal 2.1.2
2022-09-30 10:02:52 +08:00
wtiandong
2768a397bd
Merge remote-tracking branch 'origin/master' into interp3D
2022-09-27 17:13:37 +08:00
wtiandong
9e284352b5
merge interp3d_op_param into interp_op_param
...
merge interp3d_op_param into interp_op_param
2022-09-27 17:07:23 +08:00
jokerz0624
71d0975d37
improvement(TensorConvert): accelerate MNNPackTranspose with SIMD when channel=3
2022-09-24 16:49:20 +08:00
jxt1234
f001a65c81
Merge pull request #2051 from DaydreamCoding/patch-12
...
Fix ThreadPool behavior
2022-09-19 10:47:41 +08:00
jokerz0624
62d529379b
feat(iOS/macOS): add Apple A16 and M2 support in CPU family
2022-09-18 12:58:17 +08:00
DaydreamCoding
d5e08a913f
Fix ThreadPool behavior : Pipeline::encode may anr when other session call ThreadPool::active()
...
Pipeline::encode -> GeometryComputerUtils::shapeComputeAndGeometryTransform -> onExecute
2022-09-08 14:37:31 +08:00
wtiandong
71aae927ff
Add Interp3D Support
...
1. add PyTorch interpolation 3D to Onnx to MNN converter
2. add interpolation3D nearest CPU/OpenCL implementation
all added OPs are verified
update opencl_program.cc
update opencl_program.cc
2022-09-07 15:56:08 +08:00
jxt1234
41db47f2c6
Merge pull request #2038 from DaydreamCoding/patch-11
...
Fix VecHalf.hpp not include neon header
2022-09-05 10:33:40 +08:00