jxt1234
a65c57c3d0
Merge pull request #1772 from alibaba/feature/sync
...
[MNN:Sync] Sync internal gitlab
2021-11-30 10:16:48 +08:00
xiaying
69dba73dc7
[MNN:Sync] Sync internal gitlab
...
Main Feature:
1. Add OpenCV API and Numpy API Support
2. Protobuf move into MNN
3. Add more ops for TorchScript conversion
4. Add recompute to speed up geometry compute
5. Add ModuleBasic Test
2021-11-30 10:10:53 +08:00
xiaying
71cd04e91c
Fix compile bug for sse fma
2021-11-19 10:23:50 +08:00
jxt1234
d099857714
Merge pull request #1756 from no5-aaron-wu/dev_aaron_wu
...
Add SSE instruction set optimization for ROIAlign and ROIPooling op
2021-11-18 20:10:26 +08:00
aaron-wu
f995ca6a8f
fix(op): replace _mm_load_ps and _mm_store_ps with _mm_loadu_ps and _mm_storeu_ps to avoid segmentation faults on unaligned addresses
2021-11-16 16:07:50 +08:00
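The aligned-versus-unaligned distinction behind the fix above can be sketched as follows. This is a minimal illustration, not the code from the commit: the function name scaleUnaligned and its parameters are hypothetical, and the scalar fallback is only there so the sketch also builds on non-x86 targets.

```cpp
#include <cstddef>
#if defined(__SSE__) || defined(_M_X64)
#include <xmmintrin.h>
#endif

// Hypothetical helper (not from MNN): multiplies n floats by `factor`.
// src and dst may have any alignment.
void scaleUnaligned(const float* src, float* dst, std::size_t n, float factor) {
    std::size_t i = 0;
#if defined(__SSE__) || defined(_M_X64)
    const __m128 f = _mm_set1_ps(factor);
    for (; i + 4 <= n; i += 4) {
        // _mm_load_ps / _mm_store_ps require 16-byte-aligned addresses and
        // may fault otherwise; the *u variants accept any address.
        __m128 v = _mm_loadu_ps(src + i);
        _mm_storeu_ps(dst + i, _mm_mul_ps(v, f));
    }
#endif
    // Scalar tail (and full fallback when SSE is unavailable).
    for (; i < n; ++i) {
        dst[i] = src[i] * factor;
    }
}
```

Passing a pointer such as buf + 1 is a typical way an address ends up off a 16-byte boundary even when buf itself is aligned.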
aaron-wu
e35ea54638
feat(op): Add SSE instruction set optimization for ROIAlign and ROIPooling op
2021-11-15 14:53:12 +08:00
xiaying
95402e79b4
[MNN:Bugfix] Fix Compile bug for other backends
2021-11-12 17:49:50 +08:00
jxt1234
8f934c28c6
Merge pull request #1754 from cmdbug/master
...
[fix] fix typo
2021-11-12 17:18:48 +08:00
jxt1234
e86c0ba30a
Merge pull request #1746 from no5-aaron-wu/dev_aaron_wu
...
add CPUROIAlign op and unit-test and so on
2021-11-12 17:13:04 +08:00
teng
c2e81a6cf3
[fix] fix typo
...
种 -> 中
2021-11-12 13:57:56 +08:00
teng
6c56682327
[fix] fix typo
...
MacOS -> macOS
2021-11-12 10:30:06 +08:00
teng
b7e6705ad5
[fix] fix typo
...
MacOS -> macOS
2021-11-12 10:29:44 +08:00
xiaying
361bbc90d5
Fix bug where the DenseConvolutionTiledExecutor optimization did not handle width = 1 with kernel X > 1 and padX > 0
2021-11-12 09:56:59 +08:00
xiaying
0bcc70922d
[MNN:Bugfix] Fix compile bug for gnu of arm82 /bf16
2021-11-10 17:52:30 +08:00
aaron-wu
074bf5e275
fix(op): add assert to var samplingRatioW and samplingRatioH
2021-11-09 11:20:22 +08:00
aaron-wu
8e773602bf
fix(schema): merge parameters for RoiPooling and RoiAlign into one table as RoiParameters
2021-11-09 11:11:27 +08:00
aaron-wu
7afb6abd1b
fix(op): precalculate pos and area, which are shared by all channels; add defensive programming for boundary cases
2021-11-09 10:00:51 +08:00
aaron-wu
094d5697ae
feat(op): add NEON implementation of CPUROIAlign op
2021-11-09 10:00:51 +08:00
aaron-wu
1af7d6f4d1
fix(op): fix compile error on Linux
2021-11-09 10:00:50 +08:00
aaron-wu
cfac71f919
feat(op): add CPUROIAlign op and unit test
2021-11-09 10:00:50 +08:00
aaron-wu
f4a5314780
fix(test): add new test case in ROIPoolingTest.cpp
2021-11-09 10:00:50 +08:00
aaron-wu
9f26b548b3
feat(convert): add ROIAlignOnnx convert (for the ONNX model file exported from torchvision.ops.roi_align)
2021-11-09 10:00:50 +08:00
jxt1234
d983b77ef2
Merge pull request #1743 from Napoleon-Jm/dev
...
fix: cli param keep input format.
2021-11-05 11:18:15 +08:00
xiaying
75413768b0
Fix bug for onResize of CPURNNSequenceGRU
2021-11-04 12:55:59 +08:00
恺心
a0b14095c2
fix: cli param keep input format.
2021-11-03 14:22:50 +08:00
xiaying
2fdd11e718
[MNN:Bugfix] Use fabsf instead of abs
2021-11-02 12:06:10 +08:00
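The commit above does not state its motivation. One common reason for this kind of change, assumed here, is that the C abs takes an int, so a float argument is truncated before the absolute value is taken. The two hypothetical helpers below (not MNN code) make the difference visible:

```cpp
#include <math.h>
#include <stdlib.h>

// Mimics passing a float to the integer abs: the value is converted
// to int first, so the fractional part is lost.
float absViaInt(float x) {
    return static_cast<float>(abs(static_cast<int>(x)));
}

// fabsf works on float directly and keeps the fractional part.
float absViaFabsf(float x) {
    return fabsf(x);
}
```

For an input of -0.75f the integer path yields 0 while fabsf yields 0.75f.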
xiaying
06d3d23c57
Temporarily turn depthmulti with inputCount = 1 into convolution
2021-11-02 11:59:28 +08:00
xiaying
87b76f594c
[Doc:Bugfix] Fix pic path error
2021-11-02 11:42:58 +08:00
jxt1234
0b69ba78d2
Merge pull request #1739 from jun-lv-17/fix-depthwiseconvint8-issue
...
Fix conv1d depthwise conv int8 calculation issue.
2021-11-02 11:39:32 +08:00
xiaying
b1d923e76c
Fix compile bug for bf16 when SSE / NEON is disabled
2021-11-02 11:34:14 +08:00
xiaying
ed8a2da0b4
[MNN:Bugfix] Fix bug for CPURaster for fuse singleConvert of dim == 3
2021-11-02 10:56:35 +08:00
xiaying
d199ee6f49
Fix small computation error in OnnxScatterND
2021-11-02 10:55:15 +08:00
xiaying
0fdb9d768f
Add Clamp for fp32 -> fp16
2021-11-01 14:25:34 +08:00
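A clamp before an fp32 to fp16 conversion typically keeps large values from overflowing to infinity. The sketch below assumes the bounds are the largest finite IEEE 754 half-precision values (±65504); the commit does not say which bounds it uses, and clampToHalfRange is a hypothetical name, not an MNN function.

```cpp
#include <algorithm>

// Largest finite value representable in IEEE 754 binary16 (assumed bound).
constexpr float kHalfMax = 65504.0f;

// Hypothetical helper: restricts x to the finite fp16 range so that a
// later fp32 -> fp16 conversion cannot produce +/- infinity.
float clampToHalfRange(float x) {
    return std::min(std::max(x, -kHalfMax), kHalfMax);
}
```

Values already inside the range pass through unchanged, so the clamp only affects inputs that would otherwise overflow.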
jxt1234
77b1d0ed71
Merge pull request #1738 from no5-aaron-wu/dev_aaron_wu
...
increase compatibility of NCHW format for inputs[1](rois) in CPUROIPooling op
2021-10-30 17:26:44 +08:00
aaron-wu
9acad284fa
fix(op): increase compatibility of NCHW format for inputs[1](rois) in CPUROIPooling op
2021-10-30 15:18:38 +08:00
jun.lv
0b299e951c
Fix conv1d depthwise conv int8 calculation issue.
2021-10-29 18:58:58 +08:00
jxt1234
67f8bc94ef
Merge pull request #1735 from no5-aaron-wu/dev_aaron_wu
...
add onnx convert and unit test for ROIPooling OP
2021-10-29 14:43:22 +08:00
xiaying
8caa162f87
Fix ConvDilateFuse bug for width pad not equal height pad
2021-10-29 11:37:10 +08:00
wuhao
fcd747ed7a
feat(test): add unit test for ROIPooling
2021-10-29 10:51:59 +08:00
insta360
cca07fdf98
feat(convert): add ROIPoolingOnnx convert (for the ONNX model file exported from torchvision.ops.roi_pool)
2021-10-29 10:26:05 +08:00
xiaying
19c2df11f5
[MNN:Bugfix] Fix compile bug for ios arm82
2021-10-25 11:10:44 +08:00
jxt1234
e121c1527a
Merge pull request #1718 from jokerz0624/acc/GridSample
...
improvement(GridSample): handle areaRemain better on Arm82
2021-10-25 10:56:14 +08:00
jxt1234
81c3d0f791
Merge pull request #1719 from DaydreamCoding/patch-9
...
fix MNN_BUILD_TORCH without MNN_BUILD_SHARED_LIBS
2021-10-25 10:55:51 +08:00
jxt1234
70cd0c5b27
Merge pull request #1724 from DaydreamCoding/patch-10
...
fix memory leak
2021-10-25 10:55:21 +08:00
jxt1234
410b1b38df
Merge pull request #1731 from jokerz0624/feat/arm82_fma_new
...
improvement(ConvWino): use fma to accelerate computation
2021-10-25 10:50:34 +08:00
Joker
af9c543115
improvement(ConvWino): use fma to accelerate computation
2021-10-22 14:24:29 +08:00
xiaying
da3688119d
[MNN:Bugfix] Fix shape computation and content bug for rnngru with batch > 1
2021-10-20 11:56:16 +08:00
DaydreamCoding
0ec11813f0
fix memory leak
2021-10-15 13:39:20 +08:00
xiaying
7f50ae689d
Fix zero shape bug for TensorArray
2021-10-14 15:00:01 +08:00
xiaying
a469efe8ad
Add MNNRevert2Buffer
2021-10-13 14:16:01 +08:00