1159 Commits

Author SHA1 Message Date
jxt1234 a65c57c3d0
Merge pull request #1772 from alibaba/feature/sync
[MNN:Sync] Sync internal gitlab
2021-11-30 10:16:48 +08:00
xiaying 69dba73dc7 [MNN:Sync] Sync internal gitlab
Main Feature:
1. Add OpenCV API and Numpy API Support
2. Protobuf move into MNN
3. Add more op for torchscript convert
4. Add recompute to speed up geometry compute
5. Add ModuleBasic Test
2021-11-30 10:10:53 +08:00
xiaying 71cd04e91c Fix compile bug for sse fma 2021-11-19 10:23:50 +08:00
jxt1234 d099857714
Merge pull request #1756 from no5-aaron-wu/dev_aaron_wu
Add SSE instruction set optimization for ROIAlign and ROIPooling op
2021-11-18 20:10:26 +08:00
aaron-wu f995ca6a8f fix(op): replace _mm_load_ps and _mm_store_ps with _mm_loadu_ps and _mm_storeu_ps to avoid segmentation faults on unaligned memory 2021-11-16 16:07:50 +08:00
aaron-wu e35ea54638 feat(op): Add SSE instruction set optimization for ROIAlign and ROIPooling op 2021-11-15 14:53:12 +08:00
xiaying 95402e79b4 [MNN:Bugfix] Fix Compile bug for other backends 2021-11-12 17:49:50 +08:00
jxt1234 8f934c28c6
Merge pull request #1754 from cmdbug/master
[fix] fix typo
2021-11-12 17:18:48 +08:00
jxt1234 e86c0ba30a
Merge pull request #1746 from no5-aaron-wu/dev_aaron_wu
add CPUROIAlign op and unit-test and so on
2021-11-12 17:13:04 +08:00
teng c2e81a6cf3
[fix] fix typo
种 -> 中
2021-11-12 13:57:56 +08:00
teng 6c56682327
[fix] fix typo
MacOS -> macOS
2021-11-12 10:30:06 +08:00
teng b7e6705ad5
[fix] fix typo
MacOS -> macOS
2021-11-12 10:29:44 +08:00
xiaying 361bbc90d5 Fix bug in DenseConvolutionTiledExecutor opt, which did not handle width = 1 with kernel X > 1 and padX > 0 2021-11-12 09:56:59 +08:00
xiaying 0bcc70922d [MNN:Bugfix] Fix compile bug for gnu of arm82 /bf16 2021-11-10 17:52:30 +08:00
aaron-wu 074bf5e275 fix(op): add assert to var samplingRatioW and samplingRatioH 2021-11-09 11:20:22 +08:00
aaron-wu 8e773602bf fix(schema): merge parameters for RoiPooling and RoiAlign into one table as RoiParameters 2021-11-09 11:11:27 +08:00
aaron-wu 7afb6abd1b fix(op): precalculate pos and area, which are shared by all channels; add defensive programming for boundary cases 2021-11-09 10:00:51 +08:00
aaron-wu 094d5697ae feat(op): add NEON implementation of CPUROIAlign op 2021-11-09 10:00:51 +08:00
aaron-wu 1af7d6f4d1 fix(op): fix compile error in linux system 2021-11-09 10:00:50 +08:00
aaron-wu cfac71f919 feat(op): add CPUROIAlign op and unit test 2021-11-09 10:00:50 +08:00
aaron-wu f4a5314780 fix(test): add new test case in ROIPoolingTest.cpp 2021-11-09 10:00:50 +08:00
aaron-wu 9f26b548b3 feat(convert): add ROIAlignOnnx convert(the onnx model file exported from torchvision.ops.roi_align) 2021-11-09 10:00:50 +08:00
jxt1234 d983b77ef2
Merge pull request #1743 from Napoleon-Jm/dev
fix: cli param keep input format.
2021-11-05 11:18:15 +08:00
xiaying 75413768b0 Fix bug for onResize of CPURNNSequenceGRU 2021-11-04 12:55:59 +08:00
恺心 a0b14095c2 fix: cli param keep input format. 2021-11-03 14:22:50 +08:00
xiaying 2fdd11e718 [MNN:Bugfix] Use fabsf instead of abs 2021-11-02 12:06:10 +08:00
xiaying 06d3d23c57 Temporarily turn inputCount = 1's depthmulti to convolution 2021-11-02 11:59:28 +08:00
xiaying 87b76f594c [Doc:Bugfix] Fix pic path error 2021-11-02 11:42:58 +08:00
jxt1234 0b69ba78d2
Merge pull request #1739 from jun-lv-17/fix-depthwiseconvint8-issue
Fix conv1d depthwise conv int8 calculation issue.
2021-11-02 11:39:32 +08:00
xiaying b1d923e76c Fix compile bug for bf16 when sse / neon is disabled 2021-11-02 11:34:14 +08:00
xiaying ed8a2da0b4 [MNN:Bugfix] Fix bug for CPURaster for fuse singleConvert of dim == 3 2021-11-02 10:56:35 +08:00
xiaying d199ee6f49 Fix compute small error for OnnxScatterND 2021-11-02 10:55:15 +08:00
xiaying 0fdb9d768f Add Clamp for fp32 -> fp16 2021-11-01 14:25:34 +08:00
jxt1234 77b1d0ed71
Merge pull request #1738 from no5-aaron-wu/dev_aaron_wu
increase compatibility of NCHW format for inputs[1](rois) in CPUROIPooling op
2021-10-30 17:26:44 +08:00
aaron-wu 9acad284fa fix(op): increase compatibility of NCHW format for inputs[1](rois) in CPUROIPooling op 2021-10-30 15:18:38 +08:00
jun.lv 0b299e951c Fix conv1d depthwise conv int8 calculation issue. 2021-10-29 18:58:58 +08:00
jxt1234 67f8bc94ef
Merge pull request #1735 from no5-aaron-wu/dev_aaron_wu
add onnx convert and unit test for ROIPooling OP
2021-10-29 14:43:22 +08:00
xiaying 8caa162f87 Fix ConvDilateFuse bug for width pad not equal height pad 2021-10-29 11:37:10 +08:00
wuhao fcd747ed7a feat(test): add unit test for ROIPooling 2021-10-29 10:51:59 +08:00
insta360 cca07fdf98 feat(convert): add ROIPoolingOnnx Convert(the onnx model file exported from torchvision.ops.roi_pool) 2021-10-29 10:26:05 +08:00
xiaying 19c2df11f5 [MNN:Bugfix] Fix compile bug for ios arm82 2021-10-25 11:10:44 +08:00
jxt1234 e121c1527a
Merge pull request #1718 from jokerz0624/acc/GridSample
improvement(GridSample): give areaRemain one better handle in Arm82
2021-10-25 10:56:14 +08:00
jxt1234 81c3d0f791
Merge pull request #1719 from DaydreamCoding/patch-9
fix MNN_BUILD_TORCH without MNN_BUILD_SHARED_LIBS
2021-10-25 10:55:51 +08:00
jxt1234 70cd0c5b27
Merge pull request #1724 from DaydreamCoding/patch-10
fix memory leak
2021-10-25 10:55:21 +08:00
jxt1234 410b1b38df
Merge pull request #1731 from jokerz0624/feat/arm82_fma_new
improvement(ConvWino): use fma to accelerate computation
2021-10-25 10:50:34 +08:00
Joker af9c543115 improvement(ConvWino): use fma to accelerate computation 2021-10-22 14:24:29 +08:00
xiaying da3688119d [MNN:Bugfix] Fix shape compute and content bug for batch > 1's rnngru 2021-10-20 11:56:16 +08:00
DaydreamCoding 0ec11813f0 fix memory leak 2021-10-15 13:39:20 +08:00
xiaying 7f50ae689d Fix zero shape bug for TensorArray 2021-10-14 15:00:01 +08:00
xiaying a469efe8ad Add MNNRevert2Buffer 2021-10-13 14:16:01 +08:00