37 Commits

Author SHA1 Message Date
xiaying 34f7abce16 [PATCH 130/160] [Vulkan:Bugfix] Fix bug for other op use reshape 2020-03-22 19:02:12 +08:00
xiaying ff5d81ec22 [PATCH 129/160] [Vulkan:Refractor] Delete gemm16x16Half, replace by macro 2020-03-22 19:02:12 +08:00
xiaying 1c1ee5b2b3 [PATCH 128/160] [Vulkan:Feature] Support squeeze, expandim 2020-03-22 19:02:12 +08:00
xiaying cd8e4d17d9 [PATCH 126/160] [Vulkan:Speed] Optimize the pre/post treat of deconvolution 2020-03-22 19:02:12 +08:00
xiaying c4d8538862 [PATCH 125/160] [Vulkan:Speed] Optimize convolution's im2col and col2im's localsize 2020-03-22 19:02:12 +08:00
xiaying 7f12a224e5 [PATCH 124/160] [Vulkan:Refractor] Rename VulkanMatrixMultier as VulkanMatrixMultier4x4 2020-03-22 19:02:12 +08:00
xiaying c77d7c4ab3 [PATCH 120/160] [Vulkan:Feature] Support multi-input for deconvolution 2020-03-22 19:02:11 +08:00
xiaying 076194754f [PATCH 116/160] [Vulkan:Bugfix] Fix bug for pad error of batchtospacend and spacetobatchnd 2020-03-22 19:02:11 +08:00
xiaying 905d4332f2 [PATCH 115/160] [Vulkan:Refractor] Use common convolution and deconvolution pad 2020-03-22 19:02:11 +08:00
xiaying 4dec63d499 [PATCH 103/160] [Vulkan:Feature, MNN:Refract] Support Reduce for vulkan 2020-03-22 19:02:10 +08:00
xiaying 9c697a4f46 [PATCH 100/160] [Vulkan:Feature] Support broadcast for binary, opt image add/sub 2020-03-22 19:02:09 +08:00
xiaying 2d71a17c47 [PATCH 099/160] [Vulkan:Bugfix, Demo:Bugfix] Fix bug for p20 can't run vulkan, Fix bug for android demo compile 2020-03-22 19:02:09 +08:00
xiaying bcf5b703b1 [PATCH 097/160] [Vulkan:Bugfix] fix deconvolution-dw crash bug, add print for copy buffer 2020-03-22 19:02:09 +08:00
xiaying eb132b63b0 [PATCH 096/160] [Vulkan:Feature] Support nc4hw4 unary 2020-03-22 19:02:09 +08:00
xiaying 6037692a07 [PATCH 095/160] [Vulkan:Bugfix] Fix bug for nchw convert, which cast kh != kw's convolution error 2020-03-22 19:02:09 +08:00
xiaying 9697d157e0 [PATCH 093/160] [Vulkan:Feature] Support neg, sign, log, square 2020-03-22 19:02:09 +08:00
xiaying c024ac5243 [PATCH 091/160] [Vulkan:Bugfix] Fix bug for compute of MatMul 2020-03-22 19:02:08 +08:00
xiaying 86d7732ddc [PATCH 089/160] [Vulkan:Feature] Support Relu / Relu6 with buffer, add matmul 2020-03-22 19:02:08 +08:00
xiaying 1a49ff15f8 [PATCH 088/160] [Vulkan:Feature] Support multi-input convolution, remove unuseful shader 2020-03-22 19:02:08 +08:00
xiaying efcd54e695 [PATCH 083/160] [Vulkan:Bugfix] Fix compute bug for softmax buffer 2020-03-22 19:02:07 +08:00
xiaying 7f5fb2c414 [PATCH 079/160] [Vulkan:Feature] Support large feature map for convolution im2col+GEMM 2020-03-22 19:02:07 +08:00
xiaying 79640387e5 [PATCH 071/160] [Vulkan:Feature] Support idst quan and fp16 model 2020-03-22 19:02:06 +08:00
xiaying 3a3d56ab46 [PATCH 067/160] [Vulkan:Bugfix] Fix compile bug for gcc don't link dl default 2020-03-22 19:02:06 +08:00
xiaying 71bb5597b9 [PATCH 062/160] [Vulkan:Bugfix] Fix execution crash when not support, test will not crash 2020-03-22 19:02:05 +08:00
海境 90e06944db Update 2020-02-26 09:57:17 +08:00
Zhang 91b5ade49a Sync. Fix OpenGL related building issues. Build the whole suite on Android CI (#580)
* Sync code with latest internal version

* Update CMake

* Fix logging issues

* Fix OpenGL Building

* Bump CMakeLists version. Update Podspec

* Update MetalLib Lookup logic

* Fix Windows Build
2020-01-16 16:55:46 +08:00
Zhang 002ac367e4 Update 2019-12-27 22:16:57 +08:00
liqing d6b00d04f4 - build:
	- unify schema building in core and converter;
	- add more build script for android;
	- add linux build script for python;

- ops impl:
	- add floor mod support in binary;
	- use eltwise impl in add/max/sub/mul binary for optimization;
	- remove fake double support in cast;
	- fix 5d support for concat;
	- add adjX and adjY support for batch matmul;
	- optimize conv2d back prop filter;
	- add pad mode support for conv3d;
	- fix bug in conv2d & conv depthwise with very small feature map;
	- optimize binary without broadcast;
	- add data types support for gather;
	- add gather ND support;
	- use uint8 data type in gather v2;
	- add transpose support for matmul;
	- add matrix band part;
	- add dim != 4 support for padding, reshape & tensor convert;
	- add pad type support for pool3d;
	- make ops based on TensorFlow Lite quantization optional;
	- add all & any support for reduction;
	- use type in parameter as output type in reduction;
	- add int support for unary;
	- add variable weight support for conv2d;
	- fix conv2d depthwise weights initialization;
	- fix type support for transpose;
	- fix grad outputs count for reduce grad and reshape grad;
	- fix priorbox & detection output;
	- fix metal softmax error;

- python:
	- add runSessionWithCallBackInfo interface;
	- add max nodes limit (1400) for visualization tool;
	- fix save error in python3;
	- align default dim;

- convert:
	- add extra design for optimization;
	- add more post converting optimizers;
	- add caffe v1 weights blob support;
	- add cast, unary, conv transpose support for onnx model;
	- optimize batchnorm, conv with variable weights, prelu, reshape, slice, upsample for onnx model;
	- add cos/sin/atan/tan support for unary for tensorflow model;
	- add any/all support for reduction for tensorflow model;
	- add elu, conv3d, pool3d support for tensorflow model;
	- optimize argmax, batchnorm, concat, batch to space, conv with variable weights, prelu, slice for tensorflow model;

- others:
	- fix size computer lock;
	- fix thread pool deadlock;
	- add express & parameters in express;
	- rewrite blitter chooser without static map;
	- add tests for expr;
2019-10-29 13:37:26 +08:00
liqing 73ad3413cc - dynamic computation graph (beta)
	- add supports (/express)
	- add tests
	- add benchmarks with it (/benchmark/exprModels)
- Python
	- MNN engine and tools were submitted to pip
	- available on Windows/macOS/Linux
- Engine/Converter
	- add supports for each op benchmarking
	- refactor optimizer by separating steps
- CPU
	- add supports for Conv3D, Pool3D, ELU, ReverseSequence
	- fix ArgMax, Permute, Scale, BinaryOp, Slice, SliceTf
- OpenCL
	- add half transform in CPU
	- add broadcast supports for binary
	- optimize Conv2D, Reshape, Eltwise, Gemm, etc.
- OpenGL
	- add sub, real div supports for binary
	- add supports for unary
	- optimize Conv2D, Reshape
- Vulkan
	- add max supports for eltwise
- Metal
	- fix metallib missing problem
- Train/Quantization
	- use express to refactor training codes
2019-09-26 21:02:07 +08:00
liqing b995b25db9 beta 0.2.0.8
- add NaN check-up
- add quantification support for ScaleAdd Op
- add binary to eltwise optimization
- add console logs for quantization tool
- better document for quantization tool
- replace redundant dimension flags with dimension format
- optimize performance of TensorFlow Lite Quantized Convolution
- fix axis support for ONNX softmax
- fix get performance compile error on Windows
2019-08-22 20:13:46 +08:00
liqing ad759ebfae beta 0.2.0.1
- support both armv7/arm64 in podspec (pod version >= 1.5.0 required)
- refactor neg axis support
- fix memory overlap in de-conv
- fix CONVOLUTION_TILED_NUMBER spell error
- fix few warnings
- add binary / interp / permute / relu / reshape / softmax support and optimize conv for OpenGL backend
- add clean in nmake build script
2019-06-24 11:32:41 +08:00
liqing 0b085d6ffd beta 0.1.1.5
- cpu & gpu
	- add ceil mode in pool
	- fix softmax with neg axis
- cpu
	- add unsqueeze op
	- optimize lstm
- gpu
	- add 5x5 winograd in metal
	- add batch support for winograd in opencl
- onnx
	- add concat / gather / shape / squeeze / unsqueeze
	- fix data type support in constant
2019-06-05 10:45:59 +08:00
Sungmann Cho 455786f0dc Fix minor typos
1. bechmark -> benchmark
2. MMN -> MNN
3. nomalize -> normalize
4. paramater -> parameter
5. tflie -> tflite
2019-05-14 08:47:10 +09:00
nihuini b3e27f91a2 fix oops 2019-05-09 10:20:56 +08:00
nihui 195d439518 fix include guard oops 2019-05-06 12:39:25 +08:00
liqing 07e28c80d3 beta 0.1.1
- update resources and docs
- unite tensor's width/height/channel/batch getter
- optimize several ops
- fix compile warnings and errors on Ubuntu
- some other bug fixes
2019-05-05 20:27:57 +08:00
liqing 5551108af8 beta 0.1.0 2019-04-19 20:50:09 +08:00