- add support (/express)
- add tests
- add benchmarks with it (/benchmark/exprModels)
- Python
  - publish MNN engine and tools to PyPI (installable via pip)
  - available on Windows/macOS/Linux
- Engine/Converter
  - add support for per-op benchmarking
  - refactor optimizer by separating steps
- CPU
  - add support for Conv3D, Pool3D, ELU, ReverseSequence
  - fix ArgMax, Permute, Scale, BinaryOp, Slice, SliceTf
- OpenCL
  - add half transform in CPU
  - add broadcast support for binary ops
  - optimize Conv2D, Reshape, Eltwise, Gemm, etc.
- OpenGL
  - add sub and real div support for binary ops
  - add support for unary ops
  - optimize Conv2D, Reshape
- Vulkan
  - add max support for eltwise
- Metal
  - fix missing metallib problem
- Train/Quantization
- use express to refactor training code
- add NaN check-up
- add quantization support for ScaleAdd Op
- add binary to eltwise optimization
- add console logs for quantization tool
- improve documentation for quantization tool
- replace redundant dimension flags with dimension format
- optimize performance of TensorFlow Lite Quantized Convolution
- fix axis support for ONNX softmax
- fix compile error in get performance on Windows
- replace FreeImage with stb_image
- warn about unicode errors when compiling on Windows
- separate clang/gcc build scripts for Android
- add default values in fbs
- optimize CPU conv / conv depthwise / deconv / deconv depthwise / lstm / sigmoid
- add sub support in eltwise
- add reciprocal / log1p / log in unary
- add zero like / select / set diff 1d
- add batch support for permute
- add training code
- fix metal error in dynamic separate storage type handling
- cpu & gpu
  - add ceil mode in pool
  - fix softmax with negative axis
- cpu
  - add unsqueeze op
  - optimize lstm
- gpu
  - add 5x5 Winograd in Metal
  - add batch support for Winograd in OpenCL
- onnx
  - add concat / gather / shape / squeeze / unsqueeze
  - fix data type support in constant
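
Two of the entries above ("add ceil mode in pool" and the softmax negative-axis fix) refer to conventions that are easy to get wrong, so a minimal sketch may help. It is plain Python and illustrates the general conventions only; it is not taken from MNN's source. The function names `pool_output_size` and `normalize_axis` are hypothetical, and the clamp applied in ceil mode follows the rule used by some frameworks (for example PyTorch), which is an assumption about what a pooling op would do here.

```python
import math


def pool_output_size(in_size, kernel, stride, pad=0, ceil_mode=False):
    """Output length of a 1-D pooling window sweep (hypothetical helper).

    Floor mode drops a trailing partial window; ceil mode keeps it.
    """
    span = in_size + 2 * pad - kernel
    if ceil_mode:
        out = math.ceil(span / stride) + 1
        # Assumed convention: the last window must start inside the input
        # or the left padding, otherwise it is discarded.
        if (out - 1) * stride >= in_size + pad:
            out -= 1
    else:
        out = span // stride + 1
    return out


def normalize_axis(axis, rank):
    """Map a possibly negative axis to the range [0, rank) (hypothetical helper)."""
    if not -rank <= axis < rank:
        raise ValueError(f"axis {axis} out of range for rank {rank}")
    return axis + rank if axis < 0 else axis


if __name__ == "__main__":
    print(pool_output_size(7, kernel=2, stride=2))                  # 3
    print(pool_output_size(7, kernel=2, stride=2, ceil_mode=True))  # 4
    print(normalize_axis(-1, rank=4))                               # 3
```

With an input of length 7, a kernel of 2 and a stride of 2, floor mode yields 3 outputs while ceil mode keeps the trailing partial window and yields 4. A softmax over axis `-1` of a rank-4 tensor operates on axis 3.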