- replace FreeImage with stb_image
- warn about unicode errors when compiling on Windows
- separate clang/gcc build script for android
- add default values in fbs
- optimize CPU conv / conv depthwise / deconv / deconv depthwise / lstm / sigmoid
- add sub support in eltwise
- add reciprocal / log1p / log in unary
- add zero like / select / set diff 1d
- add batch support for permute
- add training code
- fix metal error in dynamic separate storage type handling
- cpu & gpu
    - add ceil mode in pool
    - fix softmax with neg axis
- cpu
    - add unsqueeze op
    - optimize lstm
- gpu
    - add 5x5 winograd in metal
    - add batch support for winograd in opencl
- onnx
    - add concat / gather / shape / squeeze / unsqueeze
    - fix data type support in constant
- update resources and docs
- unify tensor width/height/channel/batch getters
- optimize several ops
- fix compile warnings and errors on Ubuntu
- some other bug fixes