- Added Python Express API implemented with pbind11 - Added demos for Python Express API - Performance improvements for ARM64, ARMv8.2, x86. - README update.