MindSpore Golden Stick is a model compression algorithm set jointly designed and developed by Huawei's Noah team and Huawei's MindSpore team. The architecture diagram of MindSpore Golden Stick is shown in the figure below, which is divided into five parts:
- The underlying MindSpore Rewrite module provides the ability to modify the front-end network. Based on the interfaces provided by this module, algorithm developers can add, delete, query, and modify the nodes and topology of a MindSpore front-end network according to specific rules;
- Based on MindSpore Rewrite, MindSpore Golden Stick provides various types of algorithms, such as the SimQAT algorithm, the SLB quantization algorithm, and the SCOP pruning algorithm;
- At the layer above the algorithms, MindSpore Golden Stick also plans advanced technologies such as AMC (AutoML for Model Compression), NAS (Neural Architecture Search), and HAQ (Hardware-aware Automated Quantization). These features will be provided in the future;
- To help developers analyze and debug algorithms, MindSpore Golden Stick provides some tools, such as a visualization tool, a profiler tool, and a summary tool. These features will be provided in the future;
- In the outermost layer, MindSpore Golden Stick encapsulates a set of concise user interfaces.
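The graph-editing capability that MindSpore Rewrite exposes can be pictured with a small sketch. The `Node` and `Graph` classes below are illustrative stand-ins written for this document, not the real `mindspore.rewrite` API (which operates on `SymbolTree` and `Node` objects); they only mirror the add/delete/query operations described above.

```python
# Toy sketch of the kind of front-end graph editing MindSpore Rewrite
# enables. All classes here are hypothetical stand-ins, NOT the real
# mindspore.rewrite API.

class Node:
    def __init__(self, name, op):
        self.name = name  # unique node name
        self.op = op      # operator type, e.g. "Conv2d"

class Graph:
    def __init__(self):
        self.nodes = []   # nodes in topological order

    def insert_after(self, target, node):
        """Add: insert `node` right after the node named `target`."""
        idx = next(i for i, n in enumerate(self.nodes) if n.name == target)
        self.nodes.insert(idx + 1, node)

    def erase(self, name):
        """Delete: remove the node named `name` from the network."""
        self.nodes = [n for n in self.nodes if n.name != name]

    def find(self, op):
        """Query: return all nodes of a given operator type."""
        return [n for n in self.nodes if n.op == op]

g = Graph()
g.nodes = [Node("conv1", "Conv2d"), Node("bn1", "BatchNorm2d"), Node("relu1", "ReLU")]
g.insert_after("conv1", Node("fq1", "FakeQuant"))  # add a node
g.erase("relu1")                                   # delete a node
print([n.name for n in g.nodes])                   # ['conv1', 'fq1', 'bn1']
```

A compression algorithm built on such an interface never touches the user's network definition directly; it only manipulates the node list and topology, which is what makes algorithm logic reusable across networks.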
The architecture diagram is the overall picture of MindSpore Golden Stick, covering both the features implemented in the current version and the capabilities planned in the roadmap. Please refer to the release notes for the features available in the current version.
In addition to providing rich model compression algorithms, an important design concept of MindSpore Golden Stick is to provide users with a unified and concise experience across the wide variety of model compression algorithms in the industry, reducing the cost of applying them. MindSpore Golden Stick implements this philosophy through two initiatives:
- Unified algorithm interface design to reduce user application costs:

  There are many types of model compression algorithms, such as quantization-aware training, pruning, matrix decomposition, and knowledge distillation, and each type in turn contains various specific algorithms; for example, LSQ and PACT are both quantization-aware training algorithms. Different algorithms are often applied in different ways, which increases the learning cost for users. MindSpore Golden Stick sorts out and abstracts the algorithm application process and provides a unified set of algorithm application interfaces to minimize this learning cost. This also facilitates exploring advanced technologies such as AMC, NAS, and HAQ on top of the algorithm ecosystem.
- Front-end network modification capabilities to reduce algorithm development costs:

  Model compression algorithms are often designed or optimized for specific network structures. For example, quantization-aware training algorithms often insert fake-quantization nodes on the Conv2d, Conv2d + BatchNorm2d, or Conv2d + BatchNorm2d + ReLU structures in a network. MindSpore Golden Stick provides APIs for modifying the front-end network. Based on this capability, algorithm developers can formulate general network transform rules that implement the algorithm logic once, without reimplementing it for each specific network. In addition, MindSpore Golden Stick provides some debugging capabilities, including a visualization tool, a profiler tool, and a summary tool, aiming to help algorithm developers improve development and research efficiency and to help users find algorithms that meet their needs.
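A general network transform rule of the kind described above can be sketched in plain Python. The function below is a hypothetical illustration written for this document (the real implementation would use MindSpore Rewrite on the front-end network): it matches the Conv2d, Conv2d + BatchNorm2d, and Conv2d + BatchNorm2d + ReLU groups in a flat operator list and inserts a fake-quantization node after each match, independent of the concrete network.

```python
# Hedged sketch of a generic transform rule: insert a FakeQuant node after
# every Conv2d / Conv2d+BatchNorm2d / Conv2d+BatchNorm2d+ReLU group.
# A list of (name, op) pairs stands in for the real front-end network.

def insert_fake_quant(nodes):
    """nodes: list of (name, op) tuples in topological order."""
    patterns = [["Conv2d", "BatchNorm2d", "ReLU"],
                ["Conv2d", "BatchNorm2d"],
                ["Conv2d"]]  # try the longest pattern first
    out, i, k = [], 0, 0
    while i < len(nodes):
        matched = None
        for pat in patterns:
            if [op for _, op in nodes[i:i + len(pat)]] == pat:
                matched = pat
                break
        if matched:
            out.extend(nodes[i:i + len(matched)])
            k += 1
            out.append((f"fq{k}", "FakeQuant"))  # node inserted by the rule
            i += len(matched)
        else:
            out.append(nodes[i])  # unrelated ops pass through unchanged
            i += 1
    return out

net = [("conv1", "Conv2d"), ("bn1", "BatchNorm2d"), ("relu1", "ReLU"),
       ("conv2", "Conv2d"), ("fc", "Dense")]
print([op for _, op in insert_fake_quant(net)])
# ['Conv2d', 'BatchNorm2d', 'ReLU', 'FakeQuant', 'Conv2d', 'FakeQuant', 'Dense']
```

Because the rule is stated over operator patterns rather than over a particular model, the same few lines apply to any network containing those structures, which is exactly the development-cost saving the text describes.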
- Compression Phase

  Taking a quantization algorithm as an example, the compression phase mainly includes transforming the network into a fake-quantized network, quantization retraining or calibration, collecting quantization parameter statistics, quantizing the weights, and transforming the network into a truly quantized network.
- Deployment Phase

  The deployment phase is the process of running inference with the compressed network in the deployment environment. Since MindSpore does not support serialization of the front-end network, deployment also needs to call the corresponding algorithm interface to transform the network before loading the compressed checkpoint file. After the compressed checkpoint file is loaded, the flow is the same as the normal inference process.
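The fake-quantization step at the heart of the compression phase can be illustrated with its core arithmetic. The function below is a minimal sketch assuming simple per-tensor symmetric int8 quantization with a max-abs scale; real algorithms such as SimQAT learn or calibrate the scale instead, so treat the scale rule as an assumption made for this example.

```python
# Hedged sketch of fake quantization: values are quantized to int8 and
# immediately dequantized, so training sees the quantization error while
# the network still computes in float. The max-abs scale rule below is a
# simplifying assumption; real algorithms learn or calibrate the scale.

def fake_quant(values, num_bits=8):
    qmax = 2 ** (num_bits - 1) - 1                # 127 for int8
    scale = max(abs(v) for v in values) / qmax    # per-tensor scale
    quantized = [round(v / scale) for v in values]            # float -> int
    quantized = [max(-qmax - 1, min(qmax, q)) for q in quantized]  # clamp
    return [q * scale for q in quantized]                     # int -> float

weights = [0.5, -1.27, 0.0, 1.0]
fq = fake_quant(weights)
scale = 1.27 / 127
# rounding bounds the per-element error by half the quantization step
assert all(abs(a - b) <= scale / 2 + 1e-9 for a, b in zip(fq, weights))
```

During the compress phase the statistics gathered while running such fake-quantized operators (scales, zero points) are exactly the "quantization parameters" that are later baked into the truly quantized network.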
- For details about how to apply the MindSpore Golden Stick, see the detailed description and sample code in each algorithm section.
- For details about the "ms.export" step in the process, see Exporting MindIR Model.
- For details about the "MindSpore infer" step in the process, see MindSpore Inference Runtime.
Please refer to MindSpore Golden Stick Installation.
Take Simulated Quantization (SimQAT) as an example to demonstrate how to use MindSpore Golden Stick.
For each algorithm, see its Overview, Architecture, Workflow, APIs, and examples documents. The algorithm set is summarized below:

| Category | Algorithms |
|---|---|
| AutoCompress | (TBD) |
| Post-Training Quantization | PTQ, RoundToNearest |
| Quantization-Aware Training | SimQAT, SLB |
| Pruner | SCOP, uni_pruning (demo), LRP (demo) |
| Others | Ghost |
Please refer to MindSpore Golden Stick Model Deployment.
- MindSpore Slack developer communication platform
Contributions to MindSpore are welcome.