cuixing158/yolo-tensorRT-cpp

A deployment-oriented quantization library for PC and Jetson: INT8 quantization for YOLOv3/v4/v5.


This project is a TensorRT C++ quantized implementation of YOLOv3/v4/v5!

Overview

This repository contains two projects: dll_detector, which builds the DLL / .so library, and a demo project that tests the library. For YOLOv3/v4, prepare the .cfg and .weights files in advance; for YOLOv5, prepare yolov5s.yaml and yolov5s.pt. The C++ code is well organized and worth studying as a reference, and the TensorRT quantization workflow is also instructive. The library is well suited for deployment on Windows 10, Ubuntu, and embedded Jetson platforms.

TensorRT Quantization Workflow

The quantization works as follows: first check whether a calibration table file already exists; if it does, it is read directly, otherwise the images under data/ are used for calibration to generate the table. readCalibrationCache is called first, then getBatch, and finally writeCalibrationCache; getBatch() is called multiple times during calibration, while the other two functions are called once each. All model files are converted to cfg/weights format, and parsing is done with custom TensorRT C++ API code.
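
The cache-then-calibrate flow described above can be sketched as a minimal calibrator class. This is only a sketch, assuming a TensorRT 6/7-style API (the same methods gain noexcept in TensorRT 8); the class layout and the loadNextBatchToDevice() helper are illustrative placeholders, not this project's actual code.

#include <NvInfer.h>
#include <cuda_runtime_api.h>
#include <fstream>
#include <iterator>
#include <string>
#include <vector>

class Int8EntropyCalibrator : public nvinfer1::IInt8EntropyCalibrator
{
public:
	Int8EntropyCalibrator(int batchSize, size_t inputBytes, std::string cacheFile)
		: mBatchSize(batchSize), mInputBytes(inputBytes), mCacheFile(std::move(cacheFile))
	{
		cudaMalloc(&mDeviceInput, mInputBytes);           // device buffer filled by getBatch()
	}
	~Int8EntropyCalibrator() override { cudaFree(mDeviceInput); }

	int getBatchSize() const override { return mBatchSize; }

	// Called repeatedly during calibration: copy the next preprocessed batch from
	// data/ into the device buffer; return false once all images are consumed.
	bool getBatch(void* bindings[], const char* names[], int nbBindings) override
	{
		(void)names; (void)nbBindings;
		if (!loadNextBatchToDevice(mDeviceInput))
			return false;
		bindings[0] = mDeviceInput;
		return true;
	}

	// Called once, before calibration: if the table file already exists, its content
	// is returned and the getBatch() loop is skipped entirely.
	const void* readCalibrationCache(std::size_t& length) override
	{
		std::ifstream in(mCacheFile, std::ios::binary);
		if (!in.good())
			return nullptr;                               // no table yet -> run calibration
		mCache.assign(std::istreambuf_iterator<char>(in), std::istreambuf_iterator<char>());
		length = mCache.size();
		return mCache.data();
	}

	// Called once, after calibration, to persist the freshly generated table.
	void writeCalibrationCache(const void* cache, std::size_t length) override
	{
		std::ofstream out(mCacheFile, std::ios::binary);
		out.write(static_cast<const char*>(cache), length);
	}

private:
	// Placeholder: fill dst with the next batch of preprocessed calibration images.
	bool loadNextBatchToDevice(void* dst) { (void)dst; return false; }

	int mBatchSize;
	size_t mInputBytes;
	std::string mCacheFile;
	void* mDeviceInput{nullptr};
	std::vector<char> mCache;
};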

Changelog

2020.9.27: TensorRT quantization progress. Reviewed the code down to the calibrator flow: an Int8EntropyCalibrator class is defined that publicly inherits nvinfer1::IInt8EntropyCalibrator from the TensorRT library and overrides the calibrator methods. Next step: evaluate how my own player/tennis-ball detector performs after quantization.
2020.9.28: Quantized the tennis player detection model; 10 ms per frame at 320×320. Why did the speed not improve? Is the calibration table generated as an intermediate artifact?
2020.10.9: Worked out how the quantization interfaces are called, so that quantized inference can be deployed for other models. For TensorRT performance benchmarks on the PC, see this project.
2020.10.10: Engine inference in this project runs at the width/height defined in the cfg file, not at the actual input image size. The project is suspended for now in favor of onnxruntime for inference, since onnxruntime already integrates the TensorRT execution provider / onnx-tensorrt; or see my face_jetson_pytorch.
2020.11.11: The quantization work returns to this repository's C++ TensorRT approach! Environment fixed to CUDA 10.2 + cuDNN 7.4.1 + VS2019.

  • yolov5s, yolov5m, yolov5l, yolov5x tutorial
  • yolov4, yolov4-tiny
  • yolov3, yolov3-tiny

Features

  • unequal net width and height
  • batch inference
  • FP32, FP16, and INT8 support
  • dynamic input size

WRAPPER

Prepare the pretrained .weights and .cfg model files.

Detector detector;
Config config;
detector.init(config);                     // load the model described by config

std::vector<cv::Mat> vec_image{cv::imread("demo.jpg")};   // input batch
std::vector<BatchResult> res;
detector.detect(vec_image, res);           // one BatchResult per input image

Windows 10

  • CUDA environment dependency: the TensorRT, CUDA, and cuDNN versions must match each other, e.g. TensorRT 6.0.1.5 + CUDA 10.1 + cuDNN 7.6.4.38, or TensorRT 7.1.3.4 + CUDA 11.0 + cuDNN 8.0, or TensorRT 7.0 + CUDA 10.2 + cuDNN 7.6.4.38

  • Software dependency: OpenCV 4, VS2015 (or another Visual Studio version)

  • build:

    open the MSVC sln/sln.sln file

    • dll project : builds the TRT YOLO detector DLL
    • demo project : tests the DLL on images

Ubuntu & L4T (Jetson)

The project generates the libdetector.so library and sample code. If you want to use libdetector.so in your own project, this CMake file may help you.

git clone https://github.com/enazoe/yolo-tensorrt.git
cd yolo-tensorrt/
mkdir build
cd build/
cmake ..
make
./yolo-trt

API

struct Config
{
	std::string file_model_cfg = "configs/yolov4.cfg";        // darknet network definition (.cfg)

	std::string file_model_weights = "configs/yolov4.weights"; // pretrained weights (.weights)

	float detect_thresh = 0.9;                                 // detection confidence threshold

	ModelType net_type = YOLOV4;                               // e.g. YOLOV3 / YOLOV4 / YOLOV5

	Precision inference_precison = INT8;                       // FP32, FP16 or INT8

	int gpu_id = 0;                                            // GPU device index

	std::string calibration_image_list_file_txt = "configs/calibration_images.txt"; // image list for INT8 calibration

	int n_max_batch = 4;                                       // maximum batch size
};

class API Detector
{
public:
	explicit Detector();
	~Detector();

	void init(const Config &config);

	void detect(const std::vector<cv::Mat> &mat_image,std::vector<BatchResult> &vec_batch_result);

private:
	Detector(const Detector &);
	const Detector &operator =(const Detector &);
	class Impl;
	Impl *_impl;
};
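
For reference, a minimal end-to-end sketch against the API above might look as follows. It assumes each element of a BatchResult carries an id, a confidence (prob), and a bounding box (rect), and it assumes a header name; check both against this repository's detector header before use.

#include "class_detector.h"     // assumed header name; adjust to this repo's actual header
#include <opencv2/opencv.hpp>
#include <vector>

int main()
{
	Config config;
	config.net_type = YOLOV4;
	config.file_model_cfg = "configs/yolov4.cfg";
	config.file_model_weights = "configs/yolov4.weights";
	config.inference_precison = INT8;                       // triggers calibration on first run
	config.calibration_image_list_file_txt = "configs/calibration_images.txt";

	Detector detector;
	detector.init(config);

	std::vector<cv::Mat> batch{ cv::imread("demo.jpg") };   // single-image batch
	std::vector<BatchResult> results;
	detector.detect(batch, results);

	for (const auto& det : results[0])                      // detections for image 0
		cv::rectangle(batch[0], det.rect, cv::Scalar(0, 255, 0), 2);
	cv::imwrite("demo_result.jpg", batch[0]);
	return 0;
}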

Some Quantization Background

For any real number quantized to an integer type (omitting the bias), the formula is:
RealWorldValue = StoredInteger × 2^(−FractionLength)
In MATLAB this can be expressed with fixed-point arithmetic; for example, pi can be quantized with the following code:

ntBP = numerictype(1,8,4);   % signed, 8-bit word length, 4-bit fraction length
x_BP = fi(pi,true,8)         % signed 8-bit fixed point; fraction length chosen automatically
pi_cal = double(x_BP.storedInteger)*2^(-x_BP.FractionLength)+x_BP.Bias % verify the quantization formula

yBP1 = quantize(x_BP,ntBP)   % requantize x_BP to the ntBP type

output:

x_BP = 
   3.156250000000000

          DataTypeMode: Fixed-point: binary point scaling
            Signedness: Signed
            WordLength: 8
        FractionLength: 5
pi_cal =
   3.156250000000000
yBP1 = 
   3.125000000000000

          DataTypeMode: Fixed-point: binary point scaling
            Signedness: Signed
            WordLength: 8
        FractionLength: 4
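
The same round trip can be written in plain C++. Below is a small sketch reproducing the MATLAB result for pi at FractionLength = 5: the stored integer is round(pi × 2^5) = 101, and 101 × 2^(−5) = 3.15625.

#include <algorithm>
#include <cmath>
#include <cstdint>
#include <cstdio>

int main()
{
	const double realValue = 3.14159265358979;       // pi
	const int fractionLength = 5;                    // MATLAB's best-precision choice for an 8-bit word
	const double scale = std::pow(2.0, fractionLength);

	// Quantize: scale, round, and clamp to the signed 8-bit range [-128, 127].
	const int8_t stored = static_cast<int8_t>(
		std::max(-128.0, std::min(127.0, std::round(realValue * scale))));

	// Dequantize back to a real value (bias omitted, as in the formula above).
	const double reconstructed = stored / scale;

	std::printf("stored = %d, reconstructed = %.6f\n", stored, reconstructed);
	// prints: stored = 101, reconstructed = 3.156250
	return 0;
}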

REFERENCE
