DNN - Text Detection and Recognition

Text detection and recognition nodes can detect where text is in an image and then provide the output with bounding box position and text string.

Pass the output of text detection into the input of text recognition to enable the full process.

See Deep Learning in OpenCV for information about supported frameworks and supported layers.

Some tested models with parameters:

Text Detection:

Model	Size (W x H)	Mean	Scale	RGB
DB_IC15_resnet50	1280 x 736	122.68, 116.67, 104.01	0.00392	true
DB_IC15_resnet18	1280 x 736	122.68, 116.67, 104.01	0.00392	true
DB_TD500_resnet50	736 x 736	122.68, 116.67, 104.01	0.00392	true
DB_TD500_resnet18	736 x 736	122.68, 116.67, 104.01	0.00392	true

Text Recognition:

Model	Vocabulary	Size (W x H)	Mean	Scale	RGB
crnn	alphabet 36	100 x 32	127.5, 127.5, 127.5	0.0078431	false
crnn cs	alphabet 94	100 x 32	127.5, 127.5, 127.5	0.0078431	true
crnn cs CN	alphabet 3944 Chinese characters	100 x 32	127.5, 127.5, 127.5	0.0078431	true

FlowCV Documentation