20,000 Image Caption Data Of OCR In Natural Scenes, including Asian and European languages, a total of 14 languages, the collection environment includes shop plaques, stop signs, posters, road signs and other scenes, including a variety of shooting angles. The description language is English, which mainly describes the text arrangement, text content, color and other information.
For more details, please refer to the link: https://www.nexdata.ai/datasets/llm/1288?source=Github
20,000 pictures, 20,000 descriptions
Asian languages: Korean, Indonesian, Malay, Vietnamese, Thai, Chinese, Japanese European languages: French, German, Italian, Portuguese, Russian, Spanish, English
including store plaques, stop signs, posters, road signs, prompts and other scenes
including 14 languages, various natural scenes, and multiple shooting angles
image format is .jpg, text format is .txt
mobile phone, camera
English
in principle, 30~60 words, usually 3-5 sentences
text arrangement, text content, color, scene
the proportion of correctly labeled images is not less than 97%
Commercial License