@phucnsp
okay.
Can you do a simple experiment? I guess it will work for Japanese dataset!
- upload your input data(jpg/png/pdf) to google drive.
- open that image using google docs , file with the same name of type google doc will be generated in your google drive. This file will additionally contain extracted information from that input image.
- Make a parser and extract relevant text.
Please let me know if it works.