I’m writing a generator for training images for Tesseract OCR.
When generating a training image for a new font for Tesseract OCR, what are the best values for:
- The DPI
- The font size in points
- Should the font be anti-aliased or not
- Should the bounding boxes fit snugly:
, or not: 
I found the answer to the 4th question – “Should the bounding boxes fit snugly”.
It seems that fitting the rectangles as much as possible gives much better results.
For the other 12 pts and 300 dpi will be good enough, as @Yaroslav suggested. I think anti-aliasing is better turned off.